Editor’s Notes


Authors and reviewers are reminded of the editorial 
transition that will have taken place by the time this 
issue appears. Any new business or general correspon- 
dence about manuscripts currently in the review pro- 
cess should now be directed to the new editor: Profes- 
sor Lee Sigelman, American Political Science Review, 
Department of Political Science, George Washington 
University, 2201 G Street N.W., Washington, DC 
20052. 

Correspondence about articles scheduled to appear 
in the December 2001 issue only should continue to be 
addressed to the current editorial office: apsr@ssc.msu.edu
or Professor Ada W. Finifter, American Political
Science Review, Department of Political Science,
Michigan State University, 303 South Kedzie Hall, East 
Lansing, MI 48824-1032. 

The information below about submissions of articles 
and books for review has been updated. There is also 
new information about the online version of the APSR 
in the section on “Electronic Access to the American 
Political Science Review.” 

Ada W. Finifter 
Editor 


INSTRUCTIONS TO CONTRIBUTORS 


General Considerations 

The American Political Science Review strives to publish 
scholarly research of exceptional merit, demonstrating 
the highest standards of excellence in conceptualiza- 
tion, exposition, methodology, and craftsmanship. Be- 
cause the Review reaches a diverse audience of scholars 
and practitioners, contributors should demonstrate 
how their analysis illuminates a significant research 
problem, or answers an important research question, of 
general interest in political science. For the same 
reason, authors should strive for a presentation that 
will be understandable to as many scholars as possible, 
consistent with the nature of their material. 

Since the Review publishes original research, authors 
should not submit articles containing tables, figures, or 
substantial amounts of text already published or forth- 
coming in other places, or that are included in other 
manuscripts submitted for review, whether to book 
publishers or periodicals (including online journals), or 
are otherwise committed. In many such cases, subse- 
quent publication of this material would violate the 
copyright of the other publisher. The APSR also does 
not consider papers that are currently under review at other
journals or that duplicate or overlap with parts of 
larger manuscripts that have been submitted to other 
publishers (including publishers of both books and 
periodicals). Since space is at a premium, submission of 
articles substantially similar to manuscripts submitted 
or published elsewhere, or to part of a book or other 
larger work, is also strongly discouraged. If there is any 
doubt about whether these policies would apply to a
particular case, the author should discuss any such 
publications related to a submission in a cover letter to 
the Editor. The Editor should also be notified of any 
related submissions to other publishers, whether for 
book or periodical publication, that occur while an 
article is under review at the APSR and that would fall 
within the scope of this policy. Copies of related 
publications may be requested. 

For manuscripts containing quantitative evidence 
and analysis, authors should describe their empirical 
procedures in sufficient detail to permit reviewers to 
understand and evaluate what has been done and, in 
the event the article is accepted for publication, to 
permit other scholars to carry out similar analyses on 
other data sets. For example, for surveys, at the least, 
sampling procedures, response rates, and question 
wordings should be given; authors are encouraged to 
calculate response rates according to one of the stan- 
dard formulas given in The American Association for 
Public Opinion Research, Standard Definitions: Final 
Dispositions of Case Codes and Outcome Rates for RDD 
Telephone Surveys and In-Person Household Surveys
(Ann Arbor, MI: AAPOR, 1998). This document is
available on the Internet at <http://www.aapor.org/
ethics/stddef.html>. For experiments, provide full de- 
scriptions of experimental protocols, methods of sub- 
ject recruitment and selection, information about any 
payments and debriefing procedures, and other rele- 
vant details. It is desirable for articles to be self- 
contained, and authors should not refer readers to 
other publications for descriptions of these basic re- 
search procedures. 

Please indicate variables included in empirical anal- 
yses by capitalizing the first word in the variable name 
and italicizing the entire variable name the first time 
each is mentioned in the text. Authors should also be 
consistent in using the same names for variables in text 
and tables, and avoid overuse of acronyms and com- 
puter abbreviations when discussing variables in the 
text. All variables appearing in tables should have been 
mentioned in the text and the reason for their inclusion 
in the analysis discussed. 

As part of the review process, authors may be asked to 
submit additional documentation if procedures are not 
sufficiently clear; the review process works most efficiently
if such information is given in the initial submission. If 
readers are advised that additional information is avail- 
able from the author, printed copies of that information 
should be submitted with the manuscript. If the amount 
of this supplementary information is extensive, please 
inquire about alternate procedures. 

The APSR uses a double-blind review process. Authors 
should follow the guidelines for preparing anonymous
copies in the “Specific Procedures” section below.


Manuscripts that are largely or entirely critiques or 
commentaries on previously published articles will be 
reviewed using the same general procedures as for 
regular articles with one exception. In addition to the
usual number of reviewers, such manuscripts will also 
be sent to the scholar(s) whose work is being critiqued, 
in the same anonymous form as they are sent to other 
reviewers. Comments from the original author(s) to 
the editor will be invited as a supplement to the advice 
of regular reviewers. There are several purposes for 
this notice to the original author(s): (1) to encourage 
review of the details of analyses or research procedures 
that might occasionally escape the notice of disinterested
reviewers; (2) to enable prompt publication of
critiques by supplying critiqued authors with early 
notice of their existence and, therefore, more adequate 
time to reply; (3) as a courtesy to critiqued authors. 
Authors of critiques should therefore send as many 
additional copies of their manuscripts as will be re- 
quired for this purpose. 

Manuscripts being submitted for publication should 
be sent to Professor Lee Sigelman, American Political 
Science Review, Department of Political Science, 
George Washington University, 2201 G Street N.W., 
Washington, DC 20052. Correspondence concerning 
manuscripts under review may be sent to the same 
address. 


Manuscript Formatting 


Manuscripts must not be longer than 45 pages includ- 
ing text, all tables and figures, notes, references, and 
appendices. The page size guideline is based on the
U.S. standard 8.5 X 11 inch paper and must be adjusted
accordingly by authors who are submitting manuscripts
printed on longer paper. Font size must be at least 11
point for all parts of the paper. The entire paper must be 
double-spaced, with the sole exception of tables for 
which double-spacing would require a second page 
otherwise not needed. All pages should be numbered 
in one sequence, and text should be formatted using a 
normal single column no wider than 6½ inches, as is
typical for manuscripts (rather than the double-column 
format of the published version of the Review), and 
printed on one side of the page only. Include an 
abstract of no more than 150 words. The American 
Political Science Review style of embedded citations 
should be used, and there must be a separate list of 
references at the end of the article. Please do not use 
footnotes for simple citations. These specifications are 
designed to make it easier for reviewers to read and 
evaluate papers. Papers not adhering to these guide- 
lines are subject to being rejected without review. 

For submission and review purposes, authors are 
encouraged to place footnotes at the bottom of the 
pages instead of using endnotes and may locate tables 
and figures (on separate pages and only one to a page) 
approximately where they fall in the text. However, 
manuscripts accepted for publication must be submit- 
ted with endnotes, and with tables and figures on 
separate pages at the back of the manuscript with 
standard indications of text placement, e.g., [Table 3 
about here]. In deciding how to format your initial 
submission, please consider the necessity of making 
these changes if your paper is accepted. Authors of 
accepted manuscripts are also required to submit camera-ready
copy of graphs or other types of figures.
Instructions will be provided. 

For specific formatting style of citations and refer- 
ences, please refer to any articles in the most recent 
issue of the Review. For unusual style or formatting
issues, authors may consult The Chicago Manual of
Style, 14th edition. For review purposes, we do not
require that citations and references be in specific
APSR format, although some generally accepted for- 
mat should be used, and all citation and reference 
information should be provided. 


Specific Procedures 


Please follow these specific procedures for submission: 
1. Submit five copies of manuscripts. Please review all
pages of all copies to make sure all copies contain all
tables, figures, appendices, and bibliography mentioned
in the manuscript and that all pages are legible.
2. To comply with the Review’s policy of double- 
blind peer reviews, only one of the five copies 
submitted should be fully identified as to author- 
ship and four should be in anonymous format. 
3. For anonymous copies, if it is important to the 
development of the paper that an author’s previous 
publications be cited, this should be done in a way 
that does not make the authorship of the submitted 
paper obvious. This is usually most easily accom- 
plished by referring to oneself in the third person 
and including normal references to the work cited in 
the list of references. Under no circumstances 
should an author’s prior publications be included in 
the bibliography in their normal alphabetical loca- 
tion but with the author’s name deleted. Assuming 
that text references to an author's previous work are 
in third person, full citations should be included as 
usual in the bibliography. Use of other procedures to 
render manuscripts anonymous should be discussed 
with the Editor prior to submission. Authors also 
should not thank colleagues in notes or elsewhere in 
the body of the paper or mention institution names, 
web page addresses, or other potentially identifying 
information. All acknowledgements must appear on 
the title page of the identified copy only. Manuscripts 
that are not judged anonymous will not be reviewed.
4. Cover pages: The first page of the four anonymous
copies should contain only the title and an
abstract of no more than 150 words. The first page 
of the identified copy should contain (a) the 
name, academic rank, institutional affiliation, and 
contact information (address, telephone, fax, e- 
mail) for all authors; (b) in the case of multiple 
authors, an indication of the author who will 
receive correspondence; (c) any relevant citations 
to the author’s previous work that have been 
Editor’s Notes


With this last issue of my six years as editor of the 
APSR, I am happy to join the ranks of former editors. 
This has been a challenging post, and many people 
contributed to the effort. I thank my editorial board for 
their dedication and support and for sharing their 
extensive knowledge of political science research, the 
submitting authors for their trust and patience with our 
sometimes lengthy review process, the reviewers for 
their generosity and advice, the interns for their very 
able work in processing manuscripts, the Book Review 
editor and his staff for their strong contributions in 
bringing a wider body of work to the attention of our 
readers, and the editorial staff for their dedication, 
efficiency, and attention to the myriad details involved 
in turning our daily stack of mail into one of the most 
highly respected journals in the discipline. 

The APSR is the most broadly based of all political 
science journals. The papers received during my edi- 
torship came from 2,380 different authors and were 
reviewed by 2,126 different reviewers. These groups 
overlap to only a limited degree: 3,690 individuals were 
involved in one or the other capacity. More than 1,300 
scholars devoted their time to review for the APSR 
even though they did not submit manuscripts over this 
same period. The work of 375 scholars was published in 
287 articles. At the back of this issue we list those who 
reviewed during the past year. As is customary in the 
December issue of a volume, we also include the 
annual index. A comprehensive index for 1996-2001 
will continue to be available at the web site,
http://www.ssc.msu.edu/~APSR for the next few months.


Ada W. Finifter 
Editor 


INSTRUCTIONS TO CONTRIBUTORS 


General Considerations 


The American Political Science Review (APSR) strives
to publish scholarly research of exceptional merit, 
focusing on important issues and demonstrating the 
highest standards of excellence in conceptualization, 
exposition, methodology, and craftsmanship. Because 
the APSR reaches a diverse audience of scholars and 
practitioners, authors must demonstrate how their 
analysis illuminates a significant research problem, or
answers an important research question, of general 
interest in political science. For the same reason, 
authors must strive for a presentation that will be 
understandable to as many scholars as possible, consis- 
tent with the nature of their material. The APSR 
publishes original research. Therefore, authors should 
not submit articles containing tables, figures, or sub- 
stantial amounts of text that already have been pub- 
lished or are forthcoming in other places, or which are 
included in other manuscripts submitted for review to 
book publishers or periodicals (including online journals)
or otherwise committed. In many such cases,
subsequent publication of this material would violate 
the copyright of the other publisher. The APSR also 
does not consider papers that are currently under 
review at other journals or duplicate or overlap with 
parts of larger manuscripts that have been submitted to 
other publishers (including publishers of both books 
and periodicals). Submission of manuscripts substan- 
tially similar to those submitted or published else- 
where, or to part of a book or other larger work, is also 
strongly discouraged. If you have any questions about 
whether these policies apply in your particular case, 
you should discuss any such publications related to a 
submission in a cover letter to the Editor. You should 
also notify the Editor of any related submissions to 
other publishers, whether for book or periodical pub- 
lication, that occur while a manuscript is under review 
at the APSR and which would fall within the scope of 
this policy. The Editor may request copies of related 
publications. 

If your manuscript contains quantitative evidence 
and analysis, you should describe your procedures in 
sufficient detail to permit reviewers to understand and 
evaluate what has been done and, in the event the 
article is accepted for publication, to permit other 
scholars to carry out similar analyses on other data sets. 
For example, for surveys, at the least, sampling proce- 
dures, response rates, and question wordings should be 
given; you should calculate response rates according to 
one of the standard formulas given by the American 
Association for Public Opinion Research, Standard 
Definitions: Final Dispositions of Case Codes and Out- 
come Rates for RDD Telephone Surveys and In-Person 
Household Surveys (Ann Arbor, MI: AAPOR, 1998). 
This document is available on the Internet at http:// 
www.aapor.org/ethics/stddef.html. For experiments, 
provide full descriptions of experimental protocols, 
methods of subject recruitment and selection, subject 
payments and debriefing procedures, and so on. It is 
desirable for articles to be self-contained, and you 
should not refer readers to other publications for 
descriptions of these basic research procedures. 

Please indicate variables included in statistical anal- 
yses by capitalizing the first word in the variable name 
and italicizing the entire variable name the first time 
each is mentioned in the text. You should also use the 
same names for variables in text and tables, and 
wherever possible should avoid use of acronyms and 
computer abbreviations when discussing variables in 
the text. All variables appearing in tables should have 
been mentioned in the text and the reason for their 
inclusion discussed. 

As part of the review process, you may be asked to 
submit additional documentation if procedures are not 
sufficiently clear; the review process works most effi- 
ciently if such information is given in the initial sub- 
mission. If you advise readers that additional informa- 
tion is available, you should submit printed copies of 
that information with the manuscript. If the amount of
this supplementary information is extensive, please 
inquire about alternate procedures. 

The APSR uses a double-blind review process. You 
should follow the guidelines for preparing anonymous 
copies in the “Specific Procedures” section below. 

Manuscripts that are largely or entirely critiques or 
commentaries on previously published articles will be 
reviewed using the same general procedures as for 
other manuscripts, with one exception. In addition to 
the usual number of reviewers, such manuscripts will 
also be sent to the scholar(s) whose work is being 
criticized, in the same anonymous form as they are sent 
to other reviewers. Comments from the original au- 
thor(s) to the Editor will be invited as a supplement to 
the advice of regular reviewers. This notice to the 
original author(s) is intended: (1) to encourage review 
of the details of analyses or research procedures that 
might escape the notice of disinterested reviewers; (2) 
to enable prompt publication of critiques by supplying 
criticized authors with early notice of their existence 
and, therefore, more adequate time to reply; (3) as a 
courtesy to criticized authors. If you submit such a 
manuscript, you should therefore send as many addi- 
tional copies of your manuscript as will be required for 
this purpose. 

Manuscripts being submitted for publication should 
be sent to Lee Sigelman, Editor, American Political 
Science Review, Department of Political Science, The 
George Washington University, 2201 G Street N.W., 
Room 507, Washington, DC 20052. Correspondence 
concerning manuscripts under review may be sent to 
the same address, or e-mailed to apsr@gwu.edu. 


Manuscript Formatting 


Manuscripts must not be longer than 45 pages includ- 
ing text, all tables and figures, notes, references, and 
appendices. This page size guideline is based on the 
U.S. standard 8.5 X 11 inch paper; if you are submitting
a manuscript printed on longer paper, you must
adjust accordingly. Font size must be at least 11 point 
for all parts of the paper, including notes and refer- 
ences. The entire paper must be double-spaced, with 
the sole exception of tables for which double-spacing 
would require a second page otherwise not needed. All 
pages should be numbered in one sequence, and text 
should be formatted using a normal single column no 
wider than 6½ inches, as is typical for manuscripts
(rather than the double-column format of the pub- 
lished version of the APSR), and printed on one side of 
the page only. Include an abstract of no more than 150 
words. The APSR style of embedded citations should 
be used, and there must be a separate list of references 
at the end of the manuscript. Do not use footnotes for 
simple citations. These specifications are designed to 
make it easier for reviewers to read and evaluate 
papers. Papers not adhering to these guidelines are 
subject to being rejected without review. 

For submission and review purposes, you are en- 
couraged to place footnotes at the bottom of the pages 
instead of using endnotes and to locate tables and 
figures (on separate pages and only one to a page)
approximately where they fall in the text. However, 
manuscripts accepted for publication must be submit- 
ted with endnotes, and with tables and figures on 
separate pages at the back of the manuscript with 
standard indications of text placement, e.g., [Table 3 
about here]. In deciding how to format your initial 
submission, please consider the necessity of making 
these changes if your paper is accepted. If your paper is
accepted for publication, you will also be required to 
submit camera-ready copy of graphs or other types of 
figures. Instructions will be provided. 

For specific formatting style of citations and refer- 
ences, please refer to any articles in the most recent 
issue of the APSR. For unusual style or formatting 
issues, you should consult the latest edition of The 
Chicago Manual of Style. For review purposes, citations 
and references need not be in specific APSR format, 
although some generally accepted format should be 
used, and all citation and reference information should 
be provided. 


Specific Procedures 
Please follow these specific procedures for submission: 


1. You are invited to submit a list of scholars who 
would be appropriate reviewers of your manuscript. 
The Editor will refer to this list in selecting review- 
ers, though there obviously can be no guarantee that 
those you suggest will actually be chosen. Do not 
include on this list anyone who has already com- 
mented on your paper or an earlier version of it or 
any of your current or recent collaborators, institu- 
tional colleagues, mentors, students, or close 
friends. 

2. Submit five copies of manuscripts and a diskette 
containing the word-processed version of the manu- 
script. Please ensure that the paper and diskette 
versions you submit are identical; the diskette ver- 
sion should be of the fully identified copy (see 
below). Please review all pages of all copies to make 
sure all copies contain all tables, figures, appendices, 
and bibliography mentioned in the manuscript and 
that all pages are legible. Label the diskette clearly 
with the (first) author’s name, the title of the 
manuscript (in abridged form if need be), and 
identify the word processing program and operating 
system. 

3. To comply with the APSR’s procedure of double- 
blind peer reviews, only one of the five copies 
submitted should be fully identified as to authorship 
and four should be in anonymous format. 

4. For anonymous copies, if it is important to the 
development of the paper that your previous publi- 
cations be cited, please do this in a way that does not 
make the authorship of the submitted paper obvi- 
ous. This is usually most easily accomplished by 
referring to yourself in the third person and includ- 
ing normal references to the work cited in the list of 
references. Under no circumstances should your 
prior publications be included in the bibliography in 
their normal alphabetical location but with your
name deleted. Assuming that text references to your 
previous work are in the third person, you should 
include full citations as usual in the bibliography. 
Please discuss use of other procedures to render 
manuscripts anonymous with the Editor prior to 
submission. You should not thank colleagues in 
notes or elsewhere in the body of the paper or 
mention institution names, web page addresses, or 
other potentially identifying information. All acknowledgments
must appear on the title page of the
identified copy only. Manuscripts that are judged not
anonymous will not be reviewed. 

5. The first page of the four anonymous copies should 
contain only the title and an abstract of no more 
than 150 words. The first page of the identified copy 
should contain (a) the name, academic rank, insti- 
tutional affiliation, and contact information (mailing 
address, telephone, fax, e-mail address) for all au- 
thors; (b) in the case of multiple authors, an indica- 
tion of the author who will receive correspondence; 
(c) any relevant citations to your previous work that 
have been omitted from the anonymous copies; and 
(d) acknowledgments, including the names of any- 
one who has provided comments on the manuscript. 
If the identified copy contains any unique references 
or is worded differently in any way, please mark this 
copy with “Contains author citations” at the top of 
the first page. 


No copies of submitted manuscripts can be returned. 


ELECTRONIC ACCESS TO THE 
AMERICAN POLITICAL SCIENCE REVIEW 


Members of the American Political Science Associa- 
tion may download pdf files of articles and book 
reviews in current issues of the APSR at the Oxford 
University Press electronic journals web site. Logon is 
via http://www.apsanet.org/members/prv/apsr/ where 
name and APSA member ID number must be entered. 
Nonmembers may view the tables of contents and 
article abstracts, and search the contents, at http:// 
www.apsr.cupjournals.org/, but will be unable to 
download or view the text of articles or book reviews 
except for one issue made available as a free sample. 
This collection begins with Volume 95, Number 1, 
the March 2001 issue. In the future, issues beginning 
with Volume 92 (1998) are expected to be available. 
Oxford will send tables of contents as new issues go 
on line to anyone (nonmembers as well as members) 
who requests them at http://apsr.oupjournals.org/ 
subscriptions/. 

For back issues, full text of the APSR is available in 
several electronic formats and through several vendors. 

Except for the last three years (as an annually 
“moving wall”), back issues of the APSR beginning with 
Volume 1, Number 1 (November 1906) are available 
online through JSTOR (http://www.jstor.org/). At
present, JSTOR’s complete journal collection is available
only via institutional subscription, e.g., through
many college and university libraries. For APSA mem- 
bers who do not have access to an institutional sub- 
scription to JSTOR, individual subscriptions to its 
APSR content are available. Please contact Member
Services at APSA for further information, including 
annual subscription fees. 

Full text access to current issues of both the APSR 
and PS is also available online by library subscription 
from a number of database vendors. Currently, these 
include University Microfilms Inc. (UMI) (via its CD- 
ROMs General Periodicals Online and Social Science 
Index, and the online database ProQuest Direct), 
Online Computer Library Center (OCLC) (through its 
online database First Search as well as on CD-ROMs
and magnetic tape), and the Information Access Com- 
pany (IAC) (through its products Expanded Academic 
Index, InfoTrac, and several online services [see be- 
low]). Others may be added from time to time. 

APSR is also available on databases through six online 
services: Datastar (Datastar), Business Library (Dow 
Jones), Cognito (IAC), Encarta Online Library (IAC), 
IAC Business (Dialog), and Newsearch (Dialog). 

The editorial office of the APSR is not involved in the 
subscription process to either JSTOR for back issues or 
the other vendors for current issues. Please contact 
APSA, your reference librarian, or the database vendor 
for further information about availability. 


BOOK REVIEWS 


All books for review should be sent directly to the 
APSR book review editors, Susan Bickford and Greg 
McAvoy. The address is: Susan Bickford, American 
Political Science Review Book Review Co-Editor, 
Department of Political Science, University of 
North Carolina at Chapel Hill, CB# 3265, Chapel Hill, 
NC 27599-3265. E-mail: bickford@email.unc.edu;
gemcavoy@uncg.edu. 

If you are the author of a book you wish to be 
considered for review, please ask your publisher to 
send a copy to the APSR book review editors per the 
mailing instructions above. If you are interested in 
reviewing books, please send your vita to Professor 
Bickford at the address above. 


OTHER CORRESPONDENCE 


The American Political Science Association’s address, 
telephone and fax are: 1527 New Hampshire Avenue, 
NW, Washington, DC 20036, (202) 483-2512 (voice), 
and (202) 483-2657 (fax). E-mail: apsa@apsanet.org. 
Please direct correspondence as follows:


Information, including news and notes, for PS 
Dr. Robert J-P. Hauck, Editor, PS 
E-mail: rhauck@apsanet.org 


Circulation and subscription correspondence (domes- 
tic claims for nonreceipt of issues must be made within 
four months of the month of publication; overseas 
claims, eight months) 
Elizabeth Weaver Engel, Director of Member 
Services 
E-mail: membership@apsanet.org


Reprint permissions
Ed Lamb
E-mail: elamb@apsanet.org


Advertising information and rates 
Laura Barrantes, Advertising Manager 
E-mail: lbarrantes@apsanet.org


EXPEDITING REQUESTS FOR COPYING 
APSR AND PS ARTICLES FOR CLASS USE 
AND OTHER PURPOSES 


Class Use 


The Comprehensive Publisher Photocopy Agreement 
between APSA and the Copyright Clearance Center 
(CCC) permits bookstores and copy centers to receive 
expedited clearance to copy articles from the APSR 
and PS in compliance with the Association’s policies 
and applicable fees. The general fee for articles is 75 
cents per copy. However, current Association policy 
levies no fee for the first ten copies of a printed article,
whether in course packs or on reserve. Smaller classes
that rely heavily on articles (i.e., upper-level undergraduate
and graduate classes) can take advantage of
this provision, and faculty ordering ten or fewer course 
packs should bring it to the attention of course pack 
providers. APSA policy also permits free use of elec- 
tronic library reserve with no limit on the number of 
students who can access the electronic reserve. Both 
large and small classes that rely on these articles can 
take advantage of this provision. The Copyright Clear- 
ance Center’s address, telephone, and fax are: 222 
Rosewood Drive, Danvers, MA 01923, (978) 750-8400 
(voice), and (978) 750-4474 (fax). This agreement 
pertains only to the reproduction and distribution of 
APSA materials as hard copies (e.g., photocopies, 
microfilm, and microfiche). 

The Association of American Publishers (AAP) has 
created a standardized form for college faculty to 
submit to a copy center or bookstore to request 
copyrighted material for course packs. The form is 
available through the CCC, which will handle copyright 
permissions. 

APSA also has a separate agreement pertaining to 
CCC’s Academic E-Reserve Service. This agreement
allows electronic access for students and instructors of 
a designated class at a designated institution for a 
specified article or set of articles in electronic format. 
Access is by password for the duration of a class. 
Please contact your librarian, CCC, or the APSA 
Reprints Department for further information. 


APSR Authors 


If you are the author of an APSR article, you may use 
your article in course packs or other printed materials 
without payment of royalty fees and you may post it at 
personal or institutional web sites as long as the APSA 
copyright notice is included. 


Other Uses of APSA-Copyrighted Materials 


For any further copyright issues, please contact the 
APSA Reprints Department. 

Articles appearing in the APSR before June 1953 were 
ergy Information Abstracts; Environmental Abstracts; 
Historical Abstracts; Index of Economic Articles; Infor- 
mation Service Bulletin, International Index, Interna- 
Recently Published Articles; Reference Sources; Social 
American History. Some of these sources may be avail- 
able in electronic form through local public or educa- 
tional libraries. Microfilm of the APSR, beginning with 
Volume 1, and the index of the Review through 1969 
are available through University Microfilms Inc., 300 
North Zeeb Road, Ann Arbor, MI 48106 (www.umi.com). 
The Cumulative Index to the American Political 
through the American Political Science Association. 


Measurement Validity: A Shared Standard for Qualitative and Quantitative Research 


ROBERT ADCOCK and DAVID COLLIER University of California, Berkeley 

Scholars routinely make claims that presuppose the validity of the observations and measurements that 
operationalize their concepts. Yet, despite recent advances in political science methods, surprisingly 
little attention has been devoted to measurement validity. We address this gap by exploring four themes. 
First, we seek to establish a shared framework that allows quantitative and qualitative scholars to assess 
more effectively, and communicate about, issues of valid measurement. Second, we underscore the need to 
draw a clear distinction between measurement issues and disputes about concepts. Third, we discuss the 
contextual specificity of measurement claims, exploring a variety of measurement strategies that seek to 
combine generality and validity by devoting greater attention to context. Fourth, we address the proliferation 
of terms for alternative measurement validation procedures and offer an account of the three main types of 
validation most relevant to political scientists. 


Researchers routinely make complex choices 
about linking concepts to observations, that is, 
about connecting ideas with facts. These choices 
raise the basic question of measurement validity: Do 
the observations meaningfully capture the ideas con- 
tained in the concepts? We will explore the meaning of 
this question as well as procedures for answering it. In 
the process we seek to formulate a methodological 
standard that can be applied in both qualitative and 
quantitative research. 

Measurement validity is specifically concerned with 
whether operationalization and the scoring of cases 
adequately reflect the concept the researcher seeks to 
measure. This is one aspect of the broader set of 
analytic tasks that King, Keohane, and Verba (1994, 
chap. 2) call “descriptive inference,” which also encom- 
passes, for example, inferences from samples to popu- 
lations. Measurement validity is distinct from the va- 
lidity of “causal inference” (chap. 3), which Cook and 
Campbell (1979) further differentiate into internal and 
external validity.1 Although measurement validity is 
interconnected with causal inference, it stands as an 
important methodological topic in its own right. 

New attention to measurement validity is overdue in 
political science. While there has been an ongoing 
concern with applying various tools of measurement 
validation (Berry et al. 1998; Bollen 1993; Elkins 2000; 
Hill, Hanna, and Shafqat 1997; Schrodt and Gerner 
1994), no major statement on this topic has appeared 
since Zeller and Carmines (1980) and Bollen (1989). 
Although King, Keohane, and Verba (1994, 25, 152-5) 
cover many topics with remarkable thoroughness, they 
devote only brief attention to measurement validity. 
New thinking about measurement, such as the idea of 
measurement as theory testing (Jacoby 1991, 1999), has 
not been framed in terms of validity. 


Robert Adcock (adcockr@uclink4.berkeley.edu) is a Ph.D. candidate, 
Department of Political Science, and David Collier 
(dcollier@socrates.berkeley.edu) is Professor of Political Science, 
University of California, Berkeley, CA 94720-1950. 

Among the many colleagues who have provided helpful comments 
on this article, we especially thank Christopher Achen, Kenneth 
Bollen, Henry Brady, Edward Carmines, Rubette Cowan, Paul Dosh, 
Zachary Elkins, John Gerring, Kenneth Greene, Ernst Haas, Edward 


Seawright, Jeff Sluyter, Richard Snyder, Ruth Stanley, Laura Stoker, 
and three anonymous reviewers. The usual caveats apply. Robert 
Adcock’s work on this project was supported by a National Science 
Foundation Graduate Fellowship. 


the cases being studied, 
to a broader set of cases (Cook and Campbell 1979, 50-9, 70-80). 

Four important problems in political science re- 
search can be addressed through renewed attention to 
measurement validity. The first is the challenge of 
establishing shared standards for quantitative and qual- 
itative scholars, a topic that has been widely discussed 
(King, Keohane, and Verba 1994; see also Brady and 
Collier 2001; George and Bennett n.d.). We believe the 
skepticism with which qualitative and quantitative re- 
searchers sometimes view each other's measurement 
tools does not arise from irreconcilable methodological 
differences. Indeed, substantial progress can be made 
in formulating shared standards for assessing measure- 
ment validity. The literature on this topic has focused 
almost entirely on quantitative research, however, 
rather than on integrating the two traditions. We 
propose a framework that yields standards for mea- 
surement validation and we illustrate how these apply 
to both approaches. Many of our quantitative and 
qualitative examples are drawn from recent compara- 
tive work on democracy, a literature in which both 
groups of researchers have addressed similar issues. 
This literature provides an opportunity to identify 
parallel concerns about validity as well as differences in 
specific practices. 

A second problem concerns the relation between 
measurement validity and disputes about the meaning 
of concepts. The clarification and refinement of con- 
cepts is a fundamental task in political science, and 
carefully developed concepts are, in turn, a major 
prerequisite for meaningful discussions of measure- 
ment validity. Yet, we argue that disputes about con- 
cepts involve different issues from disputes about mea- 
surement validity. Our framework seeks to make this 
distinction clear, and we illustrate both types of disputes. 

A third problem concerns the contextual specificity 
of measurement validity, an issue that arises when a 
measure that is valid in one context is invalid in 
another. We explore several responses to this problem 
that seek a middle ground between a universalizing 
tendency, which is inattentive to contextual differences, 
and a particularizing approach, which is skeptical about 
the feasibility of constructing measures that transcend 
specific contexts. The responses we explore seek to 
incorporate sensitivity to context as a strategy for 
establishing equivalence across diverse settings. 

A fourth problem concerns the frequently confusing 
language used to discuss alternative procedures for 
measurement validation. These procedures have often 
been framed in terms of different “types of validity,” 
among which content, criterion, convergent, and con- 
struct validity are the best known. Numerous other 
labels for alternative types have also been coined, and 
we have found 37 different adjectives that have been 
attached to the noun “validity” by scholars wrestling 
with issues of conceptualization and measurement.2 
The situation sometimes becomes further confused, 
given contrasting views on the interrelations among 
different types of validation. For example, in recent 
validation studies in political science, one valuable 
analysis (Hill, Hanna, and Shafqat 1997) treats “con- 
vergent” validation as providing evidence for “con- 
struct” validation, whereas another (Berry et al. 1998) 
treats these as distinct types. In the psychometrics 
tradition (i.e., in the literature on psychological and 
educational testing) such problems have spurred a 
theoretically productive reconceptualization. This liter- 
ature has emphasized that the various procedures for 
assessing measurement validity must be seen, not as 
establishing multiple independent types of validity, but 
rather as providing different types of evidence for valid- 
ity. In light of this reconceptualization, we differentiate 
between “validity” and “validation.” We use validity to 
refer only to the overall idea of measurement validity, 
and we discuss alternative procedures for assessing 
validity as different “types of validation.” In the final 
part of this article we offer an overview of three main 
types of validation, seeking to emphasize how proce- 
dures associated with each can be applied by both 
quantitative and qualitative researchers. 

In the first section of this article we introduce a 
framework for discussing conceptualization, measure- 
ment, and validity. We then situate questions of validity 
in relation to broader concerns about the meaning of 
concepts. Next, we address contextual specificity and 
equivalence, followed by a review of the evolving 
discussion of types of validation. Finally, we focus on 
three specific types of validation that merit central 
attention in political science: content, convergent/dis- 
criminant, and nomological/construct validation. 


2 We have found the following adjectives attached to validity in 
discussions of conceptualization and measurement: a priori, appar- 
ent, assumption, common-sense, conceptual, concurrent, congruent, 
consensual, consequential, construct, content, convergent, criterion- 
related, curricular, definitional, differential, discriminant, empirical, 
face, factorial, incremental, instrumental, intrinsic, linguistic, logical, 
nomological, postdictive, practical, pragmatic, predictive, rational, 
response, sampling, status, substantive, theoretical, and trait. A 
parallel proliferation of adjectives, in relation to the concept of 
democracy, is discussed in Collier and Levitsky 1997. 


OVERVIEW OF MEASUREMENT VALIDITY 


Measurement validity should be understood in relation 
to issues that arise in moving between concepts and 
observations. 


Levels and Tasks 


We depict the relationship between concepts and ob- 
servations in terms of four levels, as shown in Figure 1. 
At the broadest level is the background concept, which 
encompasses the constellation of potentially diverse 
meanings associated with a given concept. Next is the 
systematized concept, the specific formulation of a 
concept adopted by a particular researcher or group of 
researchers. It is usually formulated in terms of an 
explicit definition. At the third level are indicators, 
which are also routinely called measures. This level 
includes any systematic scoring procedure, ranging 
from simple measures to complex aggregated indexes. 
It encompasses not only quantitative indicators but 
also the classification procedures employed in qualita- 
tive research. At the fourth level are scores for cases, 
which include both numerical scores and the results of 
qualitative classification. 

Downward and upward movement in Figure 1 can be 
understood as a series of research tasks. On the 
left-hand side, conceptualization is the movement from 
the background concept to the systematized concept. 
Operationalization moves from the systematized con- 
cept to indicators, and the scoring of cases applies 
indicators to produce scores. Moving up on the right- 
hand side, indicators may be refined in light of scores, 
and systematized concepts may be fine-tuned in light of 
knowledge about scores and indicators. Insights de- 
rived from these levels may lead to revisiting the 
background concept, which may include assessing al- 
ternative formulations of the theory in which a partic- 
ular systematized concept is embedded. Finally, to 
define a key overarching term, “measurement” involves 
the interaction among levels 2 to 4. 


Defining Measurement Validity 


Valid measurement is achieved when scores (including 
the results of qualitative classification) meaningfully 
capture the ideas contained in the corresponding con- 
cept. This definition parallels that of Bollen (1989, 
184), who treats validity as “concerned with whether a 
variable measures what it is supposed to measure.” 
King, Keohane, and Verba (1994, 25) give essentially 
the same definition. 

If the idea of measurement validity is to do serious 
methodological work, however, its focus must be fur- 
ther specified, as emphasized by Bollen (1989, 197). 
Our specification involves both ends of the connection 
between concepts and scores shown in Figure 1. At the 
concept end, our basic point (explored in detail below) 
is that measurement validation should focus on the 
relation between observations and the systematized 
concept; any potential disputes about the background 
concept should be set aside as an important but 
separate issue. With regard to scores, an obvious but 
crucial point must be stressed: Scores are never exam- 
ined in isolation; rather, they are interpreted and given 
meaning in relation to the systematized concept. 


FIGURE 1. Conceptualization and Measurement: Levels and Tasks 


Level 1. Background Concept 
The broad constellation of meanings and ... 

Level 2. Systematized Concept 
A specific formulation of a concept used by a ... commonly involves an explicit definition. 

Level 3. Indicators 
Also referred to as “measures” and “operationalizations.” In qualitative research, these 
are the operational definitions employed in classifying cases. 

Level 4. Scores for Cases 
The scores for cases generated by a particular indicator. These include both numerical scores 
and the results of qualitative classification. 

Measurement (Levels 2 to 4) 

Task: Conceptualization 
Formulating a systematized concept through reasoning about the background concept, in 
light of the goals of research. 

Task: Operationalization 
Developing, on the basis of a systematized concept, one or more indicators ... 

Task: Scoring Cases 
Applying these indicators to produce scores for the cases being analyzed. 

Task: Refining Indicators 
Modifying indicators, or potentially creating new indicators, in light of observed scores. 

Task: Modifying Systematized Concept 
Fine-tuning the systematized ... 

In sum, measurement is valid when the scores (level 
4 in Figure 1), derived from a given indicator (level 3), 
can meaningfully be interpreted in terms of the system- 
atized concept (level 2) that the indicator seeks to 
operationalize. It would be cumbersome to refer re- 
peatedly to all these elements, but the appropriate 
focus of measurement validation is on the conjunction 
of these components. 





Measurement Error, Reliability, and Validity 


Validity is often discussed in connection with measure- 
ment error and reliability. Measurement error may be 
systematic—in which case it is called bias—or random. 
Random error, which occurs when repeated applica- 
tions of a given measurement procedure yield incon- 
sistent results, is conventionally labeled a problem of 
reliability. Methodologists offer two accounts of the 
relation between reliability and validity. (1) Validity is 
sometimes understood as exclusively involving bias, 
that is, error that takes a consistent direction or form. 
From this perspective, validity involves systematic er- 
ror, whereas reliability involves random error (Car- 
mines and Zeller 1979, 14-5; see also Babbie 2001, 
144-5). Therefore, unreliable scores may still be cor- 
rect “on average” and in this sense valid. (2) Alterna- 
tively, some scholars hesitate to view scores as valid if 
they contain large amounts of random error. They 
believe validity requires the absence of both types of 
error. Therefore, they view reliability as a necessary but 
not sufficient condition of measurement validity (Kirk 
and Miller 1986, 20; Shively 1998, 45). 
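
The difference between the two kinds of error can be made concrete with a small simulation. This is only an illustrative sketch: it assumes a simple additive model in which each score equals a true value plus a systematic error plus a random error, and the function name `measure`, the true value, and the error magnitudes are all hypothetical choices rather than anything taken from the literature cited above.

```python
import random
import statistics


def measure(true_value, bias, noise_sd, n, rng):
    """Simulate n applications of one measurement procedure to one case.

    Assumed model: score = true value + systematic error (bias) + random error.
    """
    return [true_value + bias + rng.gauss(0.0, noise_sd) for _ in range(n)]


rng = random.Random(42)  # fixed seed so the run is repeatable
TRUE_VALUE = 5.0         # hypothetical "true" score of the case

# Procedure A: no bias, large random error (unreliable).
scores_a = measure(TRUE_VALUE, bias=0.0, noise_sd=2.0, n=10_000, rng=rng)
# Procedure B: consistent upward bias, little random error (reliable).
scores_b = measure(TRUE_VALUE, bias=1.5, noise_sd=0.1, n=10_000, rng=rng)

for label, scores in (("A", scores_a), ("B", scores_b)):
    mean_error = statistics.mean(scores) - TRUE_VALUE  # estimates the bias
    spread = statistics.stdev(scores)                  # estimates random error
    print(f"Procedure {label}: mean error {mean_error:+.2f}, spread {spread:.2f}")
```

Under the first account, the scores from procedure A would count as valid though unreliable, because they are correct on average, whereas those from procedure B would be reliable but invalid. Under the second account, neither set of scores would be regarded as valid, since validity is taken to require the absence of both types of error.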

Our goal is not to adjudicate between these accounts 
but to state them clearly and to specify our own focus, 
namely, the systematic error that arises when the links 
among systematized concepts, indicators, and scores 
are poorly developed. This involves validity in the first 
sense stated above. Of course, the random error that 
routinely arises in scoring cases is also important, but it 
is not our primary concern. 

A final point should be emphasized. Because error is 
a pervasive threat to measurement, it is essential to 
view the interpretations of scores in relation to system- 
atized concepts as falsifiable claims (Messick 1989, 
13-4). Scholars should treat these claims just as they 
would any causal hypothesis, that is, as tentative state- 
ments that require supporting evidence. Validity as- 
sessment is the search for this evidence. 


MEASUREMENT VALIDITY AND CHOICES 
ABOUT CONCEPTS 


A growing body of work considers the systematic 
analysis of concepts an important component of polit- 
ical science methodology.3 How should we understand 
the relation between issues of measurement validity 
and broader choices about concepts, which are a 
central focus of this literature? 


Conceptual Choices: Forming the 
Systematized Concept 


We view systematized concepts as the point of depar- 
ture for assessing measurement validity. How do schol- 
ars form such concepts? Because background concepts 
routinely include a variety of meanings, the formation 
of systematized concepts often involves choosing 
among them. The number of feasible options varies 
greatly. At one extreme are concepts such as triangle, 
which are routinely understood in terms of a single 
conceptual systematization; at the other extreme are 
“contested concepts” (Gallie 1956), such as democracy. 
A careful examination of diverse meanings helps clarify 
the options, but ultimately choices must be made. 
These choices are deeply intertwined with issues of 
theory, as emphasized in Kaplan’s (1964, 53) paradox 
of conceptualization: “Proper concepts are needed to 
formulate a good theory, but we need a good theory to 
arrive at the proper concepts.... The paradox is 
resolved by a process of approximation: the better our 
concepts, the better the theory we can formulate with 
them, and in turn, the better the concepts available for 
the next, improved theory.” Various examples of this 
intertwining are explored in recent analyses of impor- 
tant concepts, such as Laitin’s (2000) treatment of 
language community and Kurtz’s (2000) discussion of 
peasant. Fearon and Laitin’s (2000) analysis of ethnic 
conflict, in which they begin with their hypothesis and 
ask what operationalization is needed to capture the 
conceptions of ethnic group and ethnic conflict en- 
tailed in this hypothesis, further illustrates the interac- 
tion of theory and concepts. 


3 Examples of earlier work in this tradition are Sartori 1970, 1984 
and Sartori, Riggs, and Teune 1975. More recent studies include 
Collier and Levitsky 1997; Collier and Mahon 1993; Gerring 1997, 
1999, 2001; Gould 1999; Kurtz 2000; Levitsky 1998; Schaffer 1998. 
Important work in political theory includes Bevir 1999; Freeden 
1996; Gallie 1956; Pitkin 1967, 1987. 

In dealing with the choices that arise in establishing 
the systematized concept, researchers must avoid three 
common traps. First, they should not misconstrue the 
flexibility inherent in these choices as suggesting that 
everything is up for grabs. This is rarely, if ever, the 
case. In any field of inquiry, scholars commonly asso- 
ciate a matrix of potential meanings with the back- 
ground concept. This matrix limits the range of plau- 
sible options, and the researcher who strays outside it 
runs the risk of being dismissed or misunderstood. We 
do not mean to imply that the background concept is 
entirely fixed. It evolves over time, as new understand- 
ings are developed and old ones are revised or fall from 
use. At a given time, however, the background concept 
usually provides a relatively stable matrix. It is essential 
to recognize that a real choice is being made, but it is 
no less essential to recognize that this is a limited 
choice. 

Second, scholars should avoid claiming too much in 
defending their choice of a given systematized concept. 
It is not productive to treat other options as self- 
evidently ruled out by the background concept. For 
example, in the controversy over whether democracy 
versus nondemocracy should be treated as a dichotomy 
or in terms of gradations, there is too much reliance on 
claims that the background concept of democracy 
inherently rules out one approach or the other (Collier 
and Adcock 1999, 546-50). It is more productive to 
recognize that scholars routinely emphasize different 
aspects of a background concept in developing system- 
atized concepts, each of which is potentially plausible. 
Rather than make sweeping claims about what the 
background concept “really” means, scholars should 
present specific arguments, linked to the goals and 
context of their research, that justify their particular 
choices. 

A third problem occurs when scholars stop short of 
providing a fleshed-out account of their systematized 
concepts. This requires not just a one-sentence defini- 
tion, but a broader specification of the meaning and 
entailments of the systematized concept. Within the 
psychometrics literature, Shepard (1993, 417) summa- 
rizes what is required: “both an internal model of 
interrelated dimensions or subdomains” of the system- 
atized concept, and “an external model depicting its 
relationship to other [concepts].” An example is Bol- 
len’s (1990, 9-12; see also Bollen 1980) treatment of 
political democracy, which distinguishes the two di- 
mensions of “political rights” and “political liberties,” 
clarifies these by contrasting them with the dimensions 
developed by Dahl, and explores the relation between 
them. Bollen further specifies political democracy 
through contrasts with the concepts of stability and 
social or economic democracy. In the language of 
Sartori (1984, 51-4), this involves clarifying the seman- 
tic field. 

One consequence of this effort to provide a fleshed- 
out account may be the recognition that the concept 
needs to be disaggregated. What begins as a consider- 
ation of the internal dimensions or components of a 
single concept may become a discussion of multiple 
concepts. In democratic theory an important example 
is the discussion of majority rule and minority rights, 
which are variously treated as components of a single 
overall concept of democracy, as dimensions to be 
analyzed separately, or as the basis for forming distinct 
subtypes of democracy (Dahl 1956; Lijphart 1984; 
Schmitter and Karl 1992). This kind of refinement may 
result from new conceptual and theoretical arguments 
or from empirical findings of the sort that are the focus 
of the convergent/discriminant validation procedures 
discussed below. 


Measurement Validity and the Systematized 
Versus Background Concept 


We stated earlier that the systematized concept, rather 
than the background concept, should be the focus in 
measurement validation. Consider an example. A re- 
searcher may ask: “Is it appropriate that Mexico, prior 
to the year 2000 (when the previously dominant party 
handed over power after losing the presidential elec- 
tion), be assigned a score of 5 out of 10 on an indicator 
of democracy? Does this score really capture how 
‘democratic’ Mexico was compared to other coun- 
tries?” Such a question remains underspecified until we 
know whether “democratic” refers to a particular sys- 
tematized concept of democracy, or whether this re- 
searcher is concerned more broadly with the back- 
ground concept of democracy. Scholars who question 
Mexico’s score should distinguish two issues: (1) a 
concern about measurement—whether the indicator 
employed produces scores that can be interpreted as 
adequately capturing the systematized concept used in 
a given study and (2) a conceptual concern—whether 
the systematized concept employed in creating the 
indicator is appropriate vis-à-vis the background con- 
cept of democracy. 

We believe validation should focus on the first issue, 
whereas the second is outside the realm of measure- 
ment validity. This distinction seems especially appro- 
priate in view of the large number of contested con- 
cepts in political science. The more complex and 
contested the background concept, the more important 
it is to distinguish issues of measurement from funda- 
mental conceptual disputes. To pose the question of 
validity we need a specific conceptual referent against 
which to assess the adequacy of a given measure. A 
systematized concept provides that referent. By con- 
trast, if analysts seek to establish measurement validity 
in relation to a background concept with multiple 
competing meanings, they may find a different answer 
to the validity question for each meaning. 

By restricting the focus of measurement validation to 
the systematized concept, we do not suggest that 
political scientists should ignore basic conceptual is- 
sues. Rather, arguments about the background concept 
and those about validity can be addressed adequately 
only when each is engaged on its own terms, rather 
than conflated into one overly broad issue. Consider 
Schumpeter’s (1947, chap. 21) procedural definition of 
democracy. This definition explicitly rules out elements 
of the background concept, such as the concern with 
substantive policy outcomes, that had been central to 
what he calls the classical theory of democracy. Al- 
though Schumpeter’s conceptualization has been very 
influential in political science, some scholars (Harding 
and Petras 1988; Mouffe 1992) have called for a revised 
conception that encompasses other concerns, such as 
social and economic outcomes. This important debate 
exemplifies the kind of conceptual dispute that should 
be placed outside the realm of measurement validity. 

Recognizing that a given conceptual choice does not 
involve an issue of measurement validity should not 
preclude considered arguments about this choice. An 
example is the argument that minimal definitions can 
facilitate causal assessment (Alvarez et al. 1996, 4; Karl 
1990, 1-2; Linz 1975, 181-2; Sartori 1975, 34). For 
instance, in the debate about a procedural definition of 
democracy, a pragmatic argument can be made that if 
analysts wish to study the causal relationship between 
democracy and socioeconomic equality, then the latter 
must be excluded from the systematization of the 
former. The point is that such arguments can effectively 
justify certain conceptual choices, but they involve 
issues that are different from the concerns of measure- 
ment validation. 


Fine-Tuning the Systematized Concept with 
Friendly Amendments 


We define measurement validity as concerned with the 
relation among scores, indicators, and the systematized 
concept, but we do not rule out the introduction of new 
conceptual ideas during the validation process. Key 
here is the back-and-forth, iterative nature of research 
emphasized in Figure 1. Preliminary empirical work 
may help in the initial formulation of concepts. Later, 
even after conceptualization appears complete, the 
application of a proposed indicator may produce un- 
expected observations that lead scholars to modify 
their systematized concepts. These “friendly amend- 
ments” occur when a scholar, out of a concern with 
validity, engages in further conceptual work to suggest 
refinements or make explicit earlier implicit assump- 
tions. These amendments are friendly because they do 
not fundamentally challenge a systematized concept 
but instead push analysts to capture more adequately 
the ideas contained in it. 

A friendly amendment is illustrated by the emer- 
gence of the “expanded procedural minimum” defini- 
tion of democracy (Collier and Levitsky 1997, 442-4). 
Scholars noted that, despite free or relatively free 
elections, some civilian governments in Central and 
South America to varying degrees lacked effective 
power to govern. A basic concern was the persistence 
of “reserved domains” of military power over which 
elected governments had little authority (Valenzuela 
1992, 70). Because procedural definitions of democracy 
did not explicitly address this issue, measures based 
upon them could result in a high democracy score for 
these countries, but it appeared invalid to view them as 
democratic. Some scholars therefore amended their 
systematized concept of democracy to add the differ- 
entiating attribute that the elected government must to 
a reasonable degree have the power to rule (Karl 1990, 
2; Loveman 1994, 108-13; Valenzuela 1992, 70). De- 
bate persists over the scoring of specific cases (Rabkin 
1992, 165), but this innovation is widely accepted 
among scholars in the procedural tradition (Hunting- 
ton 1991, 10; Mainwaring, Brinks, and Pérez-Liñán 
2001; Markoff 1996, 102-4). As a result of this friendly 
amendment, analysts did a better job of capturing, for 
these new cases, the underlying idea of procedural 
minimum democracy. 
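
The logic of this friendly amendment can be sketched as a change in the classification rule applied to the same cases. The sketch below is a deliberately simplified illustration: the two boolean attributes, the function names, and the three cases are hypothetical, and real indicators of democracy involve many more attributes and finer gradations.

```python
from dataclasses import dataclass


@dataclass
class Case:
    name: str
    free_elections: bool
    # False where "reserved domains" of military power leave the elected
    # government without effective power to rule.
    elected_government_rules: bool


def procedural_minimum(case: Case) -> bool:
    """Classify as democratic on the original systematized concept."""
    return case.free_elections


def expanded_procedural_minimum(case: Case) -> bool:
    """Classify as democratic after adding the amended attribute."""
    return case.free_elections and case.elected_government_rules


# Hypothetical cases, invented purely for illustration.
cases = [
    Case("Country A", free_elections=True, elected_government_rules=True),
    Case("Country B", free_elections=True, elected_government_rules=False),
    Case("Country C", free_elections=False, elected_government_rules=False),
]

for case in cases:
    before = procedural_minimum(case)
    after = expanded_procedural_minimum(case)
    print(f"{case.name}: before amendment {before}, after amendment {after}")
```

Only the hypothetical Country B changes classification. The amendment leaves the scoring of the other cases untouched, which is what makes it a refinement of the systematized concept rather than a challenge to it.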


VALIDITY, CONTEXTUAL SPECIFICITY, AND 
EQUIVALENCE 


Contextual specificity is a fundamental concern that 
arises when differences in context potentially threaten 
the validity of measurement. This is a central topic in 
psychometrics, the field that has produced the most 
innovative work on validity theory. This literature 
emphasizes that the same score on an indicator may 
have different meanings in different contexts (Moss 
1992, 236-8; see also Messick 1989, 15). Hence, the 
validation of an interpretation of scores generated in 
one context does not imply that the same interpreta- 
tion is valid for scores generated in another context. In 
political science, this concern with context can arise 
when scholars are making comparisons across different 
world regions or distinct historical periods. It can also 
arise in comparisons within a national (or other) unit, 
given that different subunits, regions, or subgroups may 
constitute very different political, social, or cultural 
contexts. 

The potential difficulty that context poses for valid 
measurement, and the related task of establishing 
measurement equivalence across diverse units, deserve 
more attention in political science. In a period when 
the quest for generality is a powerful impulse in the 
social sciences, scholars such as Elster (1999, chap. 1) 
have strongly challenged the plausibility of seeking 
general, law-like explanations of political phenomena. 
A parallel constraint on the generality of findings may 
be imposed by the contextual specificity of measure- 
ment validity. We are not arguing that the quest for 
generality be abandoned. Rather, we believe greater 
sensitivity to context may help scholars develop mea- 
sures that can be validly applied across diverse con- 
texts. This goal requires concerted attention to the 
issue of equivalence. 

Contextual Specificity in Political Research


Contextual specificity affects many areas of political 
science. It has long been a problem in cross-national 
survey research (Sicinski 1970; Verba 1971; Verba, 
Nie, and Kim 1978, 32-40; Verba et al. 1987, Appendix).
An example concerning features of national context
is Cain and Ferejohn’s (1981) discussion of how
the differing structure of party systems in the United 
States and Great Britain should be taken into account 
when comparing party identification. Context is also a 
concern for survey researchers working within a single 
nation, who wrestle with the dilemma of “inter-person- 
ally incomparable responses” (Brady 1985). For exam- 
ple, scholars debate whether a given survey item has 
the same meaning for different population sub- 
groups—which could be defined, for example, by re- 
gion, gender, class, or race. One specific concern is 
whether population subgroups differ systematically in 
their “response style” (also called “response sets”). 
Some groups may be more disposed to give extreme 
answers, and others may tend toward moderate an- 
swers (Greenleaf 1992). Bachman and O’Malley (1984) 
show that response style varies consistently with race. 
They argue that apparently important differences 
across racial groups may in part reflect only a different 
manner of answering questions. Contextual specificity 
also can be a problem in survey comparisons over time, 
as Baumgartner and Walker (1990) point out in dis- 
cussing group membership in the United States. 

The issue of contextual specificity of course also 
arises in macro-level research in international and 
comparative studies (Bollen, Entwisle, and Anderson 
1993, 345). Examples from the field of comparative 
politics are discussed below. In international relations, 
attention to context, and particularly a concern with 
“historicizing the concept of structure,” is central to
“constructivism” (Ruggie 1998, 875). Constructivists
argue that modern international relations rest upon 
“constitutive rules” that differ fundamentally from 
those of both medieval Christendom and the classical 
Greek world (p. 873). Although they recognize that 
sovereignty is an organizing principle applicable across 
diverse settings, the constructivists emphasize that the 
“meaning and behavioral implications of this principle
[vary] from one historical context to another” (Reus-Smit
1997, 567). On the other side of this debate,
neorealists such as Fischer (1993, 493) offer a general 
warning: If pushed to an extreme, the “claim to context 
dependency” threatens to “make impossible the collec- 
tive pursuit of empirical knowledge.” He also offers 
specific historical support for the basic neorealist posi- 
tion that the behavior of actors in international politics 
follows consistent patterns. Fischer (1992, 463, 465) 
concludes that “the structural logic of action under 
anarchy has the character of an objective law,” which is 
grounded in “an unchanging essence of human na- 
ture.” 

The recurring tension in social research between 
particularizing and universalizing tendencies reflects in 
part contrasting degrees of concern with contextual
specificity. The approaches to establishing equivalence
discussed below point to the option of a middle ground.
These approaches recognize that contextual differences 
are important, but they seek to combine this insight 
with the quest for general knowledge. 

The lessons for political science are clear. Any 
empirical assessment of measurement validity is neces- 
sarily based on a particular set of cases, and validity 
claims should be made, at least initially, with reference 
to this specific set. To the extent that the set is 
heterogeneous in ways that may affect measurement 
validity, it is essential to (1) assess the implications for 
establishing equivalence across these diverse contexts
and, if necessary, (2) adopt context-sensitive measures.
Extension to additional cases requires similar proce- 
dures. 


Establishing Equivalence: Context-Specific 
Domains of Observation 


One important means of establishing equivalence 
across diverse contexts is careful reasoning, in the 
initial stages of operationalization, about the specific 
domains to which a systematized concept applies. Well 
before thinking about particular scoring procedures, 
scholars may need to make context-sensitive choices 
regarding the parts of the broader polity, economy, or 
society to which they will apply their concept. Equiva- 
lent observations may require, in different contexts, a 
focus on what at a concrete level might be seen as 
distinct types of phenomena. 

Some time ago, Verba (1967) called attention to the 
importance of context-specific domains of observation. 
In comparative research on political opposition in 
stable democracies, a standard focus is on political 
parties and legislative politics, but Verba (pp. 122-3) 
notes that this may overlook an analytically equivalent 
form of opposition that crystallizes, in some countries, 
in the domain of interest group politics. Skocpol (1992, 
6) makes a parallel argument in questioning the claim 
that the United States was a “welfare laggard” because 
social provision was not launched on a large scale until 
the New Deal. This claim is based on the absence of 
standard welfare programs of the kind that emerged 
earlier in Europe but fails to recognize the distinctive 
forms of social provision in the United States, such as 
veterans’ benefits and support for mothers and chil- 
dren. Skocpol argues that the welfare laggard charac- 
terization resulted from looking in the wrong place, 
that is, in the wrong domain of policy. 

Locke and Thelen (1995, 1998) have extended this 
approach in their discussion of “contextualized com- 
parison.” They argue that scholars who study national 
responses to external pressure for economic decentral- 
ization and “flexibilization” routinely focus on the 
points at which conflict emerges over this economic 
transformation. Yet, these “sticking points” may be 
located in different parts of the economic and political 
system. With regard to labor politics in different coun- 
tries, such conflicts may arise over wage equity, hours 
of employment, workforce reduction, or shop-floor 
reorganization. These different domains of labor rela- 
tions must be examined in order to gather analytically 
equivalent observations that adequately tap the concept
of sticking point. Scholars who look only at wage
conflicts run the risk of omitting, for some national 
contexts, domains of conflict that are highly relevant to 
the concept they seek to measure. 

By allowing the empirical domain to which a system- 
atized concept is applied to vary across the units being 
compared, analysts may take a productive step toward 
establishing equivalence among diverse contexts. This 
practice must be carefully justified, but under some 
circumstances it can make an important contribution to 
valid measurement. 


Establishing Equivalence: Context-Specific
Indicators and Adjusted Common 
Indicators 


Two other ways of establishing equivalence involve 
careful work at the level of indicators. We will discuss 
context-specific indicators,4 and what we call adjusted
common indicators. In this second approach, the same 
indicator is applied to all cases but is weighted to 
compensate for contextual differences. 

An example of context-specific indicators is found in 
Nie, Powell, and Prewitt’s (1969, 377) five-country 
study of political participation. For all the countries, 
they analyze four relatively standard attributes of par- 
ticipation. Regarding a fifth attribute—membership in 
a political party—they observe that in four of the 
countries party membership has a roughly equivalent 
meaning, but in the United States it has a different 
form and meaning. The authors conclude that involve- 
ment in U.S. electoral campaigns reflects an equivalent 
form of political participation. Nie, Powell, and Prewitt 
thus focus on a context-specific domain of observation 
(the procedure just discussed above) by shifting their 
attention, for the U.S. context, from party membership 
to campaign participation. They then take the further 
step of incorporating within their overall index of 
political participation context-specific indicators that 
for each case generate a score for what they see as the
appropriate domain. Specifically, the overall index for
the United States includes a measure of campaign 
participation rather than party membership. 
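
The substitution described in this paragraph can be sketched in code. This is a minimal illustration and not Nie, Powell, and Prewitt's actual procedure: the attribute names, the 0-1 scores, the country label "Country B," and the equal-weight average are all assumptions made for the example.

```python
# Hypothetical sketch of an index with one context-specific component.
# Attribute names, scores, and equal weighting are illustrative assumptions.

COMMON_ATTRIBUTES = ["attr_1", "attr_2", "attr_3", "attr_4"]

# Fifth component: party membership by default, campaign participation
# for the United States (the context-specific indicator).
DEFAULT_FIFTH = "party_membership"
CONTEXT_SPECIFIC_FIFTH = {"United States": "campaign_participation"}


def participation_index(country, scores):
    """Average the four common attributes and one context-specific attribute."""
    fifth = CONTEXT_SPECIFIC_FIFTH.get(country, DEFAULT_FIFTH)
    components = COMMON_ATTRIBUTES + [fifth]
    return sum(scores[name] for name in components) / len(components)


# Two invented respondents who are equally active, each in the form of
# participation that is meaningful in their own context.
respondent_us = {"attr_1": 1.0, "attr_2": 0.0, "attr_3": 1.0, "attr_4": 0.0,
                 "party_membership": 0.0, "campaign_participation": 1.0}
respondent_b = {"attr_1": 1.0, "attr_2": 0.0, "attr_3": 1.0, "attr_4": 0.0,
                "party_membership": 1.0, "campaign_participation": 0.0}

print(participation_index("United States", respondent_us))
print(participation_index("Country B", respondent_b))
```

Under these assumptions the two respondents receive the same index value, which is the sense in which the context-specific indicator is meant to yield equivalent scores.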

A different example of context-specific indicators is 
found in comparative-historical research, in the effort 
to establish a meaningful threshold for the onset of 
democracy in the nineteenth and early twentieth cen- 
tury, as opposed to the late twentieth century. This 
effort in turn lays a foundation for the comparative 
analysis of transitions to democracy. One problem in 
establishing equivalence across these two eras lies in
the fact that the plausible agenda of “full” democrati- 
zation has changed dramatically over time. “Full” by 
the standards of an earlier period is incomplete by later 
standards. For example, by the late twentieth century, 
universal suffrage and the protection of civil rights for 
the entire national population had come to be considered
essential features of democracy, but in the nineteenth

4 This approach was originally used by Przeworski and Teune (1970,
chap. 6), who employed the label “system-specific indicator.”

century they were not (Huntington 1991, 7, 16).
Yet, if the more recent standard is applied to the 
earlier period, cases are eliminated that have long been 
considered classic examples of nascent democratiza- 
tion in Europe. One solution is to compare regimes 
with respect to a systematized concept of full democ- 
ratization that is operationalized according to the 
norms of the respective periods (Collier 1999, chap. 1; 
Russett 1993, 15; see also Johnson 1999, 118). Thus, a 
different scoring procedure—a context-specific indica- 
tor—is employed in each period in order to produce 
scores that are comparable with respect to this system- 
atized concept.5 
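
A period-specific scoring procedure of this kind might be sketched as follows. The criteria assigned to each period are stylized assumptions chosen only to illustrate the logic; they are not the scoring rules used by the authors cited above.

```python
# Hypothetical sketch: scoring "full democratization" by the norms of each
# period. The criteria listed for each period are illustrative assumptions.

CRITERIA_BY_PERIOD = {
    "early_period": {"competitive_elections"},
    "late_twentieth_century": {
        "competitive_elections",
        "universal_suffrage",
        "civil_rights_for_all",
    },
}


def is_full_democracy(period, attributes):
    """A regime passes if it meets every criterion of its own period."""
    return CRITERIA_BY_PERIOD[period] <= set(attributes)


early_regime = {"competitive_elections"}

# The same observed attributes are scored differently depending on the era.
print(is_full_democracy("early_period", early_regime))
print(is_full_democracy("late_twentieth_century", early_regime))
```

The point of the sketch is that the indicator (the set of criteria) varies by period, while the systematized concept being scored stays the same.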

Adjusted common indicators are another way to 
establish equivalence. An example is found in Moene 
and Wallerstein’s (2000) quantitative study of social 
policy in advanced industrial societies, which focuses 
specifically on public social expenditures for individuals 
outside the labor force. One component of their mea- 
sure is public spending on health care. The authors 
argue that in the United States such health care 
expenditures largely target those who are not members 
of the labor force. By contrast, in other countries
health expenditures are allocated without respect to 
employment status. Because U.S. policy is distinctive, 
the authors multiply health care expenditures in the 
other countries by a coefficient that lowers their scores 
on this variable. Their scores are thereby made roughly 
equivalent—as part of a measure of public expendi- 
tures on individuals outside the labor force—to the 
U.S. score. A parallel effort to establish equivalence in 
the analysis of economic indicators is provided by 
Zeitsch, Lawrence, and Salerian (1994, 169), who use
an adjustment technique in estimating total factor 
productivity to take account of the different operating 
environments, and hence the different context, of the 
industries they compare. Expressing indicators in per- 
capita terms is also an example of adjusting indicators 
in light of context. Overall, this practice is used to 
address both very specific problems of equivalence, as 
with the Moene and Wallerstein example, as well as 
more familiar concerns, such as standardizing by pop- 
ulation. 
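
The two kinds of adjustment mentioned here can be illustrated with a short sketch. The coefficient and all spending figures are invented for the example; Moene and Wallerstein's actual coefficient is not given in this text and is not reproduced here.

```python
# Hypothetical sketch of adjusted common indicators.
# The coefficient and the figures passed in are assumptions for illustration.

# Assumed share of health spending that reaches people outside the labor
# force in countries where spending is allocated regardless of employment.
HEALTH_ADJUSTMENT = 0.5


def adjusted_health_spending(country, spending, unadjusted=("United States",)):
    """Scale health spending down except where it already targets the
    population outside the labor force."""
    if country in unadjusted:
        return spending
    return spending * HEALTH_ADJUSTMENT


def per_capita(total, population):
    """Standardize a total by population, a more familiar adjustment."""
    return total / population


print(adjusted_health_spending("United States", 3.0))
print(adjusted_health_spending("Country B", 8.0))
print(per_capita(1000.0, 50))
```

In both functions the same indicator is applied to every case; only the weighting changes with context.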

Context-specific indicators and adjusted common 
indicators are not always a step forward, and some 
scholars have self-consciously avoided them. The use of 
such indicators should match the analytic goal of the 
researcher. For example, many who study democrati- 
zation in the late twentieth century deliberately adopt 
a minimum definition of democracy in order to con- 
centrate on a limited set of formal procedures. They do 
this out of a conviction that these formal procedures 
are important, even though they may have different 
meanings in particular settings. Even a scholar such as 
O'Donnell (1993, 1355), who has devoted great atten- 
tion to contextualizing the meaning of democracy, 
insists on the importance of also retaining a minimal 
definition of “political democracy” that focuses on 


5 A well-known example of applying different standards for democracy
in making comparisons across international regions is Lipset
(1959, 73-4).

basic formal procedures. Thus, for certain purposes, it
can be analytically productive to adopt a standard 
definition that ignores nuances of context and apply the 
same indicator to all cases. 

In conclusion, we note that although Przeworski and 
Teune’s (1970) and Verba’s arguments about equiva- 
lence are well known, issues of contextual specificity 
and equivalence have not received adequate attention 
in political science. We have identified three tools— 
context-specific domains of observation, context-spe- 
cific indicators, and adjusted common indicators—for 
addressing these issues, and we encourage their wider 
use. We also advocate greater attention to justifying 
their use. Claims about the appropriateness of contex- 
tual adjustments should not simply be asserted; their 
validity needs to be carefully defended. Later, we 
explore three types of validation that may be fruitfully 
applied in assessing proposals for context-sensitive 
measurement. In particular, content validation, which 
focuses on whether operationalization captures the
ideas contained in the systematized concept, is central
to determining whether and how measurement needs
to be adjusted in particular contexts. 


ALTERNATIVE PERSPECTIVES ON TYPES 
OF VALIDATION 


Discussions of measurement validity are confounded 
by the proliferation of different types of validation, and 
by an even greater number of labels for them. In this 
section we review the emergence of a unified concep- 
tion of measurement validity in the field of psychomet- 
rics, propose revisions in the terminology for talking 
about validity, and examine the important treatments 
of validation in political analysis offered by Carmines 
and Zeller, and by Bollen. 


Evolving Understandings of Validity


In the psychometric tradition, current thinking about 
measurement validity developed in two phases. In the 
first phase, scholars wrote about “types of validity” in a 
way that often led researchers to treat each type as if it 
independently established a distinct form of validity. In 
discussing this literature we follow its terminology by 
referring to types of “validity.” As noted above, in the 
rest of this article we refer instead to types of “valida- 
tion.” 

The first pivotal development in the emergence of a 
unified approach occurred in the 1950s and 1960s, 
when a threefold typology of content, criterion, and 
construct validity was officially established in reaction 
to the confusion generated by the earlier proliferation 
of types.6 Other labels continued to appear in other 
disciplines, but this typology became an orthodoxy in 


6 The second of these is often called criterion-related validity.
Regarding these official standards, see American Psychological Association
1954, 1966; Angoff 1988, 25; Messick 1989, 16-7; Shultz,
Riggs, and Kottke 1998, 267-9. The 1954 standards initially presented
four types of validity, which became the threefold typology in
1966 when “predictive” and “concurrent” validity were combined as
“criterion-related” validity.

psychology. A recurring metaphor in that field characterized
the three types as “something of a holy trinity
representing three different roads to psychometric sal- 
vation” (Guion 1980, 386). These types may be briefly 
defined as follows. 


• Content validity assesses the degree to which an
indicator represents the universe of content entailed
in the systematized concept being measured.

• Criterion validity assesses whether the scores produced
by an indicator are empirically associated with
scores for other variables, called criterion variables,
which are considered direct measures of the phenomenon
of concern.

• Construct validity has had a range of meanings. One
central focus has been on assessing whether a given
indicator is empirically associated with other indicators
in a way that conforms to theoretical expectations
about their interrelationship.


These labels remain very influential and are still the 
centerpiece in some discussions of measurement valid- 
ity, as in the latest edition of Babbie’s (2001, 143-4) 
widely used methods textbook for undergraduates. 

The second phase grew out of increasing dissatisfac- 
tion with the “trinity” and led to a “unitarian” ap- 
proach (Shultz, Riggs, and Kottke 1998, 269-71). A 
basic problem identified by Guion (1980, 386) and 
others was that the threefold typology was too often 
taken to mean that any one type was sufficient to 
establish validity (Angoff 1988, 25). Scholars increas- 
ingly argued that the different types should be sub- 
sumed under a single concept. Hence, to continue with 
the prior metaphor, the earlier trinity came to be seen 
“in a monotheistic mode as the three aspects of a 
unitary psychometric divinity” (p. 25). 

Much of the second phase involved a reconceptual- 
ization of construct validity and its relation to content 
and criterion validity. A central argument was that the 
latter two may each be necessary to establish validity, 
but neither is sufficient. They should be understood as 
part of a larger process of validation that integrates 
“multiple sources of evidence” and requires the com- 
bination of “logical argument and empirical evidence” 
(Shepard 1993, 406). Alongside this development, a 
reconceptualization of construct validity led to “a more 
comprehensive and theory-based view that subsumed 
other more limited perspectives” (Shultz, Riggs, and 
Kottke 1998, 270). This broader understanding of 
construct validity as the overarching goal of a single, 
integrated process of measurement validation is widely 
endorsed by psychometricians. Moss (1995, 6) states 
“there is a close to universal consensus among validity 
theorists” that “content- and criterion-related evidence 
of validity are simply two of many types of evidence 
that support construct validity.” 

Thus, in the psychometric literature (e.g., Messick 
1980, 1015), the term “construct validity” has become 
essentially a synonym for what we call measurement 
validity. We have adopted measurement validity as the 
name for the overall topic of this article, in part 
because in political science the label construct validity 
commonly refers to specific procedures rather than to 
the general idea of valid measurement. These specific
procedures generally do not encompass content valida- 
tion and have in common the practice of assessing 
measurement validity by taking as a point of reference 
established conceptual and/or theoretical relationships. 

We find it helpful to group these procedures into two 
types according to the kind of theoretical or conceptual 
relationship that serves as the point of reference.
Specifically, these types are based on the heuristic
distinction between description and explanation.7 First,
some procedures rely on “descriptive” expectations 
concerning whether given attributes are understood as 
facets of the same phenomenon. This is the focus of 
what we label “convergent/discriminant validation.” 
Second, other procedures rely on relatively well-estab- 
lished “explanatory” causal relations as a baseline 
against which measurement validity is assessed. In 
labeling this second group of procedures we draw on 
Campbell’s (1960, 547) helpful term, “nomological” 
validation, which evokes the idea of assessment in 
relation to well-established causal hypotheses or law- 
like relationships. This second type is often called 
construct validity in political research (Berry et al. 
1998; Elkins 2000).8 Out of deference to this usage, in 
the headings and summary statements below we will 
refer to nomological/construct validation. 


Types of Validation in Political Analysis


A baseline for the revised discussion of validation 
presented below is provided in work by Carmines and 
Zeller, and by Bollen. Carmines and Zeller (1979, 26; 
Zeller and Carmines 1980, 78-80) argue that content 
validation and criterion validation are of limited utility 
in fields such as political science. While recognizing 
that content validation is important in psychology and 
education, they argue that evaluating it “has proved to 
be exceedingly difficult with respect to measures of the 
more abstract phenomena that tend to characterize the 
social sciences” (Carmines and Zeller 1979, 22). For 
criterion validation, these authors emphasize that in 
many social sciences, few “criterion” variables are 
available that can serve as “real” measures of the 
phenomena under investigation, against which scholars 
can evaluate alternative measures (pp. 19-20). Hence, 
for many purposes it is simply not a relevant procedure. 
Although Carmines and Zeller call for the use of 
multiple sources of evidence, their emphasis on the
limitations of the first two types of validation leads 
them to give a predominant role to nomological/ 
construct validation. 

In relation to Carmines and Zeller, Bollen (1989, 
185-6, 190-4) adds convergent/discriminant validation


7 Description and explanation are of course intertwined, but we find
this distinction invaluable for exploring contrasts among validation
[...] relations and hence are productively differentiated by
our typology.
8 See also the main examples of construct validation presented in the
major statements by Carmines and Zeller 1979, 23, and Bollen 1989,
189-90.

to their three types and emphasizes content validation,
which he sees as both viable and fundamental. He also 
raises general concerns about correlation-based ap- 
proaches to convergent and nomological/construct val- 
idation, and he offers an alternative approach based on 
structural equation modeling with latent variables (pp. 
192-206). Bollen shares the concern of Carmines and 
Zeller that, for most social research, “true” measures 
do not exist against which criterion validation can be 
carried out, so he likewise sees this as a less relevant 
type (p. 188). 

These valuable contributions can be extended in 
several respects. First, with reference to Carmines and 
Zeller’s critique of content validation, we recognize 
that this procedure is harder to use if concepts are 
abstract and complex. Moreover, it often does not lend 
itself to the kind of systematic, quantitative analysis 
routinely applied in some other kinds of validation. 
Yet, like Bollen (1989, 185-6, 194), we are convinced it 
is possible to lay a secure foundation for content 
validation that will make it a viable, and indeed essen- 
tial, procedure. Our discussion of this task below 
derives from our distinction between the background 
and the systematized concept. 

Second, we share the conviction of Carmines and 
Zeller that nomological/construct validation is impor- 
tant, yet given our emphasis on content and conver- 
gent/discriminant validation, we do not privilege it to 
the degree they do. Our discussion will seek to clarify 
some aspects of how this procedure actually works and 
will address the skeptical reaction of many scholars to 
it. 

Third, we have a twofold response to the critique of 
criterion validation as irrelevant to most forms of social 
research. On the one hand, in some domains criterion 
validation is important, and this must be recognized. 
For example, the literature on response validity in 
survey research seeks to evaluate individual responses 
to questions, such as whether a person voted in a 
particular election, by comparing them to official voting 
records (Anderson and Silver 1986; Clausen 1968; 
Katosh and Traugott 1980). Similarly, in panel studies 
it is possible to evaluate the adequacy of “recall” (i.e.,
whether respondents remember their own earlier opinions,
dispositions, and behavior) through comparison
with responses in earlier studies (Niemi, Katz, and
Newman 1980). On the other hand, this is not one of 
the most generally applicable types of validation, and 
we favor treating it as one subtype within the broader 
category of convergent validation. As discussed below, 
convergent validation compares a given indicator with 
one or more other indicators of the concept—in which 
the analyst may or may not have a higher level of 
confidence. Even if these other indicators are as fallible 
as the indicator being evaluated, the comparison pro- 
vides greater leverage than does looking only at one of 
them in isolation. To the extent that a well-established, 
direct measure of the phenomenon under study is 
available, convergent validation is essentially the same 
as criterion validation. 
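
The response-validity comparison described above, in which self-reported turnout is checked against official voting records, can be sketched as follows. The function name, the two summary rates, and the data are assumptions introduced for illustration; the studies cited may use different statistics.

```python
# Hypothetical sketch of criterion validation for a turnout question.
# The official record is treated as the criterion variable; data are invented.

def turnout_validation(self_reports, official_records):
    """Compare self-reported turnout with official records, respondent by
    respondent, and return two simple summary rates."""
    if len(self_reports) != len(official_records):
        raise ValueError("both lists must cover the same respondents")
    pairs = list(zip(self_reports, official_records))
    agreement = sum(1 for said, did in pairs if said == did) / len(pairs)
    # Overreporting: respondents who claim to have voted although the
    # official record shows no vote, as a share of recorded nonvoters.
    nonvoters = [said for said, did in pairs if not did]
    overreport = sum(nonvoters) / len(nonvoters) if nonvoters else None
    return {"agreement": agreement, "overreport": overreport}


self_reports = [True, True, False, True, False, True]
official_records = [True, False, False, True, False, False]

print(turnout_validation(self_reports, official_records))
```

With a less authoritative second indicator in place of the official record, the same comparison would be an instance of convergent rather than criterion validation.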

Finally, in contrast both to Carmines and Zeller and 
to Bollen, we will discuss the application of the different
types of validation in qualitative as well as quantitative
research, using examples drawn from both traditions.
Furthermore, we will employ crucial distinctions
introduced above, including the differentiation of levels 
presented in Figure 1, as well as the contrast between 
specific procedures for validation, as opposed to the 
overall idea of measurement validity. 


THREE TYPES OF MEASUREMENT 
VALIDATION: QUALITATIVE AND 
QUANTITATIVE EXAMPLES 


We now discuss various procedures, both qualitative 
and quantitative, for assessing measurement validity. 
We organize our presentation in terms of a threefold 
typology: content, convergent/discriminant, and nomo- 
logical/construct validation. The goal is to explicate 
each of these types by posing a basic question that, in 
all three cases, can be addressed by both qualitative 
and quantitative scholars. Two caveats should be intro- 
duced. First, while we discuss correlation-based ap- 
proaches to validity assessment, this article is not 
intended to provide a detailed or exhaustive account of 
relevant statistical tests. Second, we recognize that no 
rigid boundaries exist among alternative procedures, 
given that one occasionally shades off into another. 
Our typology is a heuristic device that shows how 
validation procedures can be grouped in terms of basic 
questions, and thereby helps bring into focus parallels 
and contrasts in the approaches to validation adopted 
by qualitative and quantitative researchers. 


Content Validation 


Basic Question. In the framework of Figure 1, does a 
given indicator (level 3) adequately capture the full 
content of the systematized concept (level 2)? This 
“adequacy of content” is assessed through two further 
questions. First, are key elements omitted from the 
indicator? Second, are inappropriate elements included
in the indicator?9 An examination of the scores
(level 4) of specific cases may help answer these 
questions about the fit between levels 2 and 3. 


Discussion. In contrast to the other types considered, 
content validation is distinctive in its focus on concep- 
tual issues, specifically, on what we have just called 
adequacy of content. Indeed, it developed historically 
as a corrective to forms of validation that focused solely 
on the statistical analysis of scores, and in so doing 
overlooked important threats to measurement validity 
(Sireci 1998, 83-7).

Because content validation involves conceptual rea- 
soning, it is imperative to maintain the distinction we 
made between issues of validation and questions con- 
cerning the background concept. If content validation 
is to be useful, then there must be some ground of 
conceptual agreement about the phenomena being 
investigated (Bollen 1989, 186; Cronbach and Meehl 


9 Some readers may think of these questions as raising issues of “face
validity.” We have found so many different definitions of face validity
that we prefer not to use this label.

1955, 282). Without it, a well-focused validation question
may rapidly become entangled in a broader dispute
over the concept. Such agreement can be provided
if the systematized concept is taken as given, so
attention can be focused on whether a particular 
indicator adequately captures its content. 


Examples of Content Validation. Within the psychometric
tradition (Angoff 1988, 27-8; Shultz, Riggs, and
Kottke 1998, 267-8), content validation is understood 
as focusing on the relationship between the indicator 
(level 3) and the systematized concept (level 2), with- 
out reference to the scores of specific cases (level 4). 
We will first present examples from political science 
that adopt this focus. We will then turn to a somewhat 
different, “case-oriented” procedure (Ragin 1987, 
chap. 3), identified with qualitative research, in which 
the examination of scores for specific cases plays a 
central role in content validation. 

Two examples from political research illustrate, re- 
spectively, the problems of omission of key elements 
from the indicator and inclusion of inappropriate ele- 
ments. Paxton’s (2000) article on democracy focuses on 
the first problem. Her analysis is particularly salient for 
scholars in the qualitative tradition, given its focus on 
choices about the dichotomous classification of cases. 
Paxton contrasts the systematized concepts of democ- 
racy offered by several prominent scholars—Bollen, 
Gurr, Huntington, Lipset, Muller, and Rueschemeyer, 
Stephens, and Stephens—with the actual content of the 
indicators they propose. She takes their systematized 
concepts as given, which establishes common conceptual
ground. She observes that these scholars include
universal suffrage in what is in effect their systematized 
concept of democracy, but the indicators they employ 
in operationalizing the concept consider only male 
suffrage. Paxton thus focuses on the problem that an 
important component of the systematized concept is 
omitted from the indicator. 

The debate on Vanhanen’s (1979, 1990) quantitative 
indicator of democracy illustrates the alternative prob- 
lem that the indicator incorporates elements that cor- 
respond to a concept other than the systematized 
concept of concern. Vanhanen seeks to capture the 
idea of political competition that is part of his system- 
atized concept of democracy by including, as a compo- 
nent of his scale, the percentage of votes won by parties 
other than the largest party. Bollen (1990, 13, 15) and 
Coppedge (1997, 6) both question this measure of 
democracy, arguing that it incorporates elements 
drawn from a distinct concept, the structure of the 
party system. 


Case-Oriented Content Validation. Researchers en- 
gaged in the qualitative classification of cases routinely 
carry out a somewhat different procedure for content 
validation, based on the relation between conceptual 
meaning and choices about scoring particular cases. In 
the vocabulary of Sartori (1970, 1040-6), this concerns 
the relation between the “intension” (meaning) and 
“extension” (set of positive cases) of the concept. For 
Sartori, an essential aspect of concept formation is the 
procedure of adjusting this relation between cases and 
concept. In the framework of Figure 1, this procedure 
involves revising the indicator (i.e., the scoring proce- 
dure) in order to sort cases in a way that better fits 
conceptual expectations, and potentially fine-tuning 
the systematized concept to better fit the cases. Ragin 
(1994, 98) terms this process of mutual adjustment 
“double fitting.” This procedure avoids conceptual 
stretching (Collier and Mahon 1993; Sartori 1970), that 
is, a mismatch between a systematized concept and the 
scoring of cases, which is clearly an issue of validity. 

An example of case-oriented content validation is 
found in O’Donnell’s (1996) discussion of democratic 
consolidation. Some scholars suggest that one indicator 
of consolidation is the capacity of a democratic regime 
to withstand severe crises. O'Donnell argues that by 
this standard, some Latin American democracies 
would be considered more consolidated than those in 
southern Europe. He finds this an implausible classifi- 
cation because the standard leads to a “reductio ad 
absurdum” (p. 43). This example shows how attention 
to specific cases can spur recognition of dilemmas in 
the adequacy of content and can be a productive tool in 
content validation. 

In sum, for case-oriented content validation, upward 
movement in Figure 1 is especially important. It can 
lead to both refining the indicator in light of scores and 
fine-tuning the systematized concept. In addition, al- 
though the systematized concept being measured is 
usually relatively stable, this form of validation may 
lead to friendly amendments that modify the system- 
atized concept by drawing ideas from the background 
concept. To put this another way, in this form of 
validation both an “inductive” component and concep- 
tual innovation are especially important. 


Limitations of Content Validation. Content validation 
makes an important contribution to the assessment of 
measurement validity, but alone it is incomplete, for 
two reasons. First, although a necessary condition, the 
findings of content validation are not a sufficient con- 
dition for establishing validity (Shepard 1993, 414-5; 
Sireci 1998, 112). The key point is that an indicator 
with valid content may still produce scores with low 
overall measurement validity, because further threats 
to validity can be introduced in the coding of cases. A 
second reason concerns the trade-off between parsi- 
mony and completeness that arises because indicators 
routinely fail to capture the full content of a system- 
atized concept. Capturing this content may require a 
complex indicator that is hard to use and adds greatly 
to the time and cost of completing the research. It is a 
matter of judgment for scholars to decide when efforts 
to further improve the adequacy of content may be- 
come counterproductive. 

It is useful to complement the conceptual criticism of 
indicators by examining whether particular modifica- 
tions in an indicator make a difference in the scoring of 
cases. To the extent that such modifications have little 
influence on scores, their contribution to improving 
validity is more modest. An example in which their 
contribution is shown to be substantial is provided by 
Paxton (2000). She develops an alternative indicator of 
democracy that takes female suffrage into account, 
compares the scores it produces with those produced 
by the indicators she originally criticized, and shows 
that her revised indicator yields substantially different 
findings. Her content validation argument stands on 
conceptual grounds alone, but her information about 
scoring demonstrates the substantive importance of 
her concerns. This comparison of indicators in a sense 
introduces us to convergent/discriminant validation, to 
which we now turn. 
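
The comparison Paxton carries out can be sketched in a few lines. The example below is only an illustration: the four cases and their suffrage dates are invented, and the names `CASES`, `democratic_since`, and `compare_indicators` are hypothetical rather than drawn from Paxton (2000). It scores each case twice, once with a male-suffrage criterion and once with a universal-suffrage criterion, and reports which cases change.

```python
# Hypothetical cases: (name, year male suffrage met, year universal suffrage met).
# These values are invented for illustration and are not taken from Paxton (2000).
CASES = [
    ("Case A", 1870, 1920),
    ("Case B", 1848, 1944),
    ("Case C", 1893, 1893),
    ("Case D", 1907, 1913),
]


def democratic_since(case, require_female_suffrage):
    """Return the first year in which the case meets the suffrage criterion."""
    _, male_year, universal_year = case
    return universal_year if require_female_suffrage else male_year


def compare_indicators(cases):
    """List each case with both scores and the gap in years between them."""
    rows = []
    for case in cases:
        original = democratic_since(case, require_female_suffrage=False)
        revised = democratic_since(case, require_female_suffrage=True)
        rows.append((case[0], original, revised, revised - original))
    return rows


rows = compare_indicators(CASES)
changed = [name for name, _, _, gap in rows if gap > 0]

for name, original, revised, gap in rows:
    print(f"{name}: original={original} revised={revised} gap={gap} years")
print("Cases rescored by the revised indicator:", changed)
```

In this invented data, three of the four cases receive a later date under the revised indicator. A result of that kind is what shows that a conceptual criticism also makes a difference in the scoring of cases.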


Convergent/Discriminant Validation 


Basic Question. Are the scores (level 4) produced by 
alternative indicators (level 3) of a given systematized 
concept (level 2) empirically associated and thus con- 
vergent? Furthermore, do these indicators have a 
weaker association with indicators of a second, differ- 
ent systematized concept, thus discriminating this sec- 
ond group of indicators from the first? Stronger asso- 
ciations constitute evidence that supports interpreting 
indicators as measuring the same systematized con- 
cept—thus providing convergent validation; whereas 
weaker associations support the claim that they mea- 
sure different concepts—thus providing discriminant 
validation. The special case of convergent validation in 
which one indicator is taken as a standard of reference, 
and is used to evaluate one or more alternative indi- 
cators, is called criterion validation, as discussed above. 
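
The basic question can be made concrete with a small numerical sketch. The scores below are invented for eight hypothetical cases, and the variable names are assumptions introduced only for illustration. Two indicators of the same systematized concept are compared with each other (convergence) and with an indicator of a second concept (discrimination), using a plain Pearson correlation.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


# Invented scores for eight cases (assumption: not real data).
indicator_a = [1, 2, 3, 4, 5, 6, 7, 8]      # first indicator of concept X
indicator_b = [2, 1, 4, 3, 6, 5, 8, 7]      # second indicator of concept X
other_concept = [5, 3, 6, 2, 7, 1, 4, 8]    # indicator of a distinct concept Y

convergent = pearson(indicator_a, indicator_b)
discriminant = [pearson(indicator_a, other_concept),
                pearson(indicator_b, other_concept)]

print(f"convergent r = {convergent:.2f}")
print("discriminant r =", [round(r, 2) for r in discriminant])
```

With these invented scores the two indicators of the same concept correlate at about 0.90, while their correlations with the indicator of the second concept are about 0.19 and 0.38. Whether such a pattern counts as evidence for validity still depends on the conceptual reasons for expecting it.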


Discussion. Carefully defined systematized concepts, 
and the availability of two or more alternative indica- 
tors of these concepts, are the starting point for 
convergent/discriminant validation. They lay the 
groundwork for arguments that particular indicators 
measure the same or different concepts, which in turn 
create expectations about how the indicators may be 
empirically associated. To the extent that empirical 
findings match these “descriptive” expectations, they 
provide support for validity. 

Empirical associations are crucial to convergent/ 
discriminant validation, but they are often simply the 
point of departure for an iterative process. What 
initially appears to be negative evidence can spur 
refinements that ultimately enhance validity. That is, 
the failure to find expected convergence may encour- 
age a return to the conceptual and logical analysis of 
indicators, which may lead to their modification. Alter- 
natively, researchers may conclude that divergence 
suggests the indicators measure different systematized 
concepts and may reevaluate the conceptualization 
that led them to expect convergence. This 
illustrates the intertwining of convergent and discrimi- 
nant validation. 

Examples of Convergent/Discriminant Validation. 
Scholars who develop measures of democracy fre- 
quently use convergent validation. Thus, analysts who 
create a new indicator commonly report its correlation 
with previously established indicators (Bollen 1980, 
380-2; Coppedge and Reincke 1990, 61; Mainwaring et 
al. 2001, 52; Przeworski et al. 1996, 52). This is a 
valuable procedure, but it should not be employed 
atheoretically. Scholars should have specific conceptual 
reasons for expecting convergence if it is to constitute 
evidence for validity. Let us suppose a proposed indi- 
cator is meant to capture a facet of democracy over- 
looked by existing measures; then too high a correla- 
tion is in fact negative evidence regarding validity, for 
it suggests that nothing new is being captured. 

An example of discriminant validation is provided by 
Bollen’s (1980, 373-4) analysis of voter turnout. As in 
the studies just noted, different measures of democracy 
are compared, but in this instance the goal is to find 
empirical support for divergence. Bollen claims, based 
on content validation, that voter turnout is an indicator 
of a concept distinct from the systematized concept of 
political democracy. The low correlation of voter turn- 
out with other proposed indicators of democracy pro- 
vides discriminant evidence for this claim. Bollen con- 
cludes that turnout is best understood as an indicator 
of political participation, which should be conceptual- 
ized as distinct from political democracy. 

Although qualitative researchers routinely lack the 
data necessary for the kind of statistical analysis per- 
formed by Bollen, convergent/discriminant validation 
is by no means irrelevant for them. They often assess 
whether the scores for alternative indicators converge 
or diverge. Paxton, in the example discussed above, in 
effect uses discriminant validation when she compares 
alternative qualitative indicators of democracy in order 
to show that recommendations derived from her as- 
sessment of content validation make a difference em- 
pirically. This comparison, based on the assessment of 
scores, “discriminates” among alternative indicators. 
Convergent/discriminant validation is also employed 
when qualitative researchers use a multimethod ap- 
proach involving “triangulation” among multiple indi- 
cators based on different kinds of data sources (Brewer 
and Hunter 1989; Campbell and Fiske 1959; Webb et 
al. 1966). Orum, Feagin, and Sjoberg (1991, 19) spe- 
cifically argue that one of the great strengths of the 
case study tradition is its use of triangulation for 
enhancing validity. In general, the basic ideas of con- 
vergent/discriminant validation are at work in qualita- 
tive research whenever scholars compare alternative 
indicators. 


Concerns about Convergent/Discriminant Validation. A 
first concern here is that scholars might think that in 
convergent/discriminant validation empirical findings 
always dictate conceptual choices. This frames the 
issue too narrowly. For example, Bollen (1993, 1208-9, 
1217) analyzes four indicators that he takes as compo- 
nents of the concept of political liberties and four 
indicators that he understands as aspects of democratic 
rule. An examination of Bollen’s covariance matrix 
reveals that these do not emerge as two separate 
empirical dimensions. Convergent/discriminant valida- 
tion, mechanically applied, might lead to a decision to 
eliminate this conceptual distinction. Bollen does not 
take that approach. He combines the two clusters of 
indicators into an overall empirical measure, but he 
also maintains the conceptual distinction. Given the 
conceptual congruence between the two sets of indica- 
tors and the concepts of political liberties and demo- 
cratic rule, the standard of content validation is clearly 
met, and Bollen continues to use these overarching 
labels. 

Another concern arises over the interpretation of 
low correlations among indicators. Analysts who lack a 
“true” measure against which to assess validity must 
base convergent validation on a set of indicators, none 
of which may be a very good measure of the system- 
atized concept. The result may be low correlations 
among indicators, even though they have shared vari- 
ance that measures the concept. One possible solution 
is to focus on this shared variance, even though the 
overall correlations are low. Standard statistical tech- 
niques may be used to tap this shared variance. 
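
A simulation can illustrate why weakly correlated indicators may still share variance that measures the concept. The sketch below rests on stated assumptions: the data are simulated, the "true" concept is known only because we generate it, and a simple average of the indicators stands in for the factor-analytic techniques that would normally be used. It is not a reconstruction of any procedure in the works cited here.

```python
import random
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


rng = random.Random(0)          # fixed seed so the sketch is repeatable
N_CASES, N_INDICATORS = 2000, 6

# Simulated "true" scores on the concept (unobservable in real research).
concept = [rng.gauss(0, 1) for _ in range(N_CASES)]

# Each indicator is a weak, noisy reflection of the concept.
indicators = [[0.5 * c + rng.gauss(0, 1) for c in concept]
              for _ in range(N_INDICATORS)]

# Average correlation across all pairs of indicators.
pairwise = [pearson(indicators[i], indicators[j])
            for i in range(N_INDICATORS)
            for j in range(i + 1, N_INDICATORS)]
mean_pairwise = sum(pairwise) / len(pairwise)

# A composite that pools what the indicators have in common.
composite = [sum(scores) / N_INDICATORS for scores in zip(*indicators)]
single_fit = max(pearson(ind, concept) for ind in indicators)
composite_fit = pearson(composite, concept)

print(f"mean pairwise r among indicators: {mean_pairwise:.2f}")
print(f"best single indicator vs concept: {single_fit:.2f}")
print(f"composite vs concept:             {composite_fit:.2f}")
```

Under these assumptions the pairwise correlations are low, yet the composite tracks the simulated concept more closely than any single indicator does, because pooling the indicators averages out their independent errors.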

The opposite problem also is a concern: the limita- 
tions of inferring validity from a high correlation 
among indicators. Such a correlation may reflect fac- 
tors other than valid measurement. For example, two 
indicators may be strongly correlated because they 
both measure some other concept; or they may mea- 
sure different concepts, one of which causes the other. 
A plausible response is to think through, and seek to 
rule out, alternative reasons for the high correlation. 

Although framing these concerns in the language of 
high and low correlations appears to orient the discus- 
sion toward quantitative researchers, qualitative re- 
searchers face parallel issues. Specifically, these issues 
arise when qualitative researchers analyze the sorting 
of cases produced by alternative classification proce- 
dures that represent different ways of operationalizing 
either a given concept (i.e., convergent validation) or 
two or more concepts that are presumed to be distinct 
(i.e., discriminant validation). Given that these scholars 
are probably working with a small N, they may be able 
to draw on their knowledge of cases to assess alterna- 
tive explanations for convergences and divergences 
among the sorting of cases yielded by different classi- 
fication procedures. In this way, they can make valu- 
able inferences about validity. Quantitative research- 
ers, by contrast, have other tools for making these 
inferences, to which we now turn. 


Convergent Validation and Structural Equation Models 
with Latent Variables. In quantitative research, an 
important means of responding to the limitations of 
simple correlational procedures for convergent/dis- 
criminant validation is offered by structural equation 
models with latent variables (also called LISREL-type 
models). Some treatments of such models, to the 
extent that they discuss measurement error, focus their 
attention on random error, that is, on reliability (Hay- 
duk 1987, e.g., 118-24; 1996).11 However, Bollen has 
made systematic error, which is the concern of the 


10 On the appropriate size of the correlation, see Bollen and Lennox 
1991, 305-7. 

11 To take a political science application, Green and Palmquist’s 
(1990) study also reflects this focus on random error. By contrast, 
Green (1991) goes farther by considering both random and system- 
atic error. Like the work by Bollen discussed below, these articles are 
an impressive demonstration of how LISREL-type models can 
incorporate a concern with measurement error into conventional 
statistical analysis, and how this can in turn lead to a major 
present article, a central focus in his major method- 
ological statement on this approach (1989, 190-206). 
He demonstrates, for example, its distinctive contribu- 
tion for a scholar concerned with convergent/discrimi- 
nant validation who is dealing with a data set with high 
correlations among alternative indicators. In this case, 
structural equations with latent variables can be used 
to estimate the degree to which these high correlations 
derive from shared systematic bias, rather than reflect 
the valid measurement of an underlying concept. 
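
The problem that such models address can be illustrated with simulated data. The sketch below does not estimate a structural equation model; it simply generates two indicators from a concept, a shared systematic bias, and random error, with weights chosen arbitrarily for illustration. Because the components are known by construction, the share of variance attributable to each can be read off directly, which is what a latent variable model would have to estimate from real data.

```python
import random
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


rng = random.Random(1)          # fixed seed so the sketch is repeatable
N_CASES = 5000

concept = [rng.gauss(0, 1) for _ in range(N_CASES)]      # what we want to measure
shared_bias = [rng.gauss(0, 1) for _ in range(N_CASES)]  # systematic error common to both


def make_indicator():
    """Indicator = concept + shared bias + random error (weights are assumptions)."""
    return [0.6 * c + 0.6 * b + 0.5 * rng.gauss(0, 1)
            for c, b in zip(concept, shared_bias)]


ind_1, ind_2 = make_indicator(), make_indicator()

observed_r = pearson(ind_1, ind_2)               # what the analyst sees
valid_share = pearson(ind_1, concept) ** 2       # variance due to the concept
bias_share = pearson(ind_1, shared_bias) ** 2    # variance due to shared bias

print(f"correlation between indicators: {observed_r:.2f}")
print(f"share of variance that is valid: {valid_share:.2f}")
print(f"share of variance that is bias:  {bias_share:.2f}")
```

In this simulation the two indicators are strongly correlated even though well under half of the variance in either one reflects the concept. The high correlation alone would therefore overstate validity, which is the situation described above.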

This approach is illustrated by Bollen (1993) and 
Bollen and Paxton’s (1998, 2000) evaluation of eight 
indicators of democracy taken from data sets devel- 
oped by Banks, Gastil, and Sussman. For each indi- 
cator, Bollen and Paxton estimate the percent of total 
variance that validly measures democracy, as opposed 
to reflecting systematic and random error. The sources 
of systematic error are then explored. Bollen and 
Paxton conclude, for example, that Gastil’s indicators 
have “conservative” bias, giving higher scores to coun- 
tries that are Catholic, that have traditional monar- 
chies, and that are not Marxist-Leninist (Bollen 1993, 
1221; Bollen and Paxton 2000, 73). This line of re- 
search is an outstanding example of the sophisticated 
use of convergent/discriminant validation to identify 
potential problems of political bias. 

In discussing Bollen’s treatment and application of 
structural equation models we would like to note both 
similarities, and a key contrast, in relation to the 
practice of qualitative researchers. Bollen certainly 
shares the concern with careful attention to concepts, 
and with knowledge of cases, that we have emphasized 
above, and that is characteristic of case-oriented con- 
tent validation as practiced by qualitative researchers. 
He insists that complex quantitative techniques cannot 
replace careful conceptual and theoretical reasoning; 
rather they presuppose it. Furthermore, “structural 
equation models are not very helpful if you have little 
idea about the subject matter” (Bollen 1989, vi; see also 
194). Qualitative researchers, carrying out a case-by- 
case assessment of the scores on different indicators, 
could of course reach some of the same conclusions 
about validity and political bias reached by Bollen. A 
structural equation approach, however, does offer a 


reevaluation of substantive findings, in this case concerning party 
identification (Green 1991, 67-71). 

12 Two points about structural equation models with latent variables 
should be underscored. First, as noted below, these models can also 


ments emphasis that the indicators that measure a given latent 
variable (i.e., concept) in these models are conventionally inter- 
preted as “effects” of this latent variable (Bollen 1989, 65; Bollen and 
Lennox 1991, 305-6). These effects, however, do not involve causal 


understand- 
ing of “explanation.” By contrast, the links between one latent 
variable and its indicators are productively understood as involving a 


13 See, for example, Banks 1979; Gastil 1988; Sussman 1982. 
fundamentally different procedure that allows scholars 
to assess carefully the magnitude and sources of mea- 
surement error for large numbers of cases and to 
summarize this assessment systematically and con- 
cisely. 


Nomological/Construct Validation 


Basic Question. In a domain of research in which a 
given causal hypothesis is reasonably well established, 
we ask: Is this hypothesis again confirmed when the 
cases are scored (level 4) with the proposed indicator 
(level 3) for a systematized concept (level 2) that is one 
of the variables in the hypothesis? Confirmation is 
treated as evidence for validity. 


Discussion. We should first reiterate that because the 
term “construct validity” has become synonymous in 
the psychometric literature with the broader notion of 
measurement validity, to reduce confusion we use 
Campbell’s term “nomological” validation for proce- 
dures that address this basic question. Yet, given 
common usage in political science, in headings and 
summary statements we call this nomological/construct 
validation. We also propose an acronym that vividly 
captures the underlying idea: AHEM validation; that 
is, “Assume the Hypothesis, Evaluate the Measure.” 

Nomological validation assesses the performance of 
indicators in relation to causal hypotheses in order to 
gain leverage in evaluating measurement validity. 
Whereas convergent validation focuses on multiple 
indicators of the same systematized concept, and dis- 
criminant validation focuses on indicators of different 
concepts that stand in a “descriptive” relation to one 
another, nomological validation focuses on indicators 
of different concepts that are understood to stand in an 
explanatory, “causal” relation with one another. Al- 
though these contrasts are not sharply presented in 
most definitions of nomological validation, they are 
essential in identifying this type as distinct from con- 
vergent/discriminant validation. In practice the con- 
trast between description and explanation depends on 
the researcher’s theoretical framework, but the distinc- 
tion is fundamental to the contemporary practice of 
political science. 

The underlying idea of nomological validation is that 
scores which can validly be claimed to measure a 
systematized concept should fit well-established expec- 
tations derived from causal hypotheses that involve this 
concept. The first step is to take as given a reasonably 
well-established causal hypothesis, one variable in 
which corresponds to the systematized concept of 
concern. The scholar then examines the association of 
the proposed indicator with indicators of the other 
concepts in the causal hypothesis. If the assessment 
produces an association that the causal hypothesis 
leads us to expect, then this is positive evidence for 
validity. 

Nomological validation provides additional leverage 
in assessing measurement validity. If other types of 
validation raise concerns about the validity of a given 
indicator and the scores it produces, then analysts 
probably do not need to employ nomological valida- 
tion. When other approaches yield positive evidence, 
however, then nomological validation is valuable in 
teasing out potentially important differences that may 
not be detected by other types of validation. Specifi- 
cally, alternative indicators of a systematized concept 
may be strongly correlated and yet perform very dif- 
ferently when employed in causal assessment. Bollen 
(1980, 383-4) shows this, for example, in his assess- 
ment of whether regime stability should be a compo- 
nent of measures of democracy. 


Examples of Nomological/Construct Validation. Lijp- 
hart’s (1996) analysis of democracy and conflict man- 
agement in India provides a qualitative example of 
nomological validation, which he uses to justify his 
classification of India as a consociational democracy. 
Lijphart first draws on his systematized concept of 
consociationalism to identify descriptive criteria for 
classifying any given case as consociational. He then 
uses nomological validation to further justify his scor- 
ing of India (pp. 262-4). Lijphart identifies a series of 
causal factors that he argues are routinely understood 
to produce consociational regimes, and he observes 
that these factors are present in India. Hence, classify- 
ing India as consociational is consistent with an estab- 
lished causal relationship, which reinforces the plausi- 
bility of his descriptive conclusion that India is a case of 
consociationalism. 

Another qualitative example of nomological valida- 
tion is found in a classic study in the tradition of 
comparative-historical analysis, Perry Anderson’s Lin- 
eages of the Absolutist State.14 Anderson (1974, 413-5) 
is concerned with whether it is appropriate to classify 
as “feudalism” the political and economic system that 
emerged in Japan beginning roughly in the fourteenth 
century, which would place Japan in the same analytic 
category as European feudalism. His argument is partly 
descriptive, in that he asserts that “the fundamental 
resemblance between the two historical configurations 
as a whole [is] unmistakable” (p. 414). He validates his 
classification by observing that Japan’s subsequent 
development, like that of postfeudal Europe, followed 
an economic trajectory that his theory explains as the 
historical legacy of a feudal state: “The basic parallel- 
ism of the two great experiences of feudalism, at the 
opposite ends of Eurasia, was ultimately to receive its 
most arresting confirmation of all, in the posterior 
destiny of each zone” (p. 414). Thus, he uses evidence 
concerning an expected explanatory relationship to 
increase confidence in his descriptive characterization 
of Japan as feudal. Anderson, like Lijphart, thus fol- 
lows the two-step procedure of making a descriptive 
claim about one or two cases, and then offering evi- 
dence for the validity of this claim by observing that it 
is consistent with an explanatory claim in which he has 
confidence. 

A quantitative example of nomological validation is 
found in Elkins’s evaluation of the argument that de- 
mocracy versus nondemocracy should be treated as a 
dichotomy, rather than in terms of gradations. One 


14 Sebastian Mazzuca suggested this example. 
potential defense of a dichotomous measure is based 
on convergent validation. Thus, Alvarez and colleagues 
(1996, 21) show that their dichotomous measure is 
strongly associated with graded measures of democ- 
racy. Elkins (2000, 294-6) goes on to apply nomolog- 
ical validation, exploring whether, notwithstanding this 
association, the choice of a dichotomous measure 
makes a difference for causal assessment. He compares 
tests of the democratic peace hypothesis using both 
dichotomous and graded measures. According to the 
hypothesis, democracies are in general as conflict 
prone as nondemocracies but do not fight one another. 
The key finding from the standpoint of nomological 
validation is that this claim is strongly supported using 
a graded measure, whereas there is no statistically 
significant support using the dichotomous measure. 
These findings give nomological evidence for the 
greater validity of the graded measure, because they 
better fit the overall expectations of the accepted 
causal hypothesis. Elkins’s approach is certainly more 
complex than the two-step procedure followed by 
Lijphart and Anderson, but the basic idea is the same. 
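
The logic of comparing two measures against an accepted hypothesis can be sketched with simulated data. The example below is not a replication of Elkins (2000): the scores, the outcome variable, and the cutoff used to dichotomize are all assumptions chosen for illustration, and the simulation builds in a graded relationship by construction. It shows only the mechanics of asking which of two indicators better fits a causal expectation that is taken as given.

```python
import random
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


rng = random.Random(42)         # fixed seed so the sketch is repeatable
N_CASES = 5000

# Graded measure on a hypothetical 0-10 scale.
graded = [rng.uniform(0, 10) for _ in range(N_CASES)]

# Dichotomous measure derived from the same scores (cutoff of 5 is an assumption).
dichotomous = [1 if g >= 5 else 0 for g in graded]

# Outcome generated under the "accepted" hypothesis: more of the concept,
# less of the outcome, in a graded fashion, plus random noise.
outcome = [-0.3 * g + rng.gauss(0, 1) for g in graded]

fit_graded = abs(pearson(graded, outcome))
fit_dichotomous = abs(pearson(dichotomous, outcome))

print(f"fit with graded measure:      {fit_graded:.2f}")
print(f"fit with dichotomous measure: {fit_dichotomous:.2f}")
```

Here the two measures are strongly associated with each other, yet the graded measure fits the assumed hypothesis more closely. In real research the hypothesis is not known to be true, so a difference of this kind is evidence relevant to choosing between indicators rather than proof.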


Skepticism about Nomological Validation. Many schol- 
ars are skeptical about nomological validation. One 
concern is the potential problem of circularity. If one 
assumes the hypothesis in order to validate the indica- 
tor, then the indicator cannot be used to evaluate the 
same hypothesis. Hence, it is important to specify that 
any subsequent hypothesis-testing should involve hy- 
potheses different from those used in nomological 
validation. 

A second concern is that, in addition to taking the 
hypothesis as given, nomological validation also pre- 
supposes the valid measurement of the other system- 
atized concept involved in the hypothesis. Bollen 
(1989, 188-90) notes that problems in the measure- 
ment of the second indicator can undermine this 
approach to assessing validity, especially when scholars 
rely on simple correlational procedures. Obviously, 
researchers need evidence about the validity of the 
second indicator. Structural equation models with la- 
tent variables offer a quantitative approach to address- 
ing such difficulties because, in addition to evaluating 
the hypothesis, these models can be specified so as to 
provide an estimate of the validity of the second 
indicator. In small-N, qualitative analysis, the re- 
searcher has the resource of detailed case knowledge 
to help evaluate this second indicator. Thus, both 
qualitative and quantitative researchers have a means 
for making inferences about whether this important 
presupposition of nomological validation is indeed 
met. 

A third problem is that, in many domains in which 
political scientists work, there may not be a sufficiently 
well-established hypothesis to make this a viable ap- 
proach to validation. In such domains, it may be 
plausible to assume the measure and evaluate the 
hypothesis, but not the other way around. Nomological 
validation therefore simply may not be viable. Yet, it is 
helpful to recognize that nomological validation need 
not be restricted to a dichotomous understanding in 
which the hypothesis either is or is not reconfirmed, 
using the proposed indicator. Rather, nomological 
validation may focus, as it does in Elkins (2000; see also 
Hill, Hanna, and Shafqat 1997), on comparing two 
different indicators of the same systematized concept, 
and on asking which better fits causal expectations. A 
tentative hypothesis may not provide an adequate 
standard for rejecting claims of measurement validity 
outright, but it may serve as a point of reference for 
comparing the performance of two indicators and 
thereby gaining evidence relevant to choosing between 
them. 

Another response to the concern that causal hypoth- 
eses may be too tentative a ground for measurement 
validation is to recognize that neither measurement 
claims nor causal claims are inherently more epistemo- 
logically secure. Both types of claims should be seen as 
falsifiable hypotheses. To take a causal hypothesis as 
given for the sake of measurement validation is not to 
say that the hypothesis is set in stone. It may be subject 
to critical assessment at a later point. Campbell (1977/ 
1988, 477) expresses this point metaphorically: “We are 
like sailors who must repair a rotting ship at sea. We 
trust the great bulk of the timbers while we replace a 
particularly weak plank. Each of the timbers we now 
trust we may in turn replace. The proportion of the 
planks we are replacing to those we treat as sound must 
always be small.” 


CONCLUSION 


In conclusion, we return to the four underlying issues 
that frame our discussion. First, we have offered a new 
account of different types of validation. We have 
viewed these types in the framework of a unified 
conception of measurement validity. None of the spe- 
cific types of validation alone establishes validity; 
rather, each provides one kind of evidence to be 
integrated into an overall process of assessment. Con- 
tent validation makes the indispensable contribution of 
assessing what we call the adequacy of content of 
indicators. Convergent/discriminant validation—taking 
as a baseline descriptive understandings of the rela- 
tionship among concepts, and of their relation to 
indicators—focuses on shared and nonshared variance 
among indicators that the scholar is evaluating. This 
approach uses empirical evidence to supplement and 
temper content validation. Nomological/construct val- 
idation—taking as a baseline an established causal 
hypothesis—adds a further tool that can tease out 
additional facets of measurement validity not ad- 
dressed by convergent/discriminant validation. 

We are convinced that it is useful to carefully 
differentiate these types. It helps to overcome the 
confusion deriving from the proliferation of distinct 
types of validation, and also of terms for these types. 
Furthermore, in relation to methods such as structural 
equation models with latent variables—which provide 
sophisticated tools for simultaneously evaluating both 
measurement validity and explanatory hypotheses— 
the delineation of types serves as a useful reminder that 
validation is a multifaceted process. Even with these 
models, this process must also incorporate the careful 
use of content validation, as Bollen emphasizes. 

Second, we have encouraged scholars to distinguish 
between issues of measurement validity and broader 
conceptual disputes. Building on the contrast between 
the background concept and the systematized concept 
(Figure 1), we have explored how validity issues and 
conceptual issues can be separated. We believe that 
this separation is essential if scholars are to give a 
consistent focus to the idea of measurement validity, 
and particularly to the practice of content validation. 

Third, we examined alternative procedures for 
adapting operationalization to specific contexts: con- 
text-specific domains of observation, context-specific 
indicators, and adjusted common indicators. These 
procedures make it easier to take a middle position 
between universalizing and particularizing tendencies. 
Yet, we also emphasize that the decision to pursue 
context-specific approaches should be carefully consid- 
ered and justified. 

Fourth, we have presented an understanding of 
measurement validation that can plausibly be applied 
in both quantitative and qualitative research. Although 
most discussions of validation focus on quantitative 
research, we have formulated each type in terms of 
basic questions intended to clarify the relevance to 
both quantitative and qualitative analysis. We have also 
given examples of how these questions can be ad- 
dressed by scholars from within both traditions. These 
examples also illustrate, however, that while they may 
be addressing the same questions, quantitative and 
qualitative scholars often employ different tools in 
finding answers. 

Within this framework, qualitative and quantitative 
researchers can learn from these differences. Qualita- 
tive researchers could benefit from self-consciously 
applying the validation procedures that to some degree 
they may already be employing implicitly and, in par- 
ticular, from developing and comparing alternative 
indicators of a given systematized concept. They should 
also recognize that nomological validation can be 
important in qualitative research, as illustrated by the 
Lijphart and Anderson examples above. Quantitative 
researchers, in turn, could benefit from more fre- 
quently supplementing other tools for validation by 
employing a case-oriented approach, using the close 
examination of specific cases to identify threats to 
measurement validity. 

and foreign policies framed outside the language of justice. His deeper political-philosophical aim
was to explore the relationship between nomos (convention) and phusis (nature) and its implications
for civilization. Thucydides concludes that nomos constructs identities and channels and restrains the
behavior of individuals and societies. Speech and reason (logos) in turn make nomos possible because all
conventions depend on shared meanings. The feedback loop between logoi (words) and ergoi (deeds)
created Greek civilization but also the international and civil strife (stasis) associated with the Peloponnesian
War. International security and civil order depend upon recovering the meanings of words and the
conventions they enable. Thucydides should properly be considered a constructivist.


Movements establish genealogies to legitimize
themselves. To make Christianity more attractive
to Jews, the New Testament traces Jesus’s
lineage to King David. Realists claim Thucydides as 
their forebear. In recent years, a number of interna- 
tional relations scholars have offered more subtle read- 
ings of his history that suggest realism is only one facet 
of his work.1 I make a more radical assertion: Thucy- 
dides is a founding father of constructivism. The un- 
derlying purpose of his history was to explore the 
relationship between nomos (convention, custom, law) 
and phusis (nature) and its implications for the development
and preservation of civilization.2 His work
shows not only how language and convention establish 
identities and enable power to be translated into 
influence but also how the exercise of power can 
undermine language and convention. Thucydides’ un- 
derstanding of these relationships was insightful and 
points to the possibility, indeed the necessity, of a 
symbiotic and productive partnership between two 
currently antagonistic research traditions. 


REALISTS AND THEIR CRITICS 


Since the time of Thomas Hobbes, Thucydides has 
been celebrated as a realist, as someone who stripped 
away all moral pretenses to expose the calculations of 
power and advantage that of necessity motivate successful
political actors (Bury 1975; de Ste. Croix 1972;
Kagan 1969; Meiggs 1972). Neorealists assert that his 
history vindicates their emphasis on the system level 
and contains implicit propositions about power transi- 
tion and the onset of hegemonic war as well as the 
inability of norms and conventions to keep the peace 
under conditions of international anarchy (Gilpin 1986; 
Waltz 1979). Other realists, most notably Michael 
Doyle (1997), offer more nuanced readings that at- 
tempt to understand Thucydides in the context in 
which he wrote. A growing number of scholars chal- 
lenge the claims of neorealists, and some question 
whether Thucydides is adequately characterized as a 
realist.

Detailed analysis of Thucydides’ history in the mid- 
nineteenth century called into question its consistency 
and unity. This research gave rise to the Thucydides- 
frage, a controversy about how many distinct parts 
there are to the history, the order in which they were 
written, and what this reveals about the evolution of 
the author’s thinking over approximately two decades 
of research and writing. Thucydides was considered a 
coldly detached and dispassionate rationalist, a scien- 
tist in the tradition of Hippocrates, in search of an 
“objective” and timeless understanding of politics and 
war. Because ordered thought and presentation are 
absolutely essential to such an enterprise, scholars 
assumed that Thucydides would have “cleaned up” his 
manuscript to remove all the inconsistencies if he had 
lived long enough. 

The postwar attack on positivism in social sciences 
and history encouraged a rethinking of Thucydides. 
Wallace (1964), Bowersock (1965), and Stahl (1966) 
made the case for a passionate and politically engaged 
writer who can be considered a critic of the scientific 
approach to history. Connor’s Thucydides (1984) rep- 
resents a dramatic break with the past in that it 
attempts to restore a “unitarian” reading of the history. 
To Connor, Thucydides is a masterful postmodernist 
who carefully structures his text to evoke an intended 
set of responses. He uses omissions, repetitions, and 
inconsistencies in the form of arguments and judg- 
ments that are “modified, restated, subverted, or totally 
controverted” (p. 18) to tell a more complex story and 
convey a more profound understanding of the human 
condition. Ultimately, Connor (pp. 15-8) argues, “the 
work leads the reader—ancient or modern—far beyond
the views and values it seems initially to utilize
and affirm.” 

Thucydides’ careful attention to language is the 
starting point of another seminal study, When Words 
Lose Their Meaning, by James Boyd White (1984). 
According to White, people act in the world by using
the language of the world. To understand their behav- 
ior and the social context that enables it, we need to 
track the ways in which words acquire, hold, or lose 
meanings and how new meanings arise and spread. 
White contends that Thucydides recognized this truth, 
and his conception of meaning transcends the lexical to 
encompass understandings of self, manners, conduct, 
and sentiment. Changes in meaning involve reciprocal 
interactions between behavior and language, which are 
tracked by Thucydides in his speeches, debates, and 
dialogues. As the Peloponnesian War progresses, the 
terms of discourse that function at the outset in 
intelligible ways shift and change, and the language and 
community (homonoia) constituted by it deteriorate 
into incoherence. 

When the Athenians can no longer use the tradi- 
tional language of justification for their foreign policy, 
they struggle to find an alternate language, and they 
finally resort to assertions of pure self-interest backed 
by military clout. Such a language is not rooted in 
ideas, is unstable, and deprives its speakers of their 
culture and identities. By using it, the Athenians de- 
stroy the distinctions among friend, colony, ally, neu- 
tral, and enemy and make the world their enemy 
through a policy of limitless expansion. In effect, they 
abandon the culture through which self-interest can 
intelligently be defined, expressed, and bounded. By 
the time of the Sicilian debate, the Athenians can no 
longer speak and act coherently, and this failure is the 
underlying reason for their empire’s decline. For 
Thucydides and for White, the history of the Athenian 
empire not only indicates the tension between justice 
and self-interest but also reveals that they validate and 
give meaning to each other. 

Garst (1989) relies on White’s arguments to accuse 
neorealists of having a narrow definition of power and 
of unfairly projecting it onto Thucydides. Thucydides 
shows that Athenian imperialism was successful when 
power was exercised in accord with well-defined social 
conventions governing Greek speech and behavior. 
These conventions are ignored as the war progresses. 
The Melian Dialogue and the Sicilian debate reveal 
how the Athenians destroyed the rhetorical culture 
through which their interests as an imperial power 
were intelligently formulated and expressed. Their 
foreign policy became a policy of coercion and limitless 
expansion. For Garst, this process illustrates the power
of agency and reveals that foreign policy is rarely, if
ever, a mechanical response to a balance of power.

For Crane (1998), Thucydides’ history is a realist 
classic because it reveals how the strong dominate the 
weak and interests trump justice. But Thucydides con- 
sidered such behavior a fundamental departure from 
traditional Greek practice, in which foreign policy was 
an extension of aristocratic family connections and 
enmeshed leaders and their poleis in a web of mutual
obligations. The Corinthian plea to the Athenian as- 
sembly not to ally with Corcyra, based on Corinth’s 
prior restraint during the Samian rebellion, reflects this 
approach and uses the time-honored language and 
arguments of reciprocity. The Athenians reject the 
appeal because they formulate their interests and 
foreign policy on the basis of immediate interests. They 
act as if alliances are market transactions: short-term 
exchanges unaffected by past dealings. Thucydides 
considered this approach to politics destructive of the 
relationships that are the true source of security and 
prosperity. Pericles, who speaks for Thucydides 
(2.60.2-4) on this question in his funeral oration,
insists that the individual is nothing without the state, 
but at the time of the Sicilian debate Alcibiades asserts 
that the state counts for nothing if it does not support 
him as an individual (6.92.2-5). The single-minded 
focus on self-interest was the underlying cause of 
discord at home and reckless expansionism abroad. 
Crane believes that Thucydides’ goal was to reconsti- 
tute the “ancient simplicity” (euethés) of the aristocracy 
in a new, rationalized form. 

Rahe (1996) also acknowledges two sides to Thucy- 
dides: the hard-headed analyst of power politics and 
the critic of realism. Thucydides’ portrayal of
post-Periclean Athens shows how lust (erōs) for power
ultimately made prudent calculation of advantage and 
calibration of means and ends impossible. The Melian 
Dialogue and the debate over the Sicilian expedition 
indicate that Athenians had lost all sense of measure 
and proportion; they had become impervious to rea- 
soned argument and therefore to the risks inherent in 
their initiatives. Thucydides wants readers to recognize 
that without moral boundaries human beings develop 
unlimited ambitions. The sober construction of self- 
interest requires restraint, which in turn requires ac- 
ceptance and internalization of the claims of justice 
and human decency. 

Forde (1989, 1992) and Orwin (1994) approach 
Thucydides from a more Straussian perspective. Forde 
criticizes neorealists for ignoring justice, a concern that 
was central to such early postwar realists as Hans
Morgenthau and John Herz. He contends that Thucy- 
dides, like Plato, recognized the possibility of reconcil- 
ing justice and interest through the citizen’s love for 
and identification with his polis—the principal theme 
of Pericles’ funeral oration. In post-Periclean Athens, 
citizens put their self-interest first, and this led to acute 
discord, domestic instability, and defeat. For Orwin, 
Thucydides paints an “unflinching” portrait of the 
harshness and even brutality of the time but with the 
goal of showing how human beings, through their 
“humanity,” can transcend both the security dilemma 
and crippling domestic discord. To do this they must 
take justice seriously. 

Ober (1989, 1998) blends the traditions of classical 
and international relations scholarship. He invokes 
Austin’s (1975) conception of performative speech acts 
and Searle’s (1995) distinction between brute and
social facts to analyze Athenian politics (Ober 1998). 
He argues that Searle’s all-important distinction between
social and brute facts becomes blurred in the
context of the awesome power wielded by the Athenian 
assembly. Debates and decisions became “social facts” 
because successful orators imposed their own speech- 
dependent meanings on brute facts. As brute facts and 
social meanings diverged, the latter became the basis of 
policy, and this led to disaster. In this conflict between 
words (logoi) and deeds (erga), Ober contends that 
Thucydides sides with the latter. The history attempts
to reconstruct erga through the application of scientific 
principles of data collection and evaluation (techné) to 
the past, and by doing so it points the way to a similar 
process in everyday politics. 

My analysis builds on these works but differs from 
them in important respects. I take issue with some of 
their interpretations or reach the same conclusion by 
different routes. My main difference with my political 
science and classical colleagues concerns the purpose 
of the history; I contend it is about the rise and fall of
civilization and what might be done to salvage it.3 My 
analysis builds on Connor’s insight that the structure of 
Thucydides’ text provides clues for reconciling some of 
his seeming inconsistencies. Toward this end, I identify 
four layers to the history: (1) the nature and relation- 
ships among power, interest, and justice; (2) Athens as 
a tragedy; (3) the relationship between nomos (con- 
vention, custom and law) and phusis (nature); and (4) 
the relationship between erga and logoi and its implications
for civilization. Each layer addresses a different
question, and the successive answers can be read back 
to provide a deeper understanding of the questions 
posed by previous layers. For Connor, omissions, rep- 
etitions, inconsistencies, and subverted sentiments and 
arguments are intended to move readers to deeper 
understandings. I see them playing this role within 
levels, and I argue that Thucydides offers the structure
of his narrative, choice of language, and implicit refer- 
ences to other fifth-century texts—Herodotus’ History, 
the Hippocratic corpus, and the tragedies of Aeschylus, 
Sophocles, and Euripides—as “signs” (sémata) to move
us from one level of the text to the next. 

There are sound historical and textual reasons for 
reading Thucydides this way. Fifth-century sophists 
considered themselves teachers and intended their 
works or oral presentations as courses of study. They 
opened with the statement of a problem and simple 
responses to it and went on to develop increasingly 
complex and sophisticated arguments that often undercut
their initial argument. At the deepest levels, their
arguments were left implicit to encourage students to 
draw the intended conclusions for themselves. Sophists 
dominated Athenian philosophy during the second half 
of the fifth century and had considerable political 


2 We must distinguish between Greek civilization and civilizations 
more generally. Thucydides certainly had in mind the restoration of 
civil society and international order in Athens and Greece. Did he 
influence. Pericles himself was their principal patron.
Sophists were subversive of the old aristocratic order in 
the deepest sense, for they maintained that areté (ex- 
cellence, especially the kind that made a man a re- 
spected leader) could be acquired through study, not 
just through heredity and lifelong association with men 
of good breeding. Thucydides rejected some Sophist
teachings—he was undoubtedly troubled by the social 
consequences of Sophist ridicule of objective standards 
of justice. But he was greatly attracted to their style of 
argument, which he adopted for his own and quite 
different purposes. 

In his treatment of the origins of the Peloponnesian 
War, Thucydides provides a striking example of his use 
of the sophistic method. At the onset (1.23.5-6) he
attributes the war to “the growth of the power of 
Athens, and the alarm which this inspired in Sparta, 
made war inevitable.” He goes on to describe Athens 
and Sparta making their respective cases before the 
court of public opinion. By his use of the word propha- 
sis, which was widely used before the law courts as a 
rationalization for suits, Thucydides signals to more 
sophisticated readers that charge and countercharge 
are little more than propaganda that obscures the real 
causes of the war (Rawlings 1981). The subsequent 
narrative and paired speeches of Book I describe the 
deeper causes: Sparta’s fear for its way of life, which is 
threatened by the political, economic, and cultural 
transformation of Greece spearheaded by Athens; the 
ability of third parties to manipulate Sparta for their 
own parochial interests; and the miscalculation of
leaders throughout Greece at critical junctures of the 
crisis (Lebow 1991, 1996). 

Thucydides requires a dedicated and thoughtful au- 
dience. Readers must be willing to recognize multiple 
levels of analysis as well as the questions and argu- 
ments specific to these levels, and they must ponder the 
implications of any apparent contradictions. The history
cannot be read in a linear manner; one must move
back and forth between sections of the text to grasp the 
contrasts and ironies embedded in structure and language
and the ways in which different contexts and
orders of presentation encode insights and interpreta- 
tions. Not all inconsistencies can be resolved in this 
way, and those that remain are intended to draw 
attention to tensions inherent in the situation and the 
possibility of a deeper truth that helps reconcile them. 
Heraclitus taught that the world is a battleground 
between opposing forces and that philosophers must 
look beneath the surface to find the deeper unity 
(harmonia) that unites them. Thucydides, as did Plato, 
thought and wrote in this binary tradition. 


POWER, INTEREST, AND JUSTICE 


Almost all the works I have discussed address ques- 
tions of interest and justice in the history. There is a 
near consensus that Thucydides’ depiction of the so-called
realism of the Athenians does not reflect his own
views. Justice must be considered because it provides 
the language for any reasonable formulation of inter- 
est. Otherwise, interests are equated with power and 
result in policies of aggrandizement. White (1984),
Garst (1989), Forde (1992), Orwin (1994), Rahe 
(1996), and Crane (1998) develop this thesis from the 
“inside out” perspective of Athenians attempting to 
manage, protect, and expand their empire. Thucydides 
is also interested in the “outside in” perspective: how 
allies, enemies, and neutrals respond to Athens and its 
policies. His work documents not only the process by 
which Athens succumbed to a foreign policy of limitless 
expansion but also the reasons such a policy was bound 
to fail.

As noted elsewhere (Lebow and Kelly n.d.),


Thucydides distinguished between hegemonia and arkhé, 
both of which are most frequently translated as hegemony. 
For fifth- and fourth-century Greeks, hegemonia was associated
with timé—the gift of honor [Meiggs 1972; Perlman
1991]. Timé was bestowed informally by free consent of the 
Greek community as reward for achievements, and re- 
tained by consent, not by force. Sparta and Athens were so 
honored because of their contributions during the Persian 
Wars. Athens also earned timé because her intellectual
and artistic accomplishments made her the “school of 
Hellas.” Arkhé connoted something akin to our notion of 
political control, and initially applied to authority within a 
city state and only later to rule or influence over city states. 


The semantic field of arché was gradually extended to
encompass tyranny.

By 416, when the assembly voted to occupy Melos
and subdue Sicily, Thucydides makes it clear that the 
Athenian empire was an arché based primarily on 
military might. The structure and language of the 
Melian Dialogue mark a radical break with past prac- 
tice. The Melians deny the Athenian envoys access to 
the people, granting only a private audience with the 
magistrates and the few (oligoi). The exchange consists
of brachylogies: short, blunt, alternating verbal thrusts, 
suggestive of a military encounter. The Athenians 
dispense with all pretense. They acknowledge they 
cannot justify their invasion on the basis of provoca- 
tions or their right to rule. They deny the relevance of 
justice, which only comes into play between equals. 
“The strong do what they can, and the weak suffer what 
they must,” and the Melians should put their survival 
first and submit (Thucydides, 5.89). The Melians warn 
that the Athenian empire will not last forever, and if 
the Athenians violate the established norms of justice 
and decency their fall “would be a signal for the 
heaviest vengeance and an example for the world to 
meditate upon” (5.90). The Athenians insist they are
only concerned with the present and the preservation 
of their empire. The Melians suggest it is in their 
mutual interest for Melos to remain neutral and a 
friend of Athens. The Athenians explain that neutrality 
would be interpreted as a sign of weakness by other 
island states “smarting under the yoke” (5.99) and 
would serve as a stimulus to rebellion. “The fact that 
you are islanders and weaker than others renders it all 
the more important that you should not succeed in 
baffling the masters of the sea” (5.91-9). Contempo- 
rary Greeks would have been shocked by the failure of 
Athens to offer any justification (prophasis) for its
invasion of Melos and by its repudiation of the Melian 
offer of neutrality on the grounds that “your [Melian]
hostility cannot so hurt us as your friendship” (5.95). 
Fifteen years into the war the Athenians repudiate, 
indeed invert, core Greek values. 

The rhetorical style of the envoys reinforces the 
impression conveyed by their words. Dionysius of 
Halicarnassus (1975, 31) considered their language 
“appropriate to oriental monarchs addressing Greeks, 
but unfit to be spoken by Athenians to Greeks whom 
they liberated from the Medes.” Thucydides seems to 
have modeled his dialogue on a passage in Herodotus 
(7.8), in which the Persian king Xerxes discusses with 
his council of advisors the wisdom of attacking Greece 
(Connor 1984; Cornford 1907). The language is similar,
and the arguments run parallel; Xerxes alludes to
the law of the stronger and the self-interest of empires. 
Herodotus (8.140, 144) also describes an offer of peace 
and friendship that Xerxes made to Athens and Sparta 
on the eve of his invasion. The Athenians spurn his 
olive branch and accept the danger of confronting a 
seemingly invincible force in the name of Greek free- 
dom and cultural identity, just as the Melians reject an 
Athenian offer of alliance because of the value they put 
on their freedom. These parallels would not have been 
lost on contemporaries. For Thucydides, as for many 
Greeks, the Athenians of 416 have become the Persians
of 480, the symbol of rank despotism in the Greek
world. 

The Melians offer a long view on the fate of empires. 
The Athenians focus on the immediate future, and in 
their pursuit of short-term gain alienate allies and dry 
up whatever reservoir of good will their early heroic 
behavior had created. By the time of the Melian 
Dialogue, they have antagonized even neutrals and 
close allies, which makes their fear of the security 
dilemma self-fulfilling. Thucydides tells us through the 
voice of the Melians that raw force can impose its will 
at any given moment, but few empires have the military 
and economic capability to repress their subjects indef- 
initely. Allies who see themselves as exploited will 
sever the bonds when the opportunity arises. Oppres- 
sion also leaves memories that inhibit future attempts 
at empire building. In 378, when Athens tried to form 
the Second Athenian Confederacy, most of Greece 
resisted. Hegemonia is an essential precondition of 
sustainable empire. 

Realists define the national interest in terms of 
power. Many regard international law and associated 
norms as impediments to state interests unless they 
provide a rhetorical cover for policies whose real 
purpose it is to maximize power and influence. Thucy- 
dides opposed such a narrow view of state interests. 
Pericles was praiseworthy because he made foreign 
policy responsive to his vision of long-term Athenian 
interests, and he used his personal standing and rhe- 
torical skills to win popular support for these policies. 
The demagogues who followed him were at best suc- 
cessful tacticians. They advocated foreign policies they 
expected to be popular with the masses (ho homilos) 
and were more interested in their own fortunes than 
those of their polis. Pericles understood that the overriding
interest of Athens was preservation of the
empire, and this required both naval power and legitimacy.
To maintain the latter, Athens had to act in
accord with the principles and values that had earned 
hegemonia, and it had to offer positive political and 
economic benefits to allies. Because post-Periclean 
Athens consistently chose power over principle, it 
alienated allies and third parties, lost hegemonia, and 
weakened its power base. The Melian Dialogue and the 
Sicilian expedition are pathological departures from 
rational self-interest. 


ATHENS AS TRAGEDY 


Fifth-century tragedies dramatized the lives of individ- 
uals to convey insights into human beings and their 
societies as well as critically examine or reaffirm fun- 
damental values of the community. Cornford (1907) 
and Euben (1990), among others, have discussed 
Thucydides’ relationship to tragedy and the structural 
similarities between his history and the tragedies. Alker 
(1988, 1996) contends that the history might be read as 
the tragedy of the empire’s rise and fall and the Melian 
Dialogue as a “morality play” about might and right.
Bedford and Workman (2001) suggest that Thucydides 
adopted the tragic form to develop his critique of 
Athenian foreign policy. I believe he wanted readers to 
experience his history as a tragedy and to move from 
emotional involvement with the story to contemplation 
of its general lessons, just as they might with a theat- 
rical production. 

In his only statement about his intent, Thucydides 
(1.22) offers his history as “an aid for the interpretation 
of the future, which in the course of human things must 
resemble it if it does not reflect it.” The cyclical pattern 
he has in mind is not just about the growth and decline 
of empires but, more generally, how success spawns 
excessive ambition, overconfidence, and self-destruc- 
tive behavior. 

The Greek literary tradition was largely an oral one, 
and Herodotus, author of the first long historical 
narrative, was paid to read sections of it aloud (Luce 
1997). His words are chosen with their sounds in mind, 
and his style, lexis eiromené (literally, speech strung 
together), is related to epic poetry. He introduces an 
idea or action, defines it by approaching it from 
different perspectives, and expands its meaning 
through the apposition of words, phrases, and clauses. 
Opinion is divided about Thucydides, who wrote at a 
time when the oral tradition was declining (Havelock 
1963; Lain Entralgo 1970). Thucydides can be appre- 
ciated if read aloud, but it would be difficult to grasp 
deeper layers of meaning. His text is written in a 
complex and idiosyncratic style that requires careful 
analysis to discover and work through its purpose. 
Thucydides makes extensive use of parallels in setting, 
structure, and language with other passages in his work 
and those of other writers. He intended his history to 
be read and studied. 

The embedding of oral forms in a literary text is 
common to Herodotus, Thucydides, and Plato. In The 
Iliad, from which so much of this tradition derives, 
paired and group speeches are as important as narra- 


tive and mark critical moments of decision and turning 
points. The speeches are also vehicles for moving 
thematically toward greater depth, compassion, and 
ethical sophistication. In Thucydides, the speeches 
highlight critical junctures, sometimes suggest their 
contingency, but always examine opposing courses of 
action and the justifications provided for them. They 
also track the progression—really the descent—of 
Greece from relatively secure societies bound together 
by convention, obligation, and interests to a condition 
of disorder and even anarchy, a transformation to 
which I shall return. 

Another commonality in the Greek literary tradition 
is the use of heroes to provide continuity and structure 
to the text. Modern writers on the origins, course, and 
consequences of wars frequently acknowledge the 
prominent role of key actors, but they almost always 
provide some kind of general, sociological framework 
to understand and assess the decisions and behavior of 
these people (Herwig 1997; Murray and Millet 2000;
Weinberg 1994). Herodotus and Thucydides do the 
reverse; they rely on the words, actions, and fate of 
heroes to move the narrative along and give it meaning. 
Herodotus uses the story of Croesus to set up the 
central saga of Xerxes. Solon warns Croesus to recog- 
nize his limits and restrain his ambitions, and Xerxes 
receives similar advice from Artabanus. Both men 
nevertheless embark upon ambitious military ventures 
that end in catastrophe. Early in Book One, Thucy- 
dides (1.9-11) uses the story of Agamemnon and the 
Trojan War—in which an alliance held together by 
naval power confronts a major land power—to provide 
an overview of what will follow. Elsewhere in the 
history, the stories of individuals and cities prefigure 
the fate of more important personages and major 
powers, especially Athens. 

There is a more fundamental difference in the way 
ancient Greek and modern historians approach heroes.
Most contemporary works dwell on the particular mix 
of background, personal qualities, and experience that 
make people distinct as individuals. They do this even 
when these figures are intended to be emblematic of a 
class, movement, or set of shared life experiences. 
Herodotus and Thucydides hardly ever take note of 
idiosyncratic attributes; like the authors of epic poetry 
and drama, they are interested in using individuals to 
create archetypes. They stress the qualities, especially 
strengths and weaknesses, their heroes share with 
other heroes. The typicality, not the uniqueness, of 
actors and situations is a central convention of fifth- 
century poetry, tragedy, and prose. Even Pericles, 
whom Thucydides offers as the model of a modern man 
of politics, is a stereotype. He is the sum of qualities 
that make him an ideal leader in a transitional democ- 
racy and a benchmark for his successors. All subse- 
quent leaders possess different combinations of some 
of his qualities but never all of them—to the detriment 
of Athens. Nicias displays honesty and dedication but 
lacks the skill and stature to dissuade the assembly 
from undertaking the Sicilian expedition. Alcibiades 
has intelligence and rhetorical skill but uses them to 
advance his career at the expense of his city. 


Thucydides the Constructivist
September 2001


Greek tragedies consist of archetypical characters
who confront archetypical situations. The tragic hero, 
like his Homeric predecessor, is a self-centered, nar- 
cissistic figure who revels in his own importance and 
comes to believe that he is not bound by the laws and 
conventions of man. These manifestations of ego and 
their consequences are often explored through a stan- 
dard plot line: Success carries with it the seeds of 
failure. Success intoxicates heroes; it encourages them 
to form inflated opinions of themselves and their 
abilities and to trust in hope (elpis) rather than reason. 
It makes them susceptible to all kinds of adventures in 
which reason would dictate caution and restraint. The 
Greeks used the word até to describe the aporia this 
kind of seduction induces and associated it with hamar- 
tia (missing the mark). Hamartia leads the hero to 
catastrophe by provoking the nemesis (wrath) of the gods.

Herodotus frames his treatment of Croesus and 
Xerxes in terms of this progression (Beye 1987). Intox- 
icated by his riches, Croesus misinterprets the oracle 
who tells him that a great empire will be destroyed if he 
invades Persia. He is defeated and only saved from 
being burned at the stake by the mercy of his adversary. 
Xerxes is an ambitious but cautious leader who accu- 
mulates enormous power. His exaltation and pride 
nevertheless grow in proportion to his success, and até 
makes him vulnerable to hamartia. At first, he resists 
Mardonius’ suggestion to exploit the revolt of the 
Ionians to invade Greece and add Europe to his 
empire. Subsequent dreams change his mind and lead 
him to a fatal error of judgment. His sense of omnip- 
otence leads him to attempt to punish the Hellespont 
for washing away his bridge across it in a storm. 
Nemesis at Salamis is inevitable, and from the perspec- 
tive of Herodotus and Greek tragedy, the destruction 
of the Persian fleet and, later, army represents less a 
triumph of the Greeks than a failure of Xerxes. 

Thucydides begins where Herodotus leaves off and 
shifts the locus of the narrative from Persia to Greece. 
The Athenians, the principal agents of Xerxes’ neme- 
sis, repeat the cycle of success, overconfidence, miscal- 
culation, and catastrophe. Indeed, the Athenian victory 
over Xerxes at Salamis, which marks the emergence of 
Athens as a military power, sets the cycle in motion. 
Athens achieves a string of victories until ambition and 
overconfidence lead to military and political disasters: 
the complete annihilation in 454 of the expedition to 
Egypt, the revolt of Erythrae and Miletus in 452, and 
the defeat at Coronea in central Greece in 446 (Thucy- 
dides 1.104, 109-10). These setbacks temporarily com- 
pel Athenians to recognize the limits of their power. In 
449 they make peace with Persia, and in 446 they agree 
to the Thirty Years’ Peace with Sparta. Under Pericles, 
Athens devotes its energies to consolidating the sprawl- 
ing empire. But like Xerxes, Pericles is unable to 
exercise restraint in the longer term. Convinced of his 
ability to control events at home and abroad, he 
persuades an initially reluctant assembly to seize the 
opportunity of alliance with Corcyra in the erroneous 
expectation that the worst possible outcome will be a 
short war in which Sparta will discover the futility of 
opposing Athens. This initial hamartia leads to war,
plague, the death of Pericles, a prolonged war, and
abandonment of Pericles’ defensive strategy. A second 
hamartia, the Sicilian expedition, urged on the assem- 
bly by Alcibiades, leads to nemesis. 

Cleon, intended to represent a figure intermediate 
between Pericles and Alcibiades, shows none of 
Pericles’ caution or thoughtfulness. He is as unscrupulous
as Alcibiades (Thucydides calls him “the most
violent man at Athens”) but not as clever in his
pursuit of power (Thucydides 3.36). He launches a 
stinging verbal attack on Nicias, accusing him and his 
troops of cowardice in facing the Spartans in Pylos. 
Nicias offers to stand aside and let Cleon assume 
command of his forces. Cleon discounts this as mere 
rhetorical posturing, but Nicias then resigns his command.
Cleon tries desperately to back down, but the
assembly, remembering his earlier bravura, will not let 
him do so (4.24-9). Cleon is forced to sail for Pylos,
where he and Demosthenes succeed, much to Cleon’s 
surprise and relief, in overwhelming the Spartans in 
short order (4.29-42). In the aftermath of his victory,
Sparta sues for peace to secure the return of its
hostages, and the Archidamian phase of the Peloponnesian
War comes to an end.

Not content with the peace, Alcibiades convinces the 
assembly to renew the war and embark upon a policy of 
imperial expansion. Thucydides regards the decisions 
to ally with Corcyra and conquer Sicily as the most 
fateful decisions of the assembly; each is a hamartia, 
and together they lead to nemesis. In discussing these 
decisions, he suggests the real motives of the assembly 
and hints at the contradictions these entail as well as 
the unexpected and tragic consequences that will fol- 
low (1.44; 4.65). The decision to ally with Corcyra 
requires a second debate in which the assembly reverses
itself. This also happens in the punishment of
Mytilene and the Sicilian expedition (1.44; 3.36; 6.8). 
But the most important similarity, which sets the 
Corcyra and Sicilian decisions apart from other events 
in the history, is that Thucydides provides “archeolo- 
gies” that establish the background for the momentous 
events that will follow (Thucydides 1.2-13, 6.2-6; see 
Connor 1984; Rawlings 1981). He not only heightens 
the connection through his use of this analytical paral- 
lel but also suggests that we read the Sicilian debate as 
a new beginning, a history within the history that 
describes decisions and events that deserve equal bill- 
ing with those that led to the war. 

Nicias does his best to dissuade the assembly, which 
is utterly ignorant of the size and population of Sicily, 
from sailing against an island so large, distant, and 
powerful. As does Artabanus in his plea to Xerxes, 
Nicias urges (6.9-14) the Athenians to keep what they 
have and not risk “what is actually yours for advantages 
which are dubious in themselves, and which you may or 
may not attain.” Alcibiades, cast in the role of Mardo- 
nius, makes light of the risks of the expedition and 
greatly exaggerates its possible rewards to the assem- 
bly. He does not attempt to rebut the arguments of 
Nicias but makes a calculated, emotional appeal to a 
receptive audience. Nicias comes forward a second 
time (6.20-3) and, recognizing that direct arguments
against the expedition will not carry the day, tries to
dissuade the assembly by insisting on a much larger 
force and more extensive provisions than originally 
planned. To his surprise, the more he demands from 
the assembly, the more eager it becomes to support the 
expedition, convinced that a force of such magnitude 
will be invincible (6.24-6).

There are striking similarities in plot and language 
between Thucydides’ account of the Athenian assem- 
bly and Herodotus’ depiction of Xerxes at Abydus 
(Connor 1984; Rahe 1996). Thucydides describes the 
Sicilian expedition as more extravagant than any Greek 
campaign that preceded it by virtue of its lamprotés
(splendor) and tolma (audacity). These are words used 
by Herodotus and other Greeks to describe Xerxes’ 
court and military plans. Readers of Thucydides would 
have found his work old-fashioned. He could assume 
that they were familiar with the works of Aeschylus, 
Sophocles, Euripides, and Herodotus and that most 
would recognize his personification of Athens as a 
tragic hero and the mordant comparison he intended 
between Athens and Persia. This format and analogy 
would encourage readers to consider the story of 
Athens as the basis for generalizations about Greece 
and the human condition. 


NOMOS VERSUS PHUSIS 


Greek city-states were isolated from one another and 
the wider world by mountain ranges or large bodies of 
water. In the fifth century, economic growth, immigra- 
tion, and improvements in shipbuilding enabled the 
Greeks to expand their travel and trade and learn more 
about the customs of other peoples. In the process, 
they began to question their long-standing belief that 
their social practices were god-given and moved toward
a position of cultural relativism. In Athens there
was an intense, century-long debate about the relative 
importance of human nature (phusis) and convention 
(nomos) (Finley [1942] 1967; Kerferd 1981). Pindar, 
who declared that custom is the master of us all, and 
Herodotus, who offered a detailed and nonjudgmental 
account of the diversity of human practices, anchored 
one pole of this debate. Sophocles resisted their agnos- 
ticism and relativism. Plato, in his Protagoras and the 
Republic, would offer the most sophisticated defense of 
the underlying importance of innate qualities. 
Realists and some classicists assert that for Thucy- 
dides phusis trumps nomos (Crane 1998; de Ste. Croix 
1972; Romilly 1990). They cite references in speeches 
to universal laws that govern human behavior and 
behavior that appears to lend substance to these 
claims. One example is the justification for empire the 
Athenian envoys offer to the Spartan assembly on the 
eve of the war. They are doing nothing more than 
acting in accord with “the common practice of man- 
kind” (hê anthropeia phusis) that “the weaker should be 
subject to the stronger” (Thucydides 1.76). The Athe- 
nians give the same justification to the Melians. If 
neorealists and their classical allies are right, then 
human drives for dominance (arché), ambition (philotimia),
and self-aggrandizement (pleonexia) will
sooner or later undermine and defeat any effort to
construct an international order based on norms, con- 
ventions, law, and underlying common interests. Is this 
inference warranted? 

Heraclitus maintains that nature (phusis) tends to 
conceal itself, and its seemingly contradictory manifes- 
tations have an underlying unity (harmonia) that can 
be discovered through reflection. Thucydides bases his 
inquiry on this assumption and searches for some 
means of getting beneath the established social order 
and day-to-day behavior to discover what truths lie 
underneath. Plato attempts something similar and for 
much the same reason. Thucydides models his inquiry 
on medical research (Cochrane 1929). Hippocrates and 
his followers chart the course of diseases in the human 
body, noting the symptoms that appear at the onset and 
how these build to a critical moment or crisis stage 
(kairos) that leads to death or recovery. Thucydides 
applies this method to the social diseases of revolution 
and war; he describes their manifestations and charts 
their course through the body politic to the point of 
social strife (stasis) and the disintegration of civil 
society. As physicians sought to learn something about 
the nature of the human body from studying the 
progression of illness, so Thucydides hoped to learn 
about the human mind. 

Thucydides (2.47-54) makes the link between phys- 
ical and social diseases explicit in his analysis of the 
Athenian plague of 430-28. He begins by noting the 
common view that the disease arrived in Athens via 
Africa but refuses to speculate about its causes. Fol- 
lowing Hippocratic tradition, “I shall simply set down 
its nature, and explain the symptoms by which perhaps 
it may be recognized by the student, if it should ever 
break out again” (2.48.3). He describes in clinical detail 
the onset of the disease, subsequent symptoms, varia- 
tion in the course of the illness, the suffering and 
fatality it causes, and the disfigurement of survivors. 

The plague left the city crowded with dead and 
dying. Bodies accumulated and decayed in houses, 
half-dead creatures roamed the streets in search of 
water, and sacred places were full of the corpses of 
those who came there seeking relief. As rich and poor 
died off in large numbers, the social fabric began to 
unravel. “Men, not knowing what was to become of 
them, became utterly careless of everything, whether 
sacred or profane” (2.52.3). Family responsibilities 
were ignored in violation of the most fundamental 
ethical principle of Greek society: the obligation to 
help one’s own philoi. People were increasingly afraid 
to visit one another, and many sufferers died from 
neglect. Sacred rituals were ignored, burial rites were 
dispensed with, and corpses were disposed of in any 
which way. Some residents resorted to “the most 
shameless modes of burial, throwing the bodies of their 
family or friends on the already burning pyres of 
others” (2.52.3). “Lawless extravagance” became increasingly
common, and men “coolly ventured on what
they had formerly done in a corner” (2.53.1). Those 
who suddenly inherited wealth “resolved to spend 
quickly and enjoy themselves, regarded their lives and 
riches as alike things of the day” (2.53.2). Fear of the
gods and human laws all but disappeared, as “each felt
that a far severer sentence had been already passed 
upon them all and hung over their heads, and before 
this felt it was only reasonable to enjoy life a little” 
(2.53.4). 

The other stasis that Thucydides records in detail is 
political: the revolution, civil violence, and moral disintegration
of Corcyra in the 420s (3.70-81). As in the
account of the plague, he begins with a detailed, 
precise, almost day-by-day description of what tran- 
spired. This sets the stage for a more impressionistic 
account, followed by generalizations based on that 
account, and he ends with a depiction of the gravest 
atrocities. Violent conflict between democratic and 
oligarchic factions, intervention by the foreign allies of 
both, and internal revolution culminate in seven days 
of “butchery” in which Corcyreans, consumed by ha- 
treds arising from private and political causes, kill as 
many of their enemies as they can lay their hands upon. 
As in Athens, every convention is violated: “Sons were 
killed by their fathers, and suppliants dragged from the 
altar or slain upon it, while some were even walled up 
in the temple of Dionysus and died there” (3.81.4-5).

Just as the plague ushers in an era of lawlessness and 
boldness (tolma) that significantly affects domestic 
politics and foreign policy, so the Corcyrean revolution, 
for much the same reason, is the precursor of similar 
developments in other cities. After Corcyra, Thucy- 
dides (3.82) tells us, “the whole Hellenic world” is 
convulsed as democratic factions seek to assume or 
maintain power with the help of Athens, and oligarchs 
do the same with the support of Sparta. “The sufferings 
which revolution entailed upon the cities were many 
and terrible, such as have occurred and always will 
occur, as long as the nature of mankind remains the 
same, though in a severer or milder form, and varying 
in their symptoms, according to the variety of the 
particular cases” (3.82.1). 

These extreme situations bring out the worst in 
human beings, and the passage just quoted can be read 
as support for the universality and immutability of 
human nature. But Thucydides (3.82.2) modifies his 
generalization in the next sentence: “In peace and 
prosperity states and individuals have better sentiments 
because they do not find themselves suddenly con- 
fronted with imperious necessities; but war takes away 
the easy supply of daily wants, and so proves a rough 
master that brings most men’s characters to a level with
their fortunes.” The arrow of causation is reversed; 
stasis does not so much reveal the hidden character of 
people as it shapes that character. People who have 
little to live for behave differently from people who 
have much to lose. The qualifier “most” is important 
because it indicates that not everyone responds the 
same way to social stimuli, not even in the most 
extreme situations. In his description of the plague, 
Thucydides (2.51) uses parallel constructions to de- 
scribe how some people, fearful of succumbing to the 
disease, isolated themselves at great costs to friends 
and family; others placed honor above survival, and 
“honor made them unsparing of themselves.” Some 
survivors participated in the greatest excesses, whereas
others were unstinting in administering to the ill and
dying. The same bifurcated response can be observed
at the other end of the spectrum in secure and
prosperous societies: The majority of people adhere to 
social and religious conventions, and a minority is 
unconstrained and destructive in behavior. 

Thucydides has a less deterministic understanding of 
human nature. By removing the constraints and obli- 
gations arising from convention, stasis permits the 
fullest expression of the worst human impulses, but in 
some people it brings out the best. The plague and 
Corcyrean revolution, and the wide range of other 
“tests” to which human beings are subjected in the 
course of the Peloponnesian War, indicate that human 
nature encompasses a range of needs, desires, and 
impulses, some of them contradictory (Kokaz 2001). 
People appear driven by their needs for self-preserva- 
tion, pleasure, recognition, and power but also by 
needs for love, honor, and esteem. The Melian Dia- 
logue offers a nice counterpoint to the Corcyrean 
revolution in this respect. Opposition to Athens is 
futile, but the Melians choose to resist because they 
value freedom more than self-preservation. 

The Hippocratic physicians taught that phusis varied 
according to the environment. Some believed that 
traits acquired through social practice (nomos) could, 
over time, modify nature (phusis). Thucydides believes 
that behavior is the result of a complex interaction 
between the two. If human nature could not be har- 
nessed for constructive ends, civilization would never 
have developed. This conclusion refocuses our atten- 
tion on the meta-theme of Thucydides’ narrative: the 
rise and fall of Greek civil society and the circumstances
in which positive and negative facets of human
nature come to the fore. 


LOGOI AND ERGA 


Ober (1998) maintains that Thucydides privileges erga 
over logoi. From Thucydides’ perspective, both deeds 
and words are social constructions, but he gives pride 
of place to logoi. Social facts and social conventions 
create the intersubjective understandings on which all 
action depends. Social facts often misrepresent brute 
facts, but Thucydides considers this discrepancy a 
double-edged sword. It can prove destructive, as it did 
in the Sicilian debate, for the reasons Ober describes. 
But it is potentially beneficial, if not essential, to the 
maintenance of community. Democratic ideology in 
Athens exaggerated the equality among classes and 
downplayed political, economic, and social inequali- 
ties. It reconciled the démos to the existing social order 
and muted the class tensions that led to violent conflict 
and civil wars in many other polities. The Athenian 
democratic ideology rested on myths: on social facts at 
variance with reality and on a history that bore only a 
passing relationship to so-called empirical facts, as the 
Archeology in Book One convincingly demonstrates. 
It is no coincidence that observations about words 
(Thucydides 3.82) follow directly on a discussion of 
how the Corcyrean revolution affected the rest of 
Hellas. “Revolution ran its course from city to city, and
the places which it arrived at last, from having heard
what had been done before, carried to a still greater 
excess the refinement of their inventions, as manifested 
in the cunning of their enterprises and the atrocity of 
their reprisals” (3.82.3). Language is the vector by 
which the disease of revolution spreads, but it is also a 
contributing cause of constant movement (kinésis) and
destruction (Saxonhouse 1996). Not just in Corcyra but 
throughout much of Greece, “words had to change 
their ordinary meanings and to take those which were 
now given them.” Thucydides (3.82) gives a string of 
examples, and all indicate the extent to which meanings 
and the values they expressed were subverted: 


Reckless audacity came to be considered the courage of a 
loyal supporter; prudent hesitation, specious cowardice; 
moderation was held to be a cloak for unmanliness; ability 
to see all sides of a question, incapacity to act on any. 
Frantic violence became the attribute of manliness; cau- 
tious plotting a justifiable means of self-defense. The 
advocate of extreme measures was always trustworthy; his 
opponent a man to be suspected. To succeed in a plot was 
to have a shrewd head, and to divine a plot still shrewder, 
but to try to provide against having to do either was to 
break up your party and to be afraid of your adversaries. 


Words are the ultimate convention, and they also 
succumbed to stasis. Altered meanings not only 
changed the way people thought about one another, 
their society, and their obligations to it but also encour- 
aged barbarism and violence by undermining long- 
standing conventions and the constraints they en- 
forced. Thucydides (3.82.8) attributes this process to 
“the lust for power arising from greed and ambition; 
and from these passions proceeded the violence of 
parties once engaged in contention.” Politicians used 
“fair phrases to arrive at guilty ends” and degraded and 
abased the language. 

Thucydides follows the introductory remarks in 
Book One with the so-called Archeology (1.2-21), in 
which he describes the rise of Hellenic culture. In 
contrast to other fifth-century accounts of the rise of 
civilization, less emphasis is placed on agriculture and 
the development of material technology and more 
stress is given to the power of tyrants to cobble together 
small settlements into increasingly larger kingdoms and 
alliances. He portrays archaic Greece as being in 
constant movement as a result of frequent migrations 
due to population growth, depletion of local agricul- 
tural resources, and the depredations of pirates and 
invaders. Civilization, defined as a state of peace and 
rest (hésuchia), only became possible when communi- 
ties combined to undertake common action, including 
the suppression of piracy. Common action required 
common understanding; language was the vehicle of 
this understanding and the very foundation of political 
stability and civilization. Civilization is also due to a 
reinforcing cycle of logoi and erga. The Archeology 
sets the stage for the history of decline that follows. 

Greeks distinguished men from animals by their 
ability to speak and their preference for cooked meat. 
The word omos (raw) is used three times by Thucy- 
dides (3.94, 3.36, 3.82.1): to describe an Aetolian tribe 
so uncivilized that “they speak a language that is
exceedingly difficult to understand, and eat their flesh
raw”; in the Mytilenian debate, to characterize what 
many Athenians think about the previous day’s deci- 
sion to execute all the Mytilenians; and to describe the 
stasis that convulsed the Greek world beginning with 
the revolution in Corcyra. Rahe (1996) suggests that 
the word is used on the last two occasions to indicate 
that the war, plague, and revolutions reversed the 
process described in the Archeology. The measure of 
rest (hésuchia) and peace civilization brought about 
was disrupted by the movement (kinesis) of war, which 
undermined conventions (nomoi), including those of 
language, and encouraged the kind of brazen daring 
(tolma) that provoked “raw” and savage deeds. The 
Greeks became increasingly irrational and inarticulate 
(alogistos) and, like animals, no longer capable of 
employing the logos (rational faculties and language)
necessary for communal deliberation. 

Is the rise and fall of civilization inevitable? Greek 
myth and saga portray a largely unalterable world, but 
one that is only tenuously connected to the time in 
which the audience dwells. The great playwrights car- 
ried on this tradition, and the tragic sense of life 
depends on the inevitability of nemesis and the immu- 
tability of things (Beye 1987). Like the plots of so much 
myth and epic, tragedy also relies on the intervention 
of the gods and the power of situations to generate 
pressures and psychological states that move the action 
along and leave limited choice to the individual. In 
Agamemnon (176-83), Aeschylus explains that “Zeus 
shows man the way to think, setting understanding 
securely in the midst of suffering. In the heart there 
drips instead of sleep a labor of sorrowing memory, 
and there comes to us all unwilling prudent measured 
thought; the grace of gods who sit on holy thrones 
somehow comes with force and violence.” Orestes 
confronts a dilemma not of his own making and from 
which there is no exit. The chorus, whose lines I quote, 
reminds us that the most he can do is preserve his 
dignity and learn from his suffering. Herodotus im- 
ported this tradition into prose. His Xerxes has no 
control over his fate; the power of Persia and the 
insolence of the Greeks compel him to attempt their 
conquest. When he has second thoughts, the gods 
intervene through Mardonius to push him to invade 
Greece, just as the Argives are compelled to make war 
against Troy by Athena, who speeds down from Olym- 
pus to convince Odysseus to prevent their departure 
(Homer 2.135-210). 

For Herodotus, the stories of Croesus and Xerxes 
are concrete manifestations of a timeless cycle of 
hubris-até-hamartia-nemesis that can be expected to 
repeat itself so long as humans walk the earth. The 
same attitude of resignation and acceptance has been 
attributed to Thucydides. Some of his actors do artic- 
ulate this perspective. The Athenian envoys at Sparta 
portray themselves as prisoners of history and seem to 
understand that they are playing roles in a grand, 
historical drama, although not yet framed as a tragedy 
(1.75). Pericles warns his countrymen that one day 
they, too, will be forced to yield “in obedience with the 
general law of decay” (2.64).
Thucydides is not as pessimistic as many realist
readings suggest. Why would he invest decades in the 
research and writing of the history and offer it as a 
“possession for all time” if he thought human beings 
and their societies were the prisoners of circumstance 
and fate? He must have believed that people possess at 
least some ability to control their destiny. The appro- 
priate analogy is to psychotherapy. Freudian therapy 
assumes that people will repeatedly enact counterpro- 
ductive scripts until they confront and come to terms 
with the experiences that motivate this behavior. This 
can only be achieved through regression; people must 
allow themselves to relive painful experiences they 
have repressed and come to understand how these 
shape their present behavior. Sophists relied on a 
somewhat similar process. Their works were offered as 
courses of study that engage the emotions and mind. 
By experiencing the elation, disappointment, anguish, 
and other emotions a story provoked, and by applying 
reason to work through its broader meaning and 
implications, readers could gain enlightenment. Hippocratic
physicians put great store in the curative power
of words. Euripides’ Phaedra and Andromache describe
words as sources of power and psychological
compensation. The plays of Aeschylus are based on the 
maxim of pathei mathos, of learning and transcending 
one’s situation through the pain associated with under- 
standing that situation. There is ample Greek prece- 
dent for Thucydides’ project. 

Like analysts, neither sophists nor tragic playwrights 
tell people what lessons to learn; all believe that
genuine understanding (saphés skopein) can only be 
internalized and influence behavior if it arises from a 
process of cathartic self-discovery. Thucydides’ history 
encourages Athenians and other Greeks to relive trau- 
matic political experiences in the most vivid way and to 
work through their meaning and implications for their 
lives and societies. I believe he harbored the hope that 
such a course of “therapy” could help free people of 
the burdens of the past and produce the kind of 
wisdom that enables societies to transcend their scripts. 

Transcending old scripts requires an alternative vo- 
cabulary. Crane (1998) argues that Thucydides wanted 
to reconstruct the aristocratic ideology, the “ancient 
simplicity” to which he was born and raised. He was 
undeniably attracted to the “ancient simplicity.” Evidence
for this lies in the location of his discussion of it
in the text (3.83), which follows his description of stasis 
at Corcyra. The intended inference is that religion, 
honor, and aristocratic values promote a tranquil and 
secure social and political order, and their decline 
removes restraints to unprincipled self-aggrandize- 
ment. 

The passage is unabashedly nostalgic but also bru- 
tally realistic. The ancient simplicity had not merely 
declined; it had been “laughed down and disappeared” 
(Thucydides 3.83). Here and elsewhere Thucydides 
recognizes the gulf between the old and the new, and 
he knows the life-style associated with the ancient 
simplicity has passed and cannot readily be restored. 
Greece, and especially Athens, has been transformed 
by what can only be called a process of modernization.
Population growth, coinage, trade, the division of
labor, major military undertakings, and empire have 
given rise to new classes, new ideas and values, and new 
social and political practices to cope with a more 
complicated and competitive world. The Athenian 
empire has become so powerful that it no longer needs 
to rely on the standard pattern of client-patron rela- 
tions, based on obligation and the mutual exchange of 
favors and services. Success has made the traditional 
system of political relationships and the values on 
which it rested look old-fashioned and unnecessary, 
even a hindrance. The fate of Sparta also testifies to 
this change. Its influence in Greece derived largely 
from the symbolic capital it had accumulated in the 
form of reliability in the eyes of others, especially allies. 
Spartans had gone to war to preserve this capital and in 
the vain hope that defeat of Athens would stave off the 
changes that threatened their traditional way of life. 
Sparta emerged as the victor in the war, but it was no 
longer the same polis. Spartans had to become more 
like their adversary to defeat it, which is perhaps the 
most compelling evidence that the old ways were 
doomed. 

Thucydides recognizes the impracticality of trying to 
turn the clock back; the aristocratic order and its values 
had become anachronistic, and the effort to reimpose 
oligarchic rule at the end of the Peloponnesian War 
failed miserably. He has a subtler project in mind: 
Adapt older values and language to present circum- 
stances to create a more workable synthesis that can 
accommodate progress but mitigate its excesses. Ober 
(1998) contends that Thucydides looked to Periclean 
Athens for his model. It functioned well because of the 
balance of power between the masses (ho démos) and 
the smaller elite of rich, influential, and powerful men 
(hoi dunatoi). The need of each group to take the other
into account and the presence of leaders such as
Pericles, who mediated and muted these class-based 
tensions, led to policies that often reflected the interest 
of the community (hoi Athénaioi), not merely the 
democratic or aristocratic faction. 

In Book One, Thucydides portrays Pericles as some- 
one who personifies the ancient simplicity but has 
mastered the new arts of oratory and statecraft. His
success in governing Athens under the most trying 
circumstances may have convinced Thucydides that 
such an amalgam was desirable and possible. But his 
praise of Pericles is another one of his judgments that 
is in part subverted later in the text. In Book Four, 
Thucydides offers Hermocrates of Syracuse as another 
role model (Connor 1984; Monoson and Loriaux 
1998). He is intended to be a counterpoint to Pericles 
and a more accurate guide to how foreign policy 
restraint can be sold to the public and a more peaceful 
international order maintained. 

In his appeal to Sicilians for unity against Athens, 
Hermocrates inverts key realist tenets of foreign policy 
that are associated with Pericles (Thucydides 4.59-64). 
Connor (1984) observes that the “law of the stronger” 
becomes an injunction for the weaker to unite, and 
Hermocrates (4.62) goes on to exploit the widespread 
fear of Athens to justify forethought and restraint but 
urges common defensive action. On the eve of war 
Pericles sought to inspire confidence in his fellow 
citizens, but Hermocrates wants to intensify their fears. 
Athens and its enemies attributed Athenian success to 
ingenuity, speed of execution, and confidence in the 
ability to face challenges (Thucydides 1.68—71, 2.35- 
46). Hermocrates finds strength in the restraint and 
caution that come from recognition of the limits of 
knowledge and power and contemplation of the future 
(prométhia) with an eye toward its unpredictability. 
Pericles urged his countrymen to spurn Sparta’s peace 
overtures, but Hermocrates favors accommodation and 
settlement. Successors of Pericles, especially Cleon and 
Alcibiades, encouraged the Athenians to contemplate 
the rewards from imperial expansion. Hermocrates 
implicitly urges his audience to consider the advantages 
they already possess and the loss that war may entail. 
Hermocrates—and Thucydides—had an intuitive 
grasp of prospect theory (Levy 1992, 1996; Tversky and 
Kahneman 1992), which is based on the robust psycho- 
logical finding that people are generally more con- 
cerned with preventing loss than they are with making 
gains. 


Sophists pioneered the rhetorical strategy of “anti- 
logic.” Zeno silenced his opponents by showing how 
their arguments also implied their negations and were 
thus contradictory (Kerferd 1981). Thucydides makes 
extensive use of antilogic. He examines every so-called 
law of politics, appears to validate it, but ultimately 
subverts it by showing the unintended and contradic- 
tory consequences that flow from its rigorous applica- 
tion. This is most obvious with the principles espoused 
by demagogues like Cleon, but it is also true of more 
honorable politicians like Pericles. Thucydides did not 
spoon feed conclusions; he wanted readers to draw 
them by reflecting on his narrative, speeches, and 
dialogues. Hermocrates’ speech is the most overt at- 
tempt to point readers in the right direction. Through 
emotions and intellect—feeling the pain of the rise and 
fall of Athens and grasping the reasons this occurred— 
readers could experience the history as a course of 
“logotherapy.” Its larger purpose was to make them 
wary not only of the sweet and beguiling words of 
demagogues but also, as Monoson and Loriaux (1998) 
suggest, of any politician who advocates policies at 
odds with conventions that maintain domestic and 
international order. This caution is the first and essen- 
tial step toward the restructure of language and the 
reconstitution of conventions that can permit economic 
and intellectual progress while maintaining political 
order. 


THUCYDIDES THE CONSTRUCTIVIST 


Fifth-century Greece experienced the first Methodenst- 
reit. “Positivists” insisted on the unity of the physical 
and social worlds as well as the existence of an ordered 
reality that can be discovered through the process of 
inquiry. They were opposed by “constructivists,” who 
regarded the social world as distinct and human rela- 
tions as an expression of culturally determined and 
ever evolving conventions.4 Early Greek thinkers ac- 
cepted the divine nature of the world and considered 
human customs part of an overall, unified scheme of 
nature. The goal of the Ionian protophysicists was to 
discover the original principle, the arché, that deter- 
mined all the other regularities, social and physical, of 
the universe. Reality was out there, waiting to be 
described in terms of impersonal forces and the agency 
that also expressed those forces. In the fifth century, 
sophists directed their inquiry away from nature to 
human beings. According to Jaeger (1939-45, 1.306), 
“the concept of phusis was transferred from the whole 
universe to a single part of it—to mankind; and there it 
took on a special meaning. Man is subject to certain 
rules prescribed by his own nature.” 

This shift coincided with exposure to alien cultures 
and the discovery of practices that differed remarkably 
from those of the Greeks. People in these cultures also 
found different meanings in the same events. Philo- 
sophical inquiry and experience of cultural diversity 
combined to encourage a subjectivist epistemology in 
which nomos was contrasted with phusis and consid- 
ered by many a more important determinant of human 
behavior. The deeds themselves (auta ta erga) and 
concept of the “real world” became problematic, as did 
the assumption that either could be understood 
through observation. Democritus (1956, fragments 9 
and 11) proclaimed that things were “sweet by con- 
vention, bitter by convention, hot by convention, cold 
by convention,” and he went on to reason that all 
observation was illegitimate. Such skepticism encour- 
aged the belief that truth was relative (Lloyd 1978). 

Given sophistic epistemology, it is not surprising that 
it spawned a cognate to postmodernism. Protagoras, 
who is the best known representative, regarded all 
claims to knowledge as nothing more than rhetorical 
strategies for self-aggrandizement. Justice was a con- 
cept invoked by the powerful to justify their authority 
and advance their parochial interests. Philosophical 
nihilism reached its fullest expression in Critias, who 
defined justice in terms of power and found justifica- 
tion for this in human practice—the very argument the 
Athenian envoys made at Melos. Critias is good grist 
for the mill of any contemporary critic of postmodern- 
ism. A politician and one of the thirty tyrants who 
briefly ruled Athens after its defeat in 404, he was 
infamous for his corruption and brutality (Guthrie 
1969; Strauss 1986). Plato represented a reaction to the 
sophists; he was horrified by their reduction of law to 
custom and by the equation of justice with tyranny. He 
parodied sophists in his dialogues (see especially Pro- 
tagoras) and argued against their efforts to explain 


4 I do not want to exaggerate the parallels between ancient and 
modern philosophies of social inquiry; there were important differ- 
ences in ideas and the relative timing of social and scientific 
advances. In the modern era, advances in mathematics have contrib- 
uted to modern science and, ultimately, the social sciences. In 
Greece, the age of mathematical discovery came after these philo- 
sophical debates were under way. Athenian interest in mathematics 
began a generation after Thucydides; Euclid wrote his Elements at 
the end of the fourth century, and Archimedes made his contribu- 
tions almost a century later. 
physical and social reality purely in terms of its phe- 
nomenal aspects. He sought to restore objectivity and 
the status of universal laws by discovering an underly- 
ing, ultimate reality that would provide a foundation 
for a universal notion of justice and social order 
(Guthrie 1969; Kerferd 1981). 

Like contemporary constructivists, Thucydides was 
fascinated by convention (nomos) and the role it 
played in regulating human behavior. The history 
makes clear that he regarded conventions not only as 
constraints but also as frames of reference that people 
use to understand the world and define their interests. 
It may be going too far to claim that Thucydides 
initiated the “linguistic turn” in ancient philosophy, but 
he certainly shared the constructivist emphasis on the 
importance of language, which he thought enabled the 
shared meanings and conventions that make civiliza- 
tion possible. His history explores the relationship 
between words and deeds and documents the double- 
feedback loop between them. Shared meanings of 
words are the basis for conventions and civic coopera- 
tion. When words lose their meaning, or their meaning 
is subverted, the conventions that depend on them lose 
their force, communication becomes difficult, and civ- 
ilization declines. Thucydides exploited the growth and 
evolution of the Greek language for purposes of ex- 
pression and precision, and he probably coined more 
neologisms than any other fifth-century author. One 
goal of the history is the considered restoration of 
traditional meanings of words to help resurrect the 
conventions they sustained. In this sense, he antici- 
pates Plato. 

The core of constructivism is hard to define because 
there is so much variation among authors. In a thought- 
ful analysis of this literature, Hopf (n.d.) suggests that 
constructivism has two components. The first is appre- 
ciation of social structure, whether understood socio- 
logically, as in the thin institutionalist accounts of 
Finnemore and Sikkink (1998) and others, or linguis- 
tically, as attempted by Kratochwil and Ruggie (1986), 
Kratochwil (1989), Onuf (1989), and Ruggie (1998). 
The second component is the acceptance of the mutual 
constitution of agents and structures (Kratochwil 1989; 


5 Well before Thucydides, Greek philosophy debated the importance 
and meaning of language. There was some recognition that it 
mediated human understanding of reality and thus constituted a 
barrier to any perfect grasp of that reality. An attempted solution was 
to assert that names are not arbitrary labels but imitations of their 
objects. Others (e.g., Hermogenes) insisted that words are arbitrary 
in origin and do not represent any reality. Socrates tried to split the 
difference by arguing that things have a fixed nature that words 
attempt to reproduce, but the imitation is imperfect, and this is why 
languages vary so much. Moreover, all attempts at imitation become 
corrupted over time. 

Considerable effort went into recapturing the meaning of words 
and names in the late fifth century, and Thucydides must be situated 
in that tradition. I see no evidence that he believed in the original 
meaning of words, but certainly he wanted to restore earlier mean- 
ings, supportive of homonoia, that had been subverted. Plato, in 
Phaedrus, 260b, makes a similar argument when he discusses a skilled 
rhetorician who convinces someone to use the name “horse” to 
describe a donkey and thus transfers the qualities of one to the other. 
He is clearly tilting at rhetoricians and politicians who advocate evil 
as good. 
Kratochwil and Ruggie 1986; Onuf 1989). Constructiv- 
ism, in its thicker linguistic version, is interested in the 
logic of intelligibility, that is, what makes some actions 
more imaginable and thus more probable than others. 
The thin version gives more weight to the role norms 
play in advancing interests than to the creation of 
norms by identities. 

Thucydides is undeniably a constructivist and may 
have been the original practitioner of the thicker 
linguistic version. His history examines how language 
shapes the identities and conventions in terms of which 
interests are defined. He drives this point home in the 
most graphic way by showing that it is impossible to 
formulate interests at all when conventions break down 
and the meaning of language becomes subverted. Tra- 
ditional Greek social intercourse, domestic and “inter- 
national,” was embedded in a web of interlocking 
relationships and obligations and governed by an elab- 
orate set of conventions. Dealings with foreigners were 
an extension of domestic relations. There was no 
specific word for international relations—the closest is 
xenia, which generally is translated as “guest friend- 
ship.” 


War was not infrequent but was limited in means 
and ends. With rare exceptions, the independence and 
social system of other city-states were respected; wars 
were waged to establish precedence and settle border 
disputes. Combat was highly stylized and was designed 
to minimize casualties and allow individuals to gain 
honor through the display of heroism. Truces were 
obligatory to permit both sides to gather their dead and 
the victor to erect a trophy (Adcock 1957). With the 
introduction of the hoplite phalanx and later develop- 
ments against massive Persian armies, the character of 
war changed somewhat, but most conventions were still 
observed. They did not break down until late in the 
Peloponnesian War, when even the quasi-sacred truces 
that enabled proper disposal and honoring of the dead 
often were no longer observed. 

To the extent that realist readings of Thucydides 
address the breakdown of conventions, these changes 
are attributed to the effects of war, which is “a rough 
master” (3.82.2). This explanation is not convincing, 
because the Persian wars were equally harsh, yet most 
conventions held. Modern analogies spring to mind. 
The American Civil War was brutal by any standard, 
but both sides generally observed the conventions of 
war. Confederate mistreatment of African American 
prisoners of war was the principal exception, but even 
this reflected a convention. Troops on both sides 
behaved in ways that baffle us today. At Bloody Angle 
at Gettysburg, New Yorkers refused to follow orders to 
fire on the remnant of retreating Alabamians and 
instead threw their caps into the air and cheered them 
for their bravery. In World War I, German and Allied 
armies behaved on the whole quite honorably toward 
each other and civilians, in sharp contrast to World 
War II, especially on the Eastern front, which approx- 
imated Thucydides’ depiction of barbarism. The differ- 
ences were not due to the harshness or duration of war 
but to the character of the political systems. When 
language was subverted and conventions ignored or 
destroyed, as in Nazi Germany, the rational construc- 
tion of interest was impossible, war aims were limitless, 
and the rules of warfare were disregarded. 

Thucydides takes the constructivist argument an- 
other step and implies that civil society is also what 
actors make of it. Following Hobbes, most realists 
maintain that the distinguishing feature of domestic 
society is the presence of a Leviathan that overcomes 
anarchy and allows order to be maintained. For Thucy- 
dides, the character of domestic politics runs the gamut 
from highly ordered, consensual, and peaceful societies 
to those wracked by anarchy and bloodshed. It is not a 
Leviathan that is critical but the degree to which 
citizens construct their identities as members of a 
community (homonoia, literally, being of one mind) or 
as atomistic individuals. When the former view pre- 
vails, as it did in Periclean Athens and in Greece more 
generally before the Peloponnesian War, conventions 
restrain the behavior of actors, whether individuals or 
city-states. When the latter dominates, as in Corcyra 
and almost in Athens after 412, civil society disinte- 
grates, and even a Leviathan cannot keep the peace. 
The domestic environment in these situations comes to 
resemble the war-torn international environment, and 
for the same reasons. 


CONCLUSION 


The history drives home the truth that a strong sense of 
community is equally essential to domestic and inter- 
national order. Some rational choice formulations— 
again following Hobbes—acknowledge this reality and 
recognize that it is necessary to preserve the rules of 
the game if actors collectively are to maximize their 
interests. They highlight the paradox that a focus on 
short-term interests—by individuals, factions, or 
states—can undermine the order or environment on 
which the rational pursuit of interest depends. Thucy- 
dides would regard the tragedy of the commons as an 
unavoidable outcome in a culture in which the individ- 
ual increasingly is the unit to whom advertisers and 
politicians appeal and in terms of whom social scien- 
tists conduct research. He would not find it surprising 
that a significant percentage of the citizens of such a 
society cannot see any reason for or imagine any 
benefit that might accrue from paying taxes. 

The importance of community, and of identities 
defined at least partly in terms of it, was not lost on 
traditional realists. Morgenthau ([1951] 1982, 61) cited 
Edward Gibbon’s observation that the balance of 
power functioned well in the eighteenth century be- 
cause Europe was “one great republic” with common 
standards of “politeness and cultivation” and a com- 
mon “system of arts, and laws, and manners.” As a 
consequence, the “mutual influence of fear and shame 
imposed moderation on the actions of states and their 
leaders” and instilled in all of them “some common 
sense of honor and justice” (p. 60). However much they 
desired to increase their power at the expense of their 
neighbors, they limited their ambitions because they 
recognized the right of others to exist and the funda- 
mental legitimacy of the international political order. 
Morgenthau regards the breakdown of this sense of 
community as the underlying cause of both world wars 
and the threat to humanity posed by the Cold War. The 
same objection can be raised about liberal, institution- 
alist approaches that stress the role of institutions in 
creating and maintaining order. Those institutions may 
flourish and function as they do—when they do— 
because of an underlying sense of community. 
Thucydides’ history suggests that interest and justice 
are inextricably connected and mutually constitutive. 
On the surface they appear to be in conflict, and almost 
every debate in his history in one form or another pits 
considerations of interest against those of justice. But 
Thucydides, like Democritus, is interested in the un- 
derlying and often hidden nature of things. At that 
level, the history shows that interests cannot be intel- 
ligently considered, formulated, or pursued outside a 
homonoia and the identities it constructs and sustains. 
The creation and maintenance of homonoia depends 
on enduring individual commitments to justice and 
respect for other human beings (or political units). In 
the most fundamental sense, justice enables interests. 
Materialist interpretations of Thucydides, which 
overwhelmingly are realist, offer a superficial and one- 
sided portrayal. Constructivist readings must avoid this 
error. Thucydides is both a realist and a constructivist. 
Stasis and homonoia represent two faces of human 
beings; both are inherent in their phusis. Materialism 
and constructivism are equally germane to the study of 
international relations. They need to build on Thucy- 
dides’ research program, that is, discover the condi- 
tions that underlie stasis and homonoia and what 
caused transitions between them. For this reason 
alone, the history is “a possession for all time.” 
Machiavelli scholarship is prolific but claustrophobic. Even though chapter 18 of The Prince advises 
the aspiring leader to emulate both lion and fox, commentators ignore or devalue the lion and 
focus on the fox. Machiavelli is thereby depicted as a champion of cleverness and deception, and 
not much else. This article takes up the lion. It argues that Machiavelli’s lion is not a simple and violent 
beast, but is rather a complex tutor that complements clinical and lonely foxiness with crucial injections of 
virility and community. 


Just why did Plato write The Republic? Was the 
Academy failing to recruit the most promising 
youngsters? Or was it just an easy way to extract 
eighty talents from Dionysius the Younger? Per- 
haps Plato was merely indulging a nascent passion for 
spelunking? Of course, we do not ask these questions 
of our icon of metaphysics. Niccolò Machiavelli, in 
contrast, suffers incessantly the indignity of similar 
suspicions. Variously, he is the sycophant who solicits 
his republican soul for aristocratic patronage (Godman 
1998). He is the nerd whose revenge is delivered 
vicariously in the form of macho prose (Pitkin 1984). 
Or he is a feeble lecher trying to impress a fetching 
young actress (Ridolfi 1954, 312). It is hardly surprising 
that we impute Machiavelli’s motives. A scholar by 
default, he could not help but educate his cloistered 
forebears as to the advantages of hidden agendas and 
false promises. Understandably stupefied by his venal- 
ity, commentators regroup, whispering exotic specula- 
tions regarding his true intentions. 

Friedrich Meinecke (1965, 25-48) leads a contingent 
that detects a moral residue in Machiavelli’s unprece- 
dented dissimulation (see also Chabod 1980; Dotti 
1979; Foscolo 1972; Russo 1983; Zanini 1984). Trou- 
bled by the incompatibility of the normative and the 
feasible, Machiavelli is said to dilute a nonetheless 
sincere attachment to moral behavior, or “universal 
norm” (Derla 1980, 35), for emergency reasons. Gram- 
sci (1966, 10), for instance, refutes the alleged “scien- 
tific disinterestedness” in Machiavelli, claiming that his 
true intention is to leak proprietary strategies of the 
privileged to “the revolutionary class of the time, the 
‘people’ and the ‘Italian nation,’ the democratic citizen 
who feels in his breast the Savonarolas and the Pier 
Soderinis and not the Castruccios and the Valentinos”1 
(1966, 10). More recently, Dietz (1986) argues that The 
Prince is diabolical to end the diabolical, a sophisti- 
cated trap into which an ing Medici boob 
might fall, thus opening a republican window of oppor- 
tunity. Wolin (1960, 207) detects “an essentially moral 
response inspired by a concern for man in an age of 
political corruption” (see also Derla 1980, 27-8; Sasso 
1980, 293-327; Struever 1992, 164-81). Berlin (1982, 
55-6) ventures that Machiavelli’s endorsement of 
power politics is no less than an alternative morality. 

The opposing position promotes a more literal, 
morally uncluttered reception of Machiavelli’s calcula- 
tions and is often associated with Benedetto Croce 
([1914] 1993; see also Cassirer 1944, 167; de Sanctis 
1956, 511; Gentile 1968, 118; Hulliung 1983; Kahn 
1986; Mansfield 1985). Croce argues that Machiavelli 
has little interest in a concept of goodness transcendent 
of the will of the strongest. His politics precedes 
morality as base precedes superstructure, and morality 
is no more than a luxury afforded the politically 
comfortable. But where Croce detects anguish in Ma- 
chiavelli regarding the sacrifice, Cassirer’s Machiavelli 
is more venal. And Strauss (1958, 11), of course, goes 
so far as to say that Machiavelli is a “teacher of evil.” 

That scholarship is limited to variations on Machia- 
velli’s moral status is tribute to the strange charisma of 
his meditations on expediency. Whether one embraces 
the more sensitive Machiavelli of Meinecke or the 
more clinical Machiavelli of Croce, a claustrophobic 
element persists in an investigative focus that presup- 
poses compromise as Machiavelli’s overweening inter- 
est. We obsess over the extent to which such exquisite 
human qualities as integrity and charity succumb to 
trickery and deceit. The debate is over whether the 
moral justification is only highly leveraged or just 
nonexistent. Sasso (1964, 47-51) argues that the only 
significant exchange among modern scholars of Ma- 
chiavelli regards the extent to which he laments his 
unprecedented duplicity. 

Due to this focus on concession and cleverness in the 
face of difficult surroundings, Machiavelli is a magnet 
for critics of modernity, who indict his claims of success 
by exposing what are thought to be his shallow aspira- 
tions. By focusing on expediency, Machiavelli is cred- 
ited with turning politics toward its eventual depiction 
as “who gets what when how” (Lasswell 1936). The 
“why,” which for Machiavelli’s Greek and Latin foils 
provides politics its essence, is left to the “historian, the 
lawyer, or the philosopher” (Lasswell 1936, 23). Accord- 
ing to his critics, Machiavelli escorts politics down to 
the Piraeus never to return to the rarefied atmosphere 
of the Acropolis. His contempt for the good and the 
frail in favor of the commoner currency of selfishness 
and force is seen as a minor revision from the banality 
of Hobbes and Locke. Chance can be eliminated from 
human life by pursuing an existence of not taking 
chances.2 

This is the ironic sense in which Machiavelli is 
labeled the founder of the Enlightenment, the first 
modern. The indictment is bipartisan. From the Right, 
Strauss (1962) identifies Machiavelli as charter con- 
certmaster of the modern orchestra that fiddles while 
Rome burns.4 From the Left, Horkheimer (1982) ac- 
cuses Machiavelli of orchestrating the stifling regimen- 
tation of industrial reason. Whether one attributes to 
him the loss of philosophy (Strauss) or the loss of 
poetry (Horkheimer), Machiavelli is said to have aban- 
doned the exquisite in favor of “statecraft” (Butterfield 
1940); he is the precursor to Lasswell’s political “sci- 
ence,”5 the purported value neutrality of which masks a 
craven surrender to vulgar, perfunctory, or even evil 
“whys.” 

So, notwithstanding a secondary debate about the 
relative importance of a remote moral interest, Ma- 
chiavelli has come to represent the quintessential tac- 
tician. Political success, we are told, depends upon the 
prince’s ability to hide in the bushes, studying and 
countering the ambitions of malefactors. We are drawn 
to that most famous passage in that most famous 
chapter, in which Machiavelli advises his prince to 
emulate the fox, the cleverest of animals. The fox 
ratifies the underlying consensus in Sasso’s scholarly 
dichotomy. The fox is sly, not above employing its 
intelligence in service of the ignoble. 

Machiavelli also asks his prince to consider the 
demeanor of the lion, but that is dismissed by his 
analysts as an uninteresting or unimportant accessory, 
even though Machiavelli grants equal proportions to 
both of the animals in his hybrid. Kahn (1986, 65), for 





2 For J. G. A. Pocock (1975, 31-80), the “Machiavellian moment” 
occurs when human ingenuity is of such confidence that it is willing 
to assume the responsibility of human destiny. Thus, for Pocock, virtù 
consists of the extent to which human cleverness can avoid or 
overcome the vicissitudes of fortuna. 

3 Admittedly, the extent to which analysts recognize an irony is 
variable. The treatments by Strauss and Horkheimer are at the 
extremes, whereas some are almost congratulatory that Machiavelli 
can be linked to nationalism, or science, or liberalism. Mansfield 
(1975, 373) argues that these authors do not raise the question of 
Machiavelli’s modernity “in sufficiently broad and uncompromising 
terms.” Perhaps the most categorical depiction of Machiavelli as the 
quintessential modern is that of Robert Hariman who, employing 
postmodernist vernacular, accuses Machiavelli of imposing a disem- 
bodied metanarrative of cold power that not only envelops constit- 
uent “texts” but human desire too (Hariman 1989, 28). A good 
repository of bibliographic information on Machiavelli’s modernism 
is Viroli 1990. 

4 Of course, Strauss’s (1962, 327) famous indictment of the “science” 
of politics is even more severe, since political science “does not know 
that it fiddles and it does not know that Rome burns.” Strauss (1959) 
holds Machiavelli responsible for endorsing the abdication of con- 
siderations of the “why” to expediency. 

5 Cochrane (1970, 170, translation mine) can say unreservedly that 
“he is everywhere recognized founder of another discipline, that of 
political science.” 

6 According to Sasso (Machiavelli 1963, 150, n. 1, translation mine), 
“chapter 18 of The Prince is perhaps the text most tortured, studied, 
vilified, and disparaged among those numbered not only in the entire 
work of Machiavelli, but all of modern political thought.” 
instance, finds a “particularly forceful articulation in 
chapter 18” of “the aim of The Prince as a whole,” 
which is nothing more than the “redefinition of repre- 
sentation as ruse and thus of mimesis as power”; 
Machiavellian virtù is tantamount to “mere cleverness.” 
Dietz (1986, 78) mentions only the fox in her depiction 
of The Prince: “In short, whether the subject is love, 
war, or politics, Machiavelli recognizes the advantages 
of crafty assault in any form, be it trickery, stratagem, 
or artifice.” Hulliung (1983, 214) at least admits the 
lion’s existence but asserts that Machiavelli reserves his 
deepest admiration for those who are able to dominate 
by resorting to “the subtle methods of the fox rather 
than the crude methods of the lion.” Similarly, Sasso 
(Machiavelli 1963, 152, n. 7) associates the fox with 
“shrewdness and cunning prudence” (astuzia e accorta 
prudenza), the lion with only primitive traits of “impet- 
uosity and violence” ( e violenza) (see also Bacon 
[1869] 1973, 84). Pitkin (1984, 46) argues that “the lion 
is not a standard of manhood Machiavelli ever takes 
seriously, either for himself or for those he admires.” 
Although Rebhorn (1988) suggests equal treatment in 
the title of his book, Foxes and Lions, he immediately 
undertakes an exposé of Machiavelli’s “trickster fig- 
ures . . . in short, foxes of every species” (p. x). Machia- 
velli’s heroes, according to the book’s subtitle, are no 
more than “confidence men.” 

In this article, I consider the lion. I show that the 
successful prince needs more than trickery, stratagem, 
and artifice. In our haste to sensationalize Machia- 
velli’s foxy reduction of the classical attachment to 
exquisite moral standards, we overlook the extent to 
which the lion represents a widening rather than a 
lowering, a recognition that Socrates’ contemplative 
priorities helped him die well but did not always help 
him live well. Politics is about life, and Machiavelli, 
invoking the lion, would counsel escape from prison. 
Although the escape involves a foxy plan, it demands 
some of the virility, comradery, and innocence of the 
lion. Ironically, Machiavelli anticipates and addresses 
the same deficiencies of modernity for which he is held 
responsible, as an examination of the lion will show. 


A BEASTLY HUMANISM 


Machiavelli prepares for the lion, and the fox for that 
matter, with preliminary discussions that establish the 
importance of animal traits in princely comportment. 
Indeed, as respected editions of The Prince remind us 
(Borsellino 1989, 99, n. 1; Burd 1891, 301, n. 1; 
Machiavelli 1960, 72, n. 2, and 1963, 151, n. 2), 
Machiavelli’s ratification of princely animalism is rein- 
forced with allusions to a classical humanist source, the 
De Officiis of Cicero, in which the Roman aristocrat 
informs his departing son that there may come a time 
when the human method for settling disputes may not 
be available to him, and he may need to resort to tactics 
of the beast (beluarum) (Cicero 1913, 37 [De Officiis 
1.11.34]). Although there may be scholarly consensus 
regarding the involvement of Cicero and classical hu- 
manism in chapter 18, there is controversy regarding 
the nature of that involvement. This controversy is 
informative regarding what I believe to be a misinterpretation
of the scope of Machiavelli’s “princely animalism”
and thus the scope of traits attributable to the
lion. 

On the one hand, Colish (1978) argues that Machia- 
velli’s references to Cicero are straightforward and 
deferential (see also Gentile 1968, 365-72; Allan Gil- 
bert 1938, 2; Felix Gilbert 1939, 464; Merleau-Ponty 
1964; Olschki 1970; Skinner 1981, 36-41; Toffanin 
1921; Viroli 1998, 54). She maintains that the pragmatism
of classical humanism remains mostly unappreciated,
due to the analytical residue of medieval interpretations,
and that Machiavelli’s concern for verità
effettuale is not that far from a similar interest found in 
classical humanism, which considers the moral good 
(honestum) in terms of the constraints of feasibility 
(utile).7 Machiavelli’s reference to the beast, then, is a 
concession to a practicality that he shares with classical 
humanism. On the other hand, Hexter (1973, 210) 
claims that Machiavelli invokes Cicero in order to 
pervert him; whereas Cicero might intermittently di- 
vert his moral interest with questions of practicality, he 
never considers the Machiavellian “dislocation of vir- 
tue and reason and nature from their customary rela- 
tions with each other” (see also Hulliung 1983, 213; 
Mansfield 1996, 35; Paparelli 1970, 86; Raimondi 
1972). For using Cicero’s vernacular to such a diver- 
gent purpose (Raimondi [1972, 266] calls it a “deforma- 
zione”), Machiavelli is accused of “one of the most 
efficient perversions of a writer’s intention in the long 
history of literary hocus-pocus” (Hexter 1973, 208). 

If nothing else, this discrepancy substantiates Sasso’s 
assertion regarding the pervasiveness of the debate 
about moral compromise. The extent of Machiavelli’s 
humanism is depicted as dependent solely upon the 
degree to which both he and Cicero adjust normative 
commitments in the face of pragmatic concerns. The 
controversy centers on whether, and to what extent, 
there is room in humanism for the occasional suspension
of moral constraints. Restricting the debate about
Machiavelli’s humanism to such familiar territory 
yields smug conclusions. Skinner (1981, 37-8, emphasis 
in original), for instance, identifies a “simple” Machi- 
avellian divergence, namely, that “if a ruler wishes to 
reach his highest goals, he will not always find it
rational to be moral.” Viroli (1998, 54), similarly
focused, argues that Machiavelli merely “restricts the 
range” of Cicero’s humanism, admitting moral com- 
promise to a wider variety of political pursuits. But 
Machiavelli’s departure from humanism is simple and
restrictive only to those whose analyses are spellbound 
by traits associated with the fox. Analysts who debate
whether there is a place for the fox in classical humanism
may miss in Machiavelli a significant expansion of
animalism in general as it pertains to princely comport- 
ment. That expansion is wide enough to enhance the 
domain of the lion, and it is wide enough to detach 
Machiavelli from humanism in ways more complex and
substantial than detected by analysts preoccupied with
the morality issue.


7 Toffanin (1921) makes a similar connection of Machiavelli and
humanism, using Tacitus in place of Cicero.

That Machiavelli’s princely animalism is more com- 
plex than enhanced foxiness should be clear, given his 
employment of the centaur to introduce it. The juxta- 
position of appropriate animalism with centaurs is an 
allusion Cicero does not make. By emphasizing the 
parity between the human and the animal in the 
prince’s curriculum,8 Machiavelli thoroughly betrays
the humanist hierarchy, in which the animal is a much 
less significant human component. Although Cicero 
recommends resorting to the animal in the resolution 
of disputes, he never gives the animal and the human 
traits equal billing. The human approach, discussion 
(disceptationem), is always preferable to the animal, 
force (vim). Furthermore, resorting to the animal has 
to be justified within the context of the human. The 
centaur is not an appropriate image for Cicero’s distri- 
bution of human and animal traits, and Machiavelli’s 
employment of the image leaves little doubt as to the 
unprecedented importance of animalism to princely 
success.9

Machiavelli’s divergence from Cicero does not end 
with a dispute over the zoological hierarchy, however. 
More important, Machiavelli redraws the very bound- 
aries ratified by Cicero between the human and the 
animal. Following his Greek mentors, Cicero places 
reason (rationis) exclusively in the human domain and 
then deduces that only humans are endowed with a 
historical consciousness, in which events can be linked 
to analytical continua (Cicero 1913, 13 [De Officiis 
1.4.11]). In addition, reason facilitates speech, which 
promotes community, which then promotes public 
assemblies along with unusual tenderness and love for 
one’s children (p. 15 [De Officiis 1.4.12]). Animal 
behavior, for Cicero, is reserved for the likes of
Thrasymachus, who intrude on civility “like a wild
beast” (Plato 1968, 13 [Republic 336b]) and scare 
people out of their wits. Machiavelli is much less 
generous to the realm of the human. The human 
environment is one of laws (moral, political, religious), 
not intellectual or emotional superiority: “You must 
then know how there are two means of combat: one 
with the laws, the other with force” (Machiavelli 1960, 
72 [Prince 18]).10 The animal dwells in the realm of
“force,” which now includes all extralegal forms of
coercion and persuasion. There is little similarity between
Cicero’s vim and Machiavelli’s forza.11


8 Machiavelli (1960, 72 [Prince 18]) is quite precise in describing the
centaur as “one half beast and half man” (uno mezzo bestia et mezzo
uomo). Viroli (1998, 54), who craves humanism in Machiavelli, hones
in on the intermittent depictions of princely animalism as “detestable,”
to be always avoided when “laudable” human recourse is
available. Even if Machiavelli’s depictions are sincere, which is highly
doubtful, the fact remains that introducing princely animalism with
the image of the centaur sends a strong message regarding its
importance to appropriate princely comportment.

9 That Machiavelli’s intention in using the centaur is to deflate
anthropocentrism is reinforced by passages in the Golden Ass, in
which the boar chides humanity for its silly self-aggrandizement
(Machiavelli 1965, 299 [Golden Ass 8.37-9]). Cicero, however, is not
Machiavelli’s only target. Pico’s ([1496] 1942, 101-65) De hominis
dignitate is similarly self-absorbed and may be assumed to be yet
another object of Machiavelli's derision.

10 “Dovete adunque sapere come sono dua generazione di combattere:
l’uno con le leggi, l’altro, con la forza.” The translations of
Machiavelli are mine and purposely favor clumsy accuracy over

At first, this expansion of the animal realm, both 
hierarchically and behaviorally, may not trouble those 
who focus on Machiavelli's cleverness. In fact, that 
animals might metaphorically be endowed with speech 
and a sense of history—with at most a half-hearted 
obeisance to moral, political, and religious laws— 
seems to prepare us perfectly for the fox, who is more 
sophisticated than the impetuous Thrasymachus. Yet, 
given Machiavelli’s equal partitioning of fox and lion, 
we are justified in exploring the possibility that the 
expanded realm of princely animalism, prepared by a 
discussion of centaurs, includes an enhanced artillery 
for the lion, too. If forza includes reason, then might it 
not also include an extreme tenderness toward off- 
spring, a trait more suited to a lion, who prefers the 
company of the pride to the loneliness of the fox? 
There is good reason to believe that Machiavelli’s lion 
is capable of more than Sasso’s “impetuosity and 
violence.” And there is good reason to believe that 
Machiavelli’s prince benefits not only from unencum- 
bered cleverness but also from unencumbered physical 
magnificence. 

Our suspicion that the expanded realm for animals 
in general may extend specifically to the realm of lions 
is reinforced by the particular centaur selected to 
introduce princely animalism. Machiavelli chooses Chi- 
ron, the complex and compelling tutor to Achilles “and 
many others of those ancient princes” (Machiavelli 
1960, 72 [Prince 18]).12 Chiron is no garden variety
centaur. His parentage is of Cronos and Philyra, and
his more rowdy cousins are descendants of Ixion. While 
other centaurs are wreaking havoc,13 Chiron teaches
Achilles the finer points of medicine and music. For 
those who transport Plato’s contempt for the animal to 
Machiavelli’s treatment, the choice of Chiron as 
princely model is strange; any of his more rambunc- 
tious associates would be better prototypes, it is 
thought. In the tradition of Dietz (1986), the selection 
of Chiron is viewed as a trick, one more instance in 
which “Machiavelli does not mean what he says” 
(Arieti 1995, 389). Chiron, we are informed, could 
never qualify as a quintessential animal. “How apt a
symbol is Chiron?” asks Arieti. “The answer is that he
is totally, completely, and entirely inapt. Machiavelli is
right that Chiron was the tutor of Achilles, but what he
taught Achilles was medicine. Throughout ancient
literature, Chiron is the kindly, poetic, artistic, peace-loving
centaur, the very antithesis of beast-like qualities”
(p. 387, italics in original).


fluidity. The original Italian, from the Feltrinelli editions, is included

11 Although she succumbs to the humanist bias that laws are superior to
force, Pitkin (1984, 46) also notices Machiavelli’s reduction of the
human domain. Indeed, his less exquisite humans reflect a medieval
bias, which held that the differences between animals and humans lay
in their relative tameness, not relative intelligence (Yamamoto 2000,
24-5). Tameness is akin to “law-abidingness.”

12 “e molti altri di quelli principi antichi.”

13 The contrast is best preserved in art. The western pediment of the
Temple of Zeus at Olympia depicts the abduction of the Lapith
women by the vulgar centaurs. Yet, when Chiron is depicted without
his boorish associates (fresco, National Archaeological Museum
of Naples, no. 9109), he is seen with a harp and in affectionate
embrace with Achilles.

Looking at Chiron in Machiavelli’s terms, however, 
one begins to see an animal par excellence. Chiron is 
an apt model for forza broadly understood, as extra- 
legal persuasion and coercion. Achilles lacked not 
ferocity but the subtlety of the animal, which tutoring 
in medicine and music could provide. “The son of 
Philyra made the boy Achilles accomplished on the 
lyre, and by his peaceful art subdued those savage 
passions” (Ovid 1929, 13 [Art of Love, 1.11-2]). Chi- 
ron’s animalism is a sophisticated force, which involves 
both thinking and acting. Achilles learns how to plan 
an attack, how to execute it, how to enjoy the victory 
properly, and how to lament defeat.14 As his name
suggests (“Chiron” is an affectionate derivative of
cheirosophos, “skilled in hand crafts”), Chiron teaches
Achilles the most complex of the “touching” arts:
music and medicine.15 Achilles’ physical presence is
rendered formidable and versatile. He can seduce and 
heal as well as he can fight. 

The choice of Chiron from among the other centaurs 
is not the only telling component of the tutoring 
scenario. By alluding to others but not naming them, 
Machiavelli distinguishes Achilles from Chiron’s other 
formidable students. Had Machiavelli wanted to focus 
on strength, he might have mentioned Chiron’s student 
Hercules; had his context called for only a healer, he 
might have employed yet another student, Asclepius. 
But if the intention is to highlight the sophistication 
and breadth of the Chiron curriculum, then Achilles is 
the apt choice. Of these three, only Achilles receives a 
balanced education in the sensual arts, using his hands 
to heal and to perform as well as to slay (see Mackie 
1997). 

It is no surprise that Machiavelli chose Chiron for 
the cover of his comedy, Mandragola. If Chiron repre- 
sents a much broader concept of sensuality than that of 
brute force, then it makes sense to have him introduce 
Machiavelli’s literary masterpiece. Fido (1977, 109- 
22), although convinced that Machiavelli personally 
chose the cover illustration,16 offers multiple complex
explanations of how Machiavelli might hold two simultaneous
visions of Chiron, one the fierce tutor of
Achilles, the other the musical accompaniment to
comedy. He does not consider the possibility that the
Chiron of The Prince is precisely the musician of
Mandragola.


14 “Lighten every ill with wine and song, sweet consolations for
unlovely sorrow” (Horace 1914, 397 [Epodes 13]). Chiron also
showed unusual tenderness (for Cicero a human trait) for his
daughter and wept upon hearing of her forced metamorphosis. And
this: “The Centaur was rejoicing in his foster-child of heavenly stock,
glad at the honour which the task brought with it” (Ovid 1916, v. 1,
107, 105 [Metamorphoses 2.676, 2.633]).

15 As for Chiron teaching music to Achilles, the tutoring is implied in
The Iliad (see Mackie 1997, 8) and is explicit in Horace (1914, 397
[Epodes 13]) and Ovid (1929, 13 [Art of Love 1.11]). As for medicine,
see Homer 1924, 543 (Iliad 11.832).

16 That Machiavelli did indeed make the choice is not as certain as
Fido would have it, however. For an excellent treatment of the
publication history of Mandragola and Machiavelli's other works, see
Bertelli and Innocenti 1979, esp. xxi-cxviii.


CHIRON’S OBSOLESCENCE 


Although my ultimate intention is to link Machiavelli’s 
lion to the sophisticated sensuality of Chiron instead of 
the anthropocentrism of Cicero and classical human- 
ism, a preliminary discussion of Machiavelli’s transition 
from centaur to lion (and fox) is in order. Why is 
Chiron, an appropriate teacher for Achilles, not suffi- 
cient to explain appropriate animalism to the budding 
prince? Why, after exploiting the centaur to establish 
the importance of animal traits, does Machiavelli 
choose to dissect his ancient prototype into the more 
specific constituent elements of lion and fox? Of 
course, a simple and pertinent influence would be the 
popularity of lions and foxes in Renaissance imagery, 
and I will return to that issue. Here I want to discuss a 
more profound instigation for Machiavelli’s choice of 
these particular images. I want to show that the insuf- 
ficiency of Chiron and Achilles is related to a more 
formidable discrepancy between simple and complex 
societies, between ancient Rome and Renaissance Ita- 
ly.17 I also want to show that the discrepancy necessi- 
tates even closer attention to the lion traits. 

In chapter 19, just after the discussion of animals, 
Machiavelli introduces the concept of corruption with 
the advice that it be attended by princely scrutiny. He 
goes on to argue that in ancient times princely scrutiny 
was best directed at the military, but now the common 
civilian class is more in need of it, since “they can do 
more” (possono più) than the armed forces (Machiavelli
1960, 83 [Prince 19]). Corruption has afflicted the
people and made them more formidable, threatening 
princely endeavors. I believe it is this new climate of 
popular corruption that compels Machiavelli to subdi- 
vide his ancient tutor. The prince must hone some 
particular behaviors that, although they may have been 
part of Chiron’s arsenal, can draw undistracted atten- 
tion when isolated in the image of the lion. 

As for how it is that corruption is precipitated and 
thereby necessitates a keener attention to leonine 
traits, The Prince responds only with an evasive, tanta- 
lizing latinism, quodammodo (somehow) (Machiavelli 
1960, 80 [Prince 19]). Why The Prince is so blatantly 
unanalytical will be discussed later. Only in consulting 
the Discourses and the Art of War do we learn that the
corruption of the people is the result of communal
experience, in both the political and natural environments.18
Civilization has chipped away at the people’s
innocence,19 and they have taken on characteristics
that are resistant to princely influence. Numa has an
easier time of it, as would a prince among mountain
dwellers, since civilization is for both in a primitive
state (Machiavelli 1960, 161-2 [Discourses 1.11]). The
inhabitants of mature civilizations, however, suffer a
potentially debilitating cynicism. Referencing Plutarch
(1909), who is considered “the most serious writer”
(gravissimo scrittore) (Machiavelli 1960, 275 [Discourses
2.1]), and quoting Livy directly on the subject (p. 164
[Discourses 1.12]), Machiavelli describes a decline of
innocence: People have gone from eager acceptance of
leaders’ nobility to a crass skepticism of anything but
mundane claims to leadership.


17 The disconnection is not often recognized and is part of a much
wider syndrome that mistakenly assumes Machiavelli's admiration
for the past demonstrates his interest in reviving it. Machiavelli is
said to have been “indoctrinated” in the pagan revival (Sabine and
Thorson 1973, 335; see also Plamenatz 1963, 1:32; Skinner 1978,
1:182; Strauss and Cropsey 1972, 281). Russo (1983, 15) does not
even notice the switch from centaur to lion and fox, claiming that
Machiavelli intended the centaur to be the appropriate model for the
modern individual. I will show that Machiavelli recognizes contemporary
developments that make a return impossible. For a systematic
treatment of the distinction between admiration and revival regarding
religion, see Lukes 1984.

Perhaps the most intriguing analysis of popular 
corruption appears in the Art of War, in which Machia- 
velli directly connects two phenomena: the tendency of 
ancient populations to fear eclipses and earthquakes, 
and the ability of military leaders to exploit that fear 
with clever interpretations attached “to their purpose” 
(Machiavelli 1961a, 487 [Art of War 6]).20 Recalling a
humorous anecdote from The Twelve Caesars (Suetonius
1957, 35 [sec. 59]), Machiavelli exposes an audience
so simple that it can be hoodwinked by Caesar’s
cute recovery when he fell down upon disembarking in
Africa and decreed: “Africa, I have taken you” (Machiavelli
1961a, 487 [Art of War 6]).21

Things are different in Machiavelli’s Italy, where the 
blunders of Caesar’s descendants are received with 
cynical smugness. To believe that an earthquake por- 
tends a political disaster, the populace needs to trust 
the statement of the leader making the connection, and 
it needs to be ignorant of the science of earthquakes. In 
the case of Italy, both requisites are missing. As for the 
issue of trust, Machiavelli complains that leaders now 
use their powers of deception indiscriminately. 
Whereas ancient rulers invoked augurs and oracles
during times of military crisis, modern Italian rulers 
have stretched their credibility beyond all limits. In the 
Romagna, before the intervention of Alexander VI and 
Valentino, rulers resorted to venal trickery casually and 
frequently, and it did not take many such episodes to 
alienate the population (Machiavelli 1960, 465 [Discourses
3.29]). Modern Italy has succumbed to an
imbalance of cleverness, of foxiness.22


18 Although I do not believe that the Discourses or any other of
Machiavelli's works can be considered mere chatty versions of The
Prince, to be casually exploited for clarification of the shorter work,
connections can be made cautiously, especially when linking un
assertions in The Prince to more reserved analyses elsewhere.
In fact, I will argue that special attention to traits of the lion
in The Prince precludes Machiavelli's employment of lengthy analytical
treatments. Because I can only offer that explanation after an
examination of the lion’s curriculum, I will postpone the discussion
that justifies consultation of works outside The Prince.

19 “A sculptor will create more easily a beautiful statue of rough
marble than of one poorly hewn by others” (Machiavelli 1960, 162
[Discourses 1.11]). (Uno scultore trarrà più facilmente una bella
statua d'un marmo rozzo, che d'uno male abbozzato da altrui.)

20 “a loro proposito.”

21 “Affrica, io t’ho presa.”

In a striking admission, Machiavelli identifies the 
other part of the cynicism equation, that issuing from a 
more educated perception of natural phenomena. In so 
doing, he objects to the facet of modernity for which he 
is accused of being unthinkingly responsible. He no- 
tices that people become cynical and lethargic “when 
more knowledgeable about natural things” (Machiavelli
1960, 164 [Discourses 1.12]).23 Science is no panacea;
it can sap practitioners of their energy. Lightning
and earthquakes are more likely to be inspirational 
when they are thought to issue from angry gods. This 
disrupts the depiction of Machiavelli as a paragon of 
cleverness and founder of the scientific study of poli- 
tics. If the source of the debilitating popular corruption 
is cynicism, which comes from more clever apprehensions
of leaders and nature, foxiness cannot be enough
to succeed in a political environment that suffers from 
growing corruption. By giving the lion a separate 
identity, Machiavelli attempts to address the deficien- 
cies of the fox and isolate the necessary countervailing 
traits. 

We may now appreciate the importance of the 
dissection of Chiron. The constituent parts may have 
been in harmony in the ancient (or mountain) setting, 
but recent history reveals a discontinuity. Thinking and 
acting, seeing and touching, cleverness and vigor are 
not always, indeed not often, in equilibrium. Achilles 
represents a simpler, more noble, past, in which princes 
such as Camillus need only request sacrifices of their 
subjects, and compliance is forthcoming (Machiavelli 
1960, 254 [Discourses 1.55]); or in which Numa
Pompilius can claim to be on friendly terms with a nymph
(p. 161 [Discourses 1.11]). Clever leaders could count 
on the good “feelings” of their constituents and thus 
could elicit devotion even when employing intermittent 
deception. Modern princes encounter no such environment.
Crude and excessive cleverness now prevails,24
and before the affective resources available to the 
prince are overwhelmed, Machiavelli isolates and pro- 
tects them in the image of the lion. 

Other scholars have paid special attention to modern 
corruption. Moravia (1964, 125), Paparelli (1970), 
Sasso (1980), and Wolin (1960) all resist Machiavelli’s 
reputation as the cool and clinical political scientist; 
instead, they trace his drastic suggestions to a fear that 
the environment in which humans must survive is 
growing ever more inhospitable, and therefore serious 
tactical measures must be taken to ensure functionality.
But these scholars tend to bemoan or rationalize
what they concede is a necessary ascent of the fox
rather than deal with the complexity of Machiavelli’s
response to popular corruption. The cynicism attendant
on political exposés and the deciphering of nature
is mostly immune to foxy therapy and is in fact its
byproduct. To succeed, founders and their constituents
must be “happy” (Machiavelli 1960, 31 [Prince 6]), and
thus they require some freshness and emotion not
found in the fox.


22 Machiavelli discusses one of Aesop’s fables in a letter to Vettori.
It concerns the fox’s encounters with the lion: “The first time he was
dying of fear, the second he stopped himself to observe him from
behind a bush, the third he spoke to him” (Machiavelli 1961b, 292
[Letters 138, 26 August 1513]). (La prima volta fu per morire di
paura, la seconda si fermò a guardarlo drieto un cespuglio, la terza
gli favellò.)

23 “quanto più conoscitori delle cose naturali.”

24 Merleau-Ponty (1964, 217) seems to recognize the imbalance
when he emphasizes Machiavelli's interest in emotional reward:
“The prince must have a feeling for these echoes that his words and
deeds arouse.”

Even though foxiness is in need of refinement, and 
Machiavelli certainly offers his suggestions, The Prince 
is not just about foxiness. It is not just an intro- 
duction to Lasswell’s political science and to classical 
liberalism. Indeed, Machiavelli precedes Strauss and 
Horkheimer in lamenting a politics that has been 
detached from its “why.” The corruption Machiavelli 
identifies, particularly citizen cynicism, puts the “why” 
in jeopardy. And rather than endorse a method that 
exacerbates the loss, Machiavelli offers the lion and its 
affective arsenal in an attempt to rescue politics from 
an excess of foxiness. 

As to why the lesson of the lion may not be as 
accessible, or at least as “visible,” as that of the fox, 
Nietzsche, whose connection to Machiavelli has not 
gone unnoticed,25 may have something to say. That
analysts bypass the widening of the Platonic agenda for
its lowering is a tribute to Plato’s seductiveness, not a
sign of his defeat.26 In addition, if the lion is more
sensual than intellectual, we may have to look else- 
where than to systematic prose for the lessons. Inspi- 
ration is not a matter for traditional instruction, and it 
may be contained more in impressions than in specifi- 
cations. Indeed, the brevity of The Prince is evidence 
that its message may be as much in what is not said as 
in what is said. But I am not offering just one more
conjecture about hidden meanings and secret mes- 
sages. The lessons concerning the lion are in The Prince 
for all to see, and I will identify a few of them. If the fox 
is the part of forza that is intellectual, then the lion 
reasonably can be held to represent the sensual part. 
And if the lion is of Chiron’s ancestry, then there is 
much more to the lion’s sensuousness than ferocity.27


25 See Nietzsche’s (Beyond Good and Evil 28) famous tribute to
Machiavelli that celebrates his non-German allegrissimo. Adams
(Machiavelli 1977) includes passages from Nietzsche in his edition of
The Prince, but he connects the two only in terms of their dissection
of morality. Hulliung (1983) devotes a good deal of discussion to
comparison of the two.

26 The Platonic agenda is maintained in Berlin’s (1953) famous essay, 
“The Hedgehog and the Fox,” in which nary a mention of the lion
occurs in a discourse that locks political philosophy in a continuum 
characterized by intellectual rather than active, affective relations 
with reality. 

27 Oblivious to Machiavelli's connection of Chiron and the lion,
Allan Gilbert (1938, 120-1), in a single passage, underestimates both
the aptitudes and the importance of the lion: “Nor is the lion here the
lion accompanying the virtue of Fortitude, as the portal of the
cathedral of Amiens; it is the lion of inhuman cruelty such as might
typify Hannibal, who succeeded by detestable measures. But the
concern of Machiavelli is not so much with force as with another sort
of brutish conduct, characterized by craft.”
MACHIAVELLI’S LION


Even though I focus on the image of the lion, my 
primary intention is to identify in Machiavelli concerns 
that will challenge his reputation as simply a proponent 
of deception and violence. A demonstration that Ma- 
chiavelli intends all his departures from foxiness to be 
associated with the lion is less important, or feasible, 
than a recognition of the inadequacies of only tactical 
advice in princely comportment. It certainly can be 
established that Machiavelli’s Renaissance audience 
might be disposed to associate the lion with complex 
sensual characteristics (like those associated with Chi- 
ron), instead of with mere ferocity. The lion had long 
since lived up to its regal etymological roots, especially 
for Florentines, whose “Davidian”28 insecurities were
frequently teased by the imperialistic Venetians, whose
mascot was the winged lion of San Marco.29 Of course,
the Florentines could respond with their own lion, 
Marzocco, and Machiavelli made certain of the ram- 
pant lion’s image on the parade grounds of the city’s 
militia (Bayley 1961, 255). Even on the cosmic level, 
the lion assumed prominence in Machiavelli’s time 
through linking the nurturing sun to the constellation 
Leo (Klingender 1971, 456). 

More pervasive, however, are two literary traditions 
that provide Machiavelli and his readers rich leonine 
depictions.30 The first is that of the bestiary, the
intriguing synthesis of biology and mythology contain- 
ing pictures and descriptions of animals that combine 
the clinical with the whimsical. Descended from the 
Greek and Latin Physiologus, the continental bestiaries 
were remarkably consistent (Barber 1993, 12). Italian 
versions appeared in the thirteenth century (Dardano
1967, 29), a subdivision of which has come to be known 
as the bestiario toscano and a connection with Machia- 
velli has been noted (Haist 1999, 8). Even Machiavelli’s 
purported collaborator, Leonardo da Vinci (see Mas- 
ters 1996), tried his hand at a bestiary (da Vinci [1939] 
1974), and Machiavelli’s own robust appreciation of
animal is reinforced in his Dell’asino d'oro 
(Machiavelli 1965, 267-302). The lion in that poem is 
first described as fierce and bestial (p. 275 [Golden Ass 
2.58]), but it is later revealed to be susceptible to great 
pride and sadness (pp. 300-1 [Golden Ass 8.79-81]). 

The bestiaries are clear in their depictions of the lion 
as a versatile and complex entity. The texts establish its 
status as king of beasts and discuss the lion’s affect. 
Snakes, fire, and loud noises are among the lion’s 
phobias, which reinforces the image of a creature of 
complex and sympathetic emotions. The formidable 
sensuality of Chiron, along with his medicinal aptitudes,
reverberates in the lion’s capacity to resuscitate
stillborn cubs with an invigorating lick (George and
Yapp 1991, 48). Perhaps most interesting for my
purposes, however, is that by the late Middle Ages the
violent and impetuous qualities first attributed to the
lion had been mostly transferred to the panther, which
provided “a means of isolating and channeling the
aggressive characteristics traditionally attributed to the
lion” (Haist 1999, 11). The leopard is described as so
impatient that it can tolerate only three or four
pounces before seeking alternative prey (McKenzie
1905, 394). In Machiavelli’s time, the authority of the
lion was linked more to courage, respect, and integrity
than to aggression and cruelty, thus solidifying the
lion’s heraldic identity (Yamamoto 2000, 75-131). The
bestiary prepared Machiavelli’s audience for a lion
whose ferocity could be tempered and enhanced with
the same complex sensual traits detected, albeit less
distinctly, in Chiron.


28 Michelangelo’s David is often associated with the Florentine
self-image: small and ingenious, courageous in the face of giants.

29 Even the biblical lion contributes to the complexity. The lion’s
association with Saint Mark regards not only the “royal dignity of
Christ” but also “a voice crying” (Mark 1:3). Regalness is complemented
by pathos and loneliness. See Appleton and Bridges 1959, 60.
For the diversity of leonine depictions in the Venetian Republic, see
Rien 2000.

30 If I were to explore both metaphors instead of just that of the lion,
I would need to add a third influence, that of Reynard the fox. See
Best 1983.

The more commonly cited source of inspiration for 
Machiavelli’s lion metaphor, however, is the Fables of 
Aesop, one of which is mentioned in a letter to Vettori 
(Machiavelli 1961b, 292 [Letters 138, 26 August 1513]). 
Najemy (1993, 510-1) notes the similarity of Machia- 
velli’s rendition to that of the Latin version published 
by Ermolao Barbaro (1977). He fails to mention the 
influence of a more famous edition, the exquisite 
“Medici Aesop,” a Greek language version almost 
certainly illuminated by the esteemed artist Gherardo 
di Giovanni. Although Machiavelli probably did not 
read Greek, it is impossible that he was unaware of the 
text, which was assembled to teach the ancient lan- 
guage to the children of Lorenzo the Magnificent 
(Fahy 1989, 10). The fable in the Vettori letter, on the 
lion and the fox, is contained in the Medici Aesop and 
is remarkably similar to it. 

The importance of the fables in the education of 
Machiavelli’s potential patrons and protégé, then, sug- 
gests a connection to his lion. And, as in the bestiaries, 
the Tuscan versions of which actually include select 
fables (Goldstaub and Wendriner 1892), it is clear that 
the lion’s authority is based as much upon character as 
upon muscle. One fable in the Medici edition depicts 
an infirm lion, “dying in his cave” (Aesop 1989, 93). All 
the animals except one come to pay their respects. The 
exception is the fox, apparently immune to memories 
of the lion’s magnetism. The limits of deception are 
revealed in another fable, when a donkey disguises 
himself under a lion skin. The ruse is exposed by the 
fox, “who knew the difference between a donkey’s bray 
and a lion’s roar” (p. 136). In yet another fable, an 
aging lion turns to guile in hunting when sapped of his 
youthful speed and strength (p. 161). The lions of 
Aesop, and of the Medici version in particular, are 
charismatic as well as fierce. Their qualities are such 
that they cannot be imitated, and they are capable of 
foxiness when constrained by loss of virility. For the 
king of beasts, cleverness is more a retirement diver- 
sion than a rare and inaccessible aptitude. 

These discussions of Renaissance lions prepare us 
for a return to chapter 18 and a revised apprehension 
of the commentary that immediately follows Machia-
velli’s mention of the animal hybrid. Machiavelli intro-
duces two senses, sight and touch. Clearly, he means 
touch in the figurative sense of “feeling,” since he says 
that most people are capable only of seeing, and that 
the prince must therefore attend to appearance. Ap- 
pearance is the purview of the fox, who in the fables is 
unaffected even by the imminent departure of the lion,
and whose coldness facilitates seeming over being. And 
the discussion is invariably interpreted to support the 
preeminence of cleverness in Machiavelli’s thinking. 
Since many more people can see things than can feel 
them, the argument goes, the aptitudes attendant on 
the visible must of necessity be the most important (see 
Blanchard 1984; Feaver 1984; Marolda 1979). 

Touch is a rarer but no less important aptitude;31 
however, the lion’s relative obscurity facilitates the 
faulty syllogism that links importance with frequency. 
Although individual citizens may not often touch the 
prince, the capable prince has the capacity to touch the 
collective and be touched by it. Sight is a sense, but it 
is not sensual. The observant fox lives a lonely life, dies 
a lonely death, and is not a fully political animal. It is 
left to the lion side of the prince to undertake the less 
accessible but nonetheless crucial sensual aspect of 
politics, to cultivate the inclination to touch, and to 
receive and revel in the touch of loyal adherents. 

But beyond brute strength and an imprecise connec- 
tion to the tactile, precisely what sensual elements does 
Machiavelli’s lion bring to the political setting? I 
suggest that the lion stands for virility and community, 
traits that are consistent with Machiavelli’s generous 
distribution of aptitudes to animals, with the inclusion 
of Chiron in the lineage of princely models, and with 
the animal mythology that surrounds Machiavelli and 
his audience. They are also traits that appear fre- 
quently in The Prince but receive inadequate attention. 
Although I will divide my discussion of community and 
virility, in the end I will show they are quite interde- 
pendent. 


Virility 


That physical, sensual prowess, the kind that cannot be 
simulated, is important to the prince may be linked to 
Machiavelli’s preoccupation with the phenomenon of 
time. The first substantive discussion of virtù in The 
Prince is associated not with cleverness but with time: It 
cannot be wasted. Referring to the ancient Romans, 
Machiavelli (1960, 22 [Prince 3])32 writes: “Never
would it have pleased them that which is in the mouths 
of all the experts of our time, to relish the benefit of 
time, so much better was their virtue and prudence; 


31 Although the details must be saved for another essay, I would say
that Machiavelli’s interest in moving from a principality to a republic
is dependent upon the extent to which the sense of touch can be
inspired and transferred from the princely few to the republican
many. For a detailed treatment of the sense of touch in Machiavelli,
see Blanchard 1996.

32 “Né piacque mai loro quello che tutto dì è in bocca de’ savii de’
nostri tempi, di godere el benefizio del tempo, ma sì bene quello
della virtù e prudentia loro; perché el tempo si caccia innanzi ogni
cosa, e può condurre seco bene come male e male come bene.”
because time eventually overtakes everything, and can
bring with it good as well as bad and bad as well as 
good.” 

In The Republic, Plato is not very kind to the warrior
character, Polemarchus, who is dismissed as a sweet
but gullible bumpkin. In Machiavelli’s dialogue, The
Art of War, Luigi, the counterpart to Plato’s warrior, is
given more sincere respect. Fabrizio, Machiavelli’s 
Socrates, chooses Luigi as interlocutor because his 
youth predisposes him to action (Machiavelli 1961a, 
398 [Art of War 3]). That he is more likely to carry out
(eseguire) the advice makes him more fit to receive it. 
There is a connection between youth and action, action 
and politics. A clever plan needs a vigorous proponent. 
I believe Machiavelli’s distaste for the normative, con- 
templative approach to politics stems as much from its 
undesirability as from its infeasibility. Glory, courage, 
and even more common appetitive interests are not 
“settled for,” as self-preservation may have been for 
Hobbes and Locke. Luigi has qualities and perspectives 
that are just as important to political success as those of 
the intellectual, and the intellectual does not have the 
capacity to appreciate that fully. Hobbes and Locke 
may have chosen their priorities on the basis of what 
can be “trusted.” Machiavelli’s choices are not made by 
default.

The Prince is written for a Luigi. It is the most
optimistic of Machiavelli’s works with regard to the 
susceptibility of fortune to human cleverness. In the 
Discourses, humans are said to be restricted by their 
natural stubbornness and the comfort they receive 
from acting habitually. Pocock (1972, 173), relying on 
these and similar comments, attributes fortune’s ulti- 
mate superiority to human inertia: “We cannot change 
our natures as fast as our circumstances change.” Yet, 
we are told that the prince is to have “a spirit disposed 
to change itself according to the winds and to variations 
of fortune that command” (Machiavelli 1960, 73-4 
[Prince 18]).34 And this is immediately preceded by the
famous admonition to simulate rather than embrace 
conventional virtues. The simulator, the fox, is capable 
of avoiding inertia by remaining emotionally aloof. 

It appears that princely vulnerability issues more 
from physical frailty than intellectual deficiencies. Af- 
ter all, Machiavelli’s ancient heroes, Moses, Cyrus, 
Romulus, and Theseus, owed to fortune only their 
occasione, that is, the time and place propitious enough 
to showcase their talents. Had they the abilities to 
overcome the limitations of time and space, our im- 
pression is that they would have owed fortune nothing. 
They would have found an appropriate venue for their 
brilliance. Being mortals, however, they depended on 
fortune for the coincidence of cause and charisma. The 
association of time with Machiavelli’s concept of virtue 
is thus established. Whereas cleverness may accumu- 
late over time, virility requires timely exploitation. 





33 Just as the young Callimaco “burns with desire” (Machiavelli 1965,
62 [Mandragola 1.1]), so does Luigi in his conversation with Fabrizio
(Machiavelli 1961a, 408 [Art of War 3]).

34 “un animo disposto a volgersi secondo ch'e' venti e le variazioni
della fortuna li comandono.”
Similarly, in Machiavelli’s setting, the hero of the
day, Cesare Borgia, found “only opposed to his designs 
the brevity of Alexander’s life and his own illness” 
(Machiavelli 1960, 39 [Prince 7]).35 Even Borgia’s mon-
umental mistake in judgment, endorsing della Rovere
for pope, was subtly linked to his health rather than 
deficiencies in cleverness. Just before discussing the 
“mistake” that was the cause of Borgia’s demise, 
Machiavelli mentions a discussion with Valentino in 
which the duke claimed he had considered his father’s 
premature departure and was ready with a clever plan. 
What Borgia could not have anticipated, or overcome 
even if he had anticipated it, was his own mortality: “To 
everything he had found remedy, except that he never 
thought, upon his [Alexander’s] death, to be also
himself dying” (Machiavelli 1960, 39 [Prince 7]).36
Borgia’s cleverness was compromised by his mortality, 
not by debilitating intellectual deficiencies. 

In these discussions we encounter a sense of urgency. 
Borgia is of “great spirit,” and he has “great intention” 
as well as “so much fierceness” (Machiavelli 1960, 39 
[Prince 7]).37 These are transitory elements in his
princely arsenal. Machiavelli is hesitant to distract his 
young students with merciless analytical scrutiny of 
Borgia, like those included in Legazioni (Machiavelli
1964, 599 [Legations 1]).38 Luigi and his ilk (Lorenzo)
are inspired to act, to take advantage of what is a 
limited window. A few errors in cleverness are not 
nearly as disabling as an occasion lost in analytical 
paralysis. 

It is not surprising, given the scholarly focus on 
Machiavelli’s foxiness, that some commentators assert 
that Machiavelli’s nemesis, fortune, can be overcome 
by literary fame. Although Moses, Cyrus, Romulus, 
and Theseus may depend upon “occasione” for their 
successes, Machiavelli’s eventual triumph is guaran- 
teed as long as he has readers (Strauss 1958, 168; see 
also Flaumenhaft 1978). Cleverness is cumulative, and 
all one needs is eyes to take it in and an intellectual 
aptitude to employ it. It can be passed from generation 
to generation, perhaps most effectively in the form of 
books. Even death, it is argued, can be outfoxed. 

Yet, although Plato and his acolytes might suggest 
that talking and thinking about wine are superior to 
drinking it (Strauss 1959, 31), Machiavelli insists that 
the thought is deficient without proximate action. In 
appreciating the youth of Luigi, and the corresponding 
qualities of spirit and courage, Machiavelli, rather than 
escape intellectualism, addresses its limitations. The
lion feels, and a victory without the ability to be present 
to revel in it is hollow indeed (see Machiavelli 1965, 
300 [Golden Ass 8.76]). Until Machiavelli’s banish- 
ment, writing was ancillary to his political endeavors. 
Prowess cannot be experienced vicariously, and unlike 


35 “solo si oppose alli sua disegni la brevità della vita di Alessandro
e la malattia sua.”

36 “a tutto aveva trovato remedio, eccetto che non pensò mai, in su
la sua morte, di stare ancora lui per morire.”

37 “l’animo grande e la sua intenzione alta,” “tanta ferocia.”

38 For a full treatment of Machiavelli’s various positions on Borgia,
see Sasso 1966. Gilbert (1965, 170) also discusses the treatment of
Borgia in The Prince compared to other of Machiavelli's works.
cleverness, is not cumulative. Rather, prowess is a
perishable commodity. It is powerful only when it is 
fresh, and it is difficult, if not impossible, to simulate 
freshness. As his friends joked and worried about the 
depth of Barbera’s affections, Machiavelli himself, in 
the autobiographical character of Clizia’s Nicomaco 
and in his poem to Barbera, considers the limits of 
simulation. In this context, then, even the bawdiest of 
Machiavelli’s letters may be consulted with motives 
more respectable than salacious. 

Machiavelli (1960, 101 [Prince 25]) states that for- 
tune favors the young; he does not write “he who seems 
to be young.” The advantage of seeming over being is 
reserved for the foxy characteristics. In terms of the 
lion, being young is preferable to seeming young,39 and
the passion and vigor of youth, and youth’s followers, 
cannot be replicated in lifeless fame. Pertinax, regard- 
less of his leadership skills, drew not just hatred but 
“contempt” (disprezzo) from a constituency disen-
chanted with his advanced years (pp. 79-80 [Prince 19]).
In this sense, then, politics involves more than the 
Socratic rather than less. Certainly, Machiavelli gave 
up on The Republic, but as much because of deficien- 
cies of the model as because of deficiencies of the raw 
materials. For Machiavelli, intellectual rewards are not
especially compelling, and they certainly are insuffi- 
cient to sustain a vibrant polity or an effective politi- 
cian. 

I now may address just why Machiavelli’s analytical 
discussions (in the case of this article, the corruption of 
modern societies) must often be sought outside The 
Prince. Helpful are the works of those scholars who 
resist the mainstream notion that The Prince and the 
Discourses together form a seamless, synthetic treat- 
ment of comparative regimes (see Mansfield and Tar- 
cov 1996, xx-xxvii; Strauss 1958, 17-53). Especially 
helpful are the insights of Felix Gilbert (1953, 153-6; 
see also Dionisotti 1980, 257), who shows that The 
Prince is a timely discussion of contemporary concerns, 
whereas the Discourses is more reserved and historical 
in focus. Gilbert’s analysis, however, attempts to ex- 
plain Machiavelli’s migration to historical analysis with 
biographical details (see also Godman 1998). Machia- 
velli turns to history, we are told, because of the 
influence of his comrades in the Orti Oricellari. 

I believe Machiavelli’s approach changes not be- 
cause of the influence of his audience but because of 
his recognition that potential princes require a message 
different from that needed by the erudite but politically 
impotent aristocrats of Machiavelli’s garden conversa- 
tions. The Prince is a handbook of action addressed to 
those capable of acting; the Discourses is a work of 
reflection addressed to those whose opportunities be- 
tray their intelligence. Reflection alone can be dreary, 
counterproductive to princely inspiration. There may 
be at least some accuracy to Machiavelli’s facetious 


39 Machiavelli's two plays, Mandragola and Clizia, make the point.
Clizia, an adaptation of Plautus’s Casina, enhances the wit of the
main character, Nicomaco (Stalino in Casina), and emphasizes his
age. This leaves no doubt that the operative distinction between the
successful Callimaco in Mandragola and the humiliated Nicomaco of
Clizia is the difference in their ages. See Lukes 1981.
claims regarding the tedium of the Discourses and that
he was “forced” to write them. 

But when Machiavelli sees in his audience a poten- 
tial for action, he tempers the lengthy, dry, historical 
analysis appropriate for the intellectual audience of the 
Discourses with the “spirited” policy recommendations 
of The Prince.41 That The Prince is a shorter, less
analytical work does not mean it is less profound than 
the Discourses. Machiavelli (1960, 274 [Discourses 
2.Preface]) is quite sincere when he states that even
though older men are more prudent, their physical 
weakness clouds their judgment. Full “cognition” can 
be achieved only when affairs can be “handled” 
(maneggiare) as well as seen (p. 272 [Discourses 2.Pref- 
ace]). Politics, a balance of fox and lion, shirks analysis 
when it disrupts or deflates action. In the Discourses we 
encounter the ambiguity of two competing theories 
regarding the founding of Rome (p. 128 [Discourses 
1.1]), but princely resolve is boosted in The Prince with 
only one (p. 31 [Prince 6]). Ambivalence is intellectu- 
ally sophisticated; politically, it can be deadly. It should 
not be surprising that the scholar must leave The Prince 
to find a more analytical treatment of the corruption 
that I argue provokes Machiavelli’s migration from 
centaur to lion and fox. It is important for the budding 
prince to adopt lion behavior. Precisely why two sepa-
rate animals are more appropriate models than one
mythical creature is a discussion left for the other, 
more contemplative, audience of the Discourses. 

From this perspective, The Prince does not “forgive” 
Borgia his impetuosity. That is an aspect of his princely 
character, of his virility. Although Machiavelli may not 
necessarily believe Borgia when he says he could have 
conquered everything, Machiavelli does enjoy hearing 
him say it, and he is certain that Borgia’s followers and 
even Fortuna herself, who prefers the “audacious” 
(Machiavelli 1960, 101 [Prince 25]), are equally im- 
pressed with his hubris. Machiavelli admires Borgia not 
only because he is clever but also because he has the 
strength and inclination for tactile experience of the 
“hills, the valleys, the plains, the rivers, and the 
swamps” (p. 63 [Prince 14]).42 These are princely
specifications not often mentioned by those preoccu- 
pied with Machiavelli’s focus on cleverness. When they 
are mentioned, interesting maneuvers ensue. Kahn 
(1986, 64), who notes the foxy recommendation to the 


40 Gilbert (1965) argues that Machiavelli’s friend, Francesco Guic-
ciardini, turned away from politics and toward history in response to
the disappointments of the optimism and humanism of the early
Renaissance. Gilbert argues that Machiavelli retained an interest in
politics because the collapse of Florence was not so imminent when
he wrote. Yet, surely, Machiavelli experienced the ambiguities of
Renaissance life as intensely as Guicciardini. A trusted advisor to the
leaders of the Florentine Republic, Machiavelli suffered six drops of
the strappado and banishment from Florence at the hands of the
resurgent Medici. He was immersed in the cultural achievements of
Florence, yet he lamented its embarrassing vulnerability to the
invading French. Gilbert does not fully recognize that history is not
the only recourse to the ambiguity of the present. As a politico,
Machiavelli strikes a delicate balance between consulting history and
ignoring or reformulating it for the sake of confident action.

41 Machiavelli (1960, 14 [Prince Dedicatory]) literally imbues his gift
to Lorenzo with his “spirit” (animo).

42 “li poggi, le valli, e’ piani, e’ fiumi, e’ paludi che sono”
prince to read history, suggests that Machiavelli
“makes the humanist claim for textual imitation even
more forcefully by comparing skill in government to 
skill in reading, by making the ruler’s landscape into a 
text and the text into a realm of forces. The prince is 
advised to read the terrain (imparare la natura de’ siti).” 

Indeed, the prince is to learn about nature by 
subjecting his body to nature’s inconveniences (disagi) 
(Machiavelli 1960, 63 [Prince 14]). But imparare (to 
learn), Machiavelli’s term, is rendered “to read” by 
Kahn. To conflate the reading of history and trudging 
through the bush into one type of exercise is to miss 
the counterposition of the fox and the lion. Machiavelli 
(p. 63 [Prince 14]), after all, encourages the prince to 
learn the countryside by hunting in it, a far cry from 
reading about it. For him, landscape is not a text to be 
read and manipulated to fit some princely metanarra- 
tive. Rather, the landscape provides limitations to 
cleverness and demands the less fungible traits of youth 
and vigor.43 It takes more than an intellectual plan to
bring down a wolf and maintain a domain. The prince 
is not just consummate cleverness. He is, rather, a 
difficult synthesis in which youth and vigor must par- 
ticipate. 

This perspective activates the “much debated ques- 
tion whether the Italian nationalism of the last chapter 
forms an integral part of Machiavelli’s political out- 
look” (Gilbert 1939, 483). All too often, the sincerity of 
chapter 26 is dismissed by those who cannot reconcile 
the apparent hyperbole with the intellectual aptitudes 
of Lorenzo, Machiavelli’s second choice (Borsellino 
1989, 111; Godman 1998, 300; Prezzolini 1954, 168). 
Yet, Machiavelli makes it clear that a good prince is 
not so much brilliant as opportunistic. Taking advan- 
tage of an opportunity elicits a communal “enjoyment” 
(si godono) that is capable of distracting him from
analytical dissection (non cercano altro) (Machiavelli 
1960, 97 [Prince 24]). And to take such advantage, the 
prince needs vigor and virility, traits of the lion, as 
much as he needs cleverness. Indeed, Machiavelli (p. 
103 [Prince 26]) finds a “genius” (ingegno) in the Italian 
citizenry, and he is not exaggerating. He detects in the 
people not an intellectual brilliance but an emotional, 
communal inclination that is in suspended animation. 
To the lion falls the responsibility of awakening them 
from dormancy. 

Machiavelli (1960, 102 [Prince 26]), having intro- 
duced the healing arts through Chiron, exploits the 
surname of The Prince’s recipient with a medical 
analogy in the last chapter: “As if without life, [Italy] 
awaits that which can be cures to its injuries.”44 No
doubt the physician need be clever, but the capable 
practitioner also must carry a supreme confidence to 
the task. Expertise must be balanced with “feeling,” 
with the belief that the patient is worthy of the 
treatment and the physician is supremely qualified to 





43 Machiavelli (1960, 16-25 [Prince 3]) reinforces the limitations of
cleverness yet again by encouraging the prince to go and live,
physically, in conquered territories.

44 “Come sanza vita, espetta qual possa esser quello che sani le sua
ferite.”
act on the patient’s behalf, not to mention the requisite
confidence the patient must hold in the practitioner. 
These are the traits of the sophisticated lion, and if 
those traits are promulgated in the body of the text, as 
I demonstrate, then the last chapter is less anomaly and 
more parting inspiration. In a book normally inter- 
preted as an homage to cleverness, the closing refer- 
ence to love (p. 105 [Prince 26]) may seem incongruous. 
If the lessons about cleverness are balanced with the 
references to urgency, vitality, and virility, however, 
then the references to love and being loved are not
surprising.


Community 


Youth and vigor may be necessary components of 
princely love, but they are not sufficient. The lion’s 
virility is complemented by its comradery. The lion 
is young and strong but, unlike the fox, is a social 
animal whose power is charismatic as well as muscular. 
Achilles distinguishes himself among fellow warriors by 
sharing his emotions, even suffering, in captivating 
song (Homer 1924, 395, 397 [Iliad 9.186, 189]). Music is
a shared experience; it reveals vulnerabilities that 
lesser warriors must sequester. The lion shares loyalty, 
love, and passion with its community, whereas the fox 
remains cold and lonely. Whereas the unappreciated 
fox may seek redemption in posthumous recognition, 
the lion and his community thrive on shared experi- 
ence. 

In an otherwise excellent biography of Machiavelli, 
Sebastian de Grazia (1989) musters only a weak tribute 
to his motivations. Obviously troubled by Machiavelli’s 
reputation for compromise and banality, de Grazia 
(p. 175) puts forth “the common good” as Machiavelli’s 
true and compelling passion. But having already de- 
scribed Machiavelli as the life of the party at the 
Palazzo Vecchio, whose absences are sorely missed by
his less lively compatriots, as a master equestrian who 
could ride tirelessly for days on important diplomatic 
missions, and as an orator whose skills are not reserved 
for charming alliances of only the political variety, it 
becomes even more difficult to accept that for Machia- 
velli the point of it all is some uncertain and delayed 
gratification of a distant and nebulous cause. Having
preceded “The Point of It All” with chapters titled 
“Irreverent and on the Go” and “The Fool in Love,” 
we are asked not to allow “these biographical ques- 
tions...[to prevent us] from confronting the teleo- 
logical issue” (p. 158). There is apparently no place in 
the teleology of politics for “the sorrow Niccolo ex- 
pressed at being out of politics, when in it he must have 
enjoyed it” (pp. 157-8). Essentially, de Grazia joins 
the Meinecke (1965) camp. Machiavelli’s life, like his 
writings, contains bad and distracting things, and we 
are asked to suspend our knowledge of them in favor of 
a more noble concern. 

If the common good were sufficient reason to do 
politics, then Machiavelli’s hero would be Ligurio, the 
clever trickster of his play, Mandragola. After all, it was 
the successful implementation of his plan that 
prompted Sostrata to crow: “Who would not be hap-
py?” (Machiavelli 1965, 112 [Mandragola 5.6]).45 In-
deed, with few exceptions,46 the scholarly consensus is
that Ligurio represents Machiavelli.47 But Ligurio is a
sad and deficient character, described as a parasite, 
who exploits and manipulates Callimaco’s courage and 
initiative, which are qualities he lacks. In fact, Ligurio 
is a paragon of those who “proceed coldly” (fredda- 
mente procedono) (Machiavelli 1960, 101 [Prince 25]) 
and thus fail to seduce fortune. How can Ligurio be 
considered a quintessential Machiavellian when Ma- 
chiavelli speaks so vehemently of the deficiencies of 
mercenaries and cynics? 

In fact, Machiavelli’s disdain for mercenaries is 
difficult to explain in the context of a preoccupation 
with cleverness. Machiavelli makes the point in chapter 
12 that mercenaries are immune to foxy manipulation. 
Indeed, if they were susceptible, princes would be given 
strategic advice as to how to manufacture fealty. In- 
stead, the prince is counseled to avoid the employment 
of mercenaries like the plague. That the prince is to 
rely on a native militia betrays the limitations of 
cleverness. What a native army can provide, both for 
itself and for its princely leader, is a feeling of common 
purpose, which in soldiers inspires courage and in 
leaders inspires ambition and tenacity. Machiavelli 
(1960, 54 [Prince 12]) wants to maintain the passion of 
politics, despite its intermittent venality, and the em- 
ployment of mercenaries debilitates that passion: 
“They have no other love or cause on the battlefield 
than their stipend, which is not enough to make them 
want to die for you.”48 The retention of fearless troops 
is, of course, paramount to military success. But the 
feeling of the prince that his troops are prepared to die 
for him is also crucial to inspired leadership. In both 
cases, cleverness is insufficient, and the comradery and 
virility of the lion are crucial.

De Grazia is partly right, but the common good 
ought to be accompanied by a very personal feeling one 
can derive from the common good being served. Ma- 
chiavelli’s appreciation for the republic is founded on 
its capacity to condition its participants, with laws and 
religion, to appreciate something larger than them- 


45 “Chi non sarebbe allegra?” 

46 Dionisotti (1984, 644) is not so anxious to equate Machiavelli with
Ligurio; he notes the similar life experiences of Machiavelli and
Callimaco, which complicates the issue. Cavallini (1973, 79) states
that the prince in Mandragola is “nessuno in particolare.” Although
Barber (1984) links Ligurio with Machiavelli’s association of radical
adaptability with princely virtù, he is not prepared to equate Ligurio
with the prince. But ambivalence is the exception. Mansfield (2000,
27-8) offers an unusual vindication of Nicia, the cuckold.

47 Sasso (1988, vol. 3, 118) equates Machiavelli with Ligurio, who is
able to succeed in the “artificial” environment at what evaded
Machiavelli in the “serious” world. Lord (1979, 817), recognizing the
influence of Hale (1961, 186), simply states: “Ligurio is Machiavelli’s
self-portrait.” Pitkin (1984, 30) writes: “If one were to select one
character in this play with whom Machiavelli might best be identified,
the choice seems clear enough. It is not, despite the possible pun on
his name, Nicia, nor, as one might conventionally suppose, the hero
Callimaco. Instead, it is Ligurio, the author of the plot.” See also
Baldan (1980, 389), who argues that “Ligurio is the projection of
Machiavelli.”


48 “Che le non hanno altro amore né altra cagione che le tenga in
campo, che un poco di stipendio, il quale non è sufficiente a fare che
voglino morire per te.”
selves. Not only is the common good served, but also
there is induced the kind of emotion that inspires a 
citizen-soldier to risk his life. Identifying in Machiavelli 
some hidden attachment to humanist morality, and 
thus to some nebulous “comunità spirituale” (Olschki
1970, 187) is not enough. The common good must be 
reinforced by a sense of personal worth, contribution, 
and grandeur.49

An important passage in The Prince seems to con- 
tradict an interpretation of the lion as complex and 
communal: “Fortune is woman, and it is necessary, 
wanting to take her under, to beat her and knock her 
around” (Machiavelli 1960, 101 [Prince 25]).50 Of
course, our modern sensitivities draw us, mostly criti- 
cally, to the rhetoric of male domination. At first blush, 
this is quintessential machismo, and the lion is a stock 
character. Certainly, Hanna Pitkin believes it to be so. 
For her, the Renaissance is defined by a sense of 
autonomy in rebellion against the dependent mentality 
of the Middle Ages. Machiavelli’s apparent machismo, 
then, is said to represent the hubris of Renaissance 
autonomy. 

The lion prevails, however, because fortune “allows” 
it (lasciare). Autonomy is not the key to the lion’s force. 
The lion relies more on a visceral attractiveness that 
accompanies virility, impetuosity, and sensuality. For- 
tune, like the rest of the lion’s constituency, allows 
herself to be influenced. This is a much more complex 
and tenuous matter than sheer machismo or “autono- 
my,” and Machiavelli is not the Renaissance man that 
Pitkin claims. Cleverness, the fox, may produce a kind 
of autonomy, but the lion, crucial to princely success, is 
actually quite vulnerable. The lion’s audience is fickle 
and transitory, but it is willing to forgive mistakes of 
cleverness among its favorites. Ligurio, the epitome of 
freddezza, despite his superb foxiness, is never going to 
conquer fortune, not because he is not smart enough, 
but because he is not erotic enough. At least one 
perceptive scholar has recognized that the real source 
of power in Mandragola lies with Lucrezia; like fortune, 
she “allows” herself to be won over by Callimaco 
(D'Amico 1984), and his princely success depends 
upon Lucrezia’s unpredictable, perhaps impetuous, 
inclinations. He can only hope he has the sensual 
characteristics, of which youth is perhaps the most 
indispensable, to elicit her favors. 

Lions are not autonomous. They are communal 
animals that demand the respect of their enemies and 
the loyalty of their friends. Autonomy would mean that 
the lion rapes fortune in The Prince, which is Pitkin’s 
impression. But rape is enjoyable to neither assailant 
nor victim, and both fortune and the prince’s citizenry 
would soon rebel. In fact, Machiavelli frequently dis- 
cusses, figuratively and literally, the devastating reper- 


49 It is Locke, not Machiavelli, who has trouble in this regard. Locke 
can only advise his self-interested soldiers that a certain death for 
deserting is more dreadful than an uncertain death for holding firm 
in battle. See Locke 1960, 407 (Second Treatise, sec. 139). 

50 “Io iudico bene questo, che sia meglio essere impetuoso che 
respettivo, perché la fortuna è donna; et è necessario, volendola 
tenere sotto, batterla et urtarla. E si vede che la si lascia più vincere 
da questi, che da quelli che freddamente procedono.” 
cussions of rapacious behavior (Machiavelli 1960, 46, 
75, 149 [Prince 9, 19; Discourses 1.7]). 

The lion’s aversion to rape may help clarify a puz- 
zling section of The Prince, regarding the differences 
between Cesare Borgia and Agathocles the Sicilian. 
Kahn, stressing cleverness, argues that Machiavelli is 
testing his audience when he distinguishes the two 
leaders. The basis for her position is her identification 
of “striking analogies” in their respective careers 
(Kahn 1986, 73). The clever reader will see that in 
condemning Agathocles for his excessive cruelty, Ma- 
chiavelli is conveying the same message to his readers 
that Borgia sent to the people of Cesena when he killed 
his fall guy, Remirro, and left him in two pieces in the 
piazza. The lesson to the clever is that all actions are 
amenable to the mollification of cleverness. 

Kahn’s argument is more compelling than those of 
scholars who excavate the historical record for factual 
distinctions that were never mentioned by Machiavelli 
(Price 1977; Wootton 1994). In fact, it seems that 
Machiavelli intentionally correlates the deeds of the 
two men. Nevertheless, we should not overlook the 
larger discussion in which Agathocles and Borgia are 
found. There is a third individual in the comparison, 
Oliverotto da Fermo, who is advertised as a modern 
Agathocles and whose power issues not only from a 
parricide but also from a massacre that eliminates “all 
of the first men of Fermo” (Machiavelli 1960, 43 
[Prince 8]).51 Oliverotto does no less than erase the 
entire context in which his newly gained power could 
be relished and celebrated. We learn that the father he 
kills is not his natural father but a generous uncle who 
had come to love Oliverotto voluntarily. How doubly 
sweet the celebration would have been had Oliverotto 
the opportunity to return all the favors. 

Although Borgia, Agathocles, and even Oliverotto 
may have committed similar deeds, we detect distinc- 
tions in the portrayals of how they felt about those 
deeds. If the lion is important, then Machiavelli’s 
prince needs to be able to commit judicious and foxy 
cruelty (even murder) without succumbing to, or insti- 
gating, debilitating cynicism. He needs to do so quickly 
and then return to an appreciative and receptive 
audience. Borgia ends up killing Oliverotto, cruelly, but 
we are content with the deed. And our happiness 
allows Borgia to maintain an identity not fully defined 
by his intrigues. As Merleau-Ponty (1964, 213) notes, 
“pure violence can only be episodic.” 

The difficulty in distinguishing Agathocles from Bor- 
gia, then, stems directly from the difficulty in seeing 
The Prince as more than a handbook on cleverness. The 
deeds may be strategically similar, but the way in which 
the perpetrator contemplates them is not. Machiavelli 
needs the prince to believe he can commit crimes and 
still feel pride and respect. And regardless of historical 
detail, Machiavelli portrays Agathocles as a superb fox, 
who indeed owes nothing to fortune. But in his cruelty, 
like Oliverotto with whom he is linked, Agathocles 
deprives himself of the very reason to pursue politics. 
The interests and aptitudes of the lion must be ac- 
knowledged, especially when the environment is 
fraught with foxiness. Agathocles is a rapist, so beneath 
the civility of his constituents lies at best a resentful 
resignation to the authority of a lonesome deviant. The 
lion, conversely, may be macho, but he is not isolated. 
His machismo demands of his audience a voluntary 
infatuation, and thus he is dependent on his audience 
in ways the fox is not. 


CONCLUSION 


Whereas the Discourses more thoroughly dissect the 
who, what, when, and how of politics, The Prince also 
entertains the why. Recognizing that cleverness alone 
is not enough to inspire the political enterprise, Ma- 
chiavelli introduces the lion as accompaniment to the 
more famous fox. In response to the why of politics, 
Plato’s “idea of the good” is utterly uninspiring, and 
the “common good” is insufficient. Rather, the lion 
represents a more satisfying physical component in the 
political equation. The successful prince may be tu- 
tored to avoid recklessness, but he is also young and 
impetuous, infatuated with his position and attractive 
to his constituents. He reads books, but he is also 
restless, traveling and hunting throughout his domain. 
He studies warfare, but he also relishes joining battle 
amid an intimate fraternity of compatriots. He seeks 
fame and glory, but he is also happy in doing so, not 
driven. 

These considerations modify standard interpreta- 
tions of Machiavellian virtù, at least that which applies 
to princes. Clearly, fortune can outwit the earnest, and 
the prince needs to cultivate an aloof mutability. This 
advice is given soberly and systematically. But fortune 
is also contemptuous of the lonely and infirm, prefer- 
ring the company of the alpha. The leonine interests 
are undertaken within a less programmatic but more 
pervasive aura of gratification. The depiction of the 
Italian populace in the last chapter of The Prince is that 
of an unrequited courtesan, aching for a connection 
with a young and compelling partner. Unlike the 
precision of his advice regarding the establishment of 
fortresses or the selection of advisers, Machiavelli 
(1960, 105 [Prince 26]) “cannot express” (Né posso 
esprimere) the emotion awaiting release in this compel- 
ling combination. And this crescendo is preceded by a 
promise in chapter 6 that capable founders, pursuant 
to tactical success, can look forward to honor, happiness, 
and venerazione (p. 32 [Prince 6]). 

Princely virtue is a balance, then, of the clever and 
the charismatic. Given the sophistication and cynicism 
of Machiavelli’s contemporaries, the balance is difficult 
to achieve. Cleverness must be superbly camouflaged 
and judiciously exploited, lest it dampen the intoxica- 
tion attendant on charisma. This is the complex mes- 
sage of Machiavelli’s quintessential political work. To 
impart the message he chooses his metaphors carefully. 
For the Italian prince the targeted images of fox and 
lion are superior to the more blurry image of the 
centaur. With this enhanced precision, Machiavelli 
recognizes and embraces the prospect of shaping mar- 
ble that has already been poorly begun by others 
(Machiavelli 1960, 162 [Discourses 1.11]). 

In The Prince, Machiavelli often employs the infor- 
mal pronoun to address his budding prince. Puzzled 
scholars, seduced by the cleverness motif, invariably 
conclude that this is no more than another “forma di 
manipolazione” (Di Maria 1984, 78; see also Chiappelli 
1969, 80; Raimondi 1972, 266). Yet, tu is employed in 
Italian when speaking to the young as well as to 
intimates. I think Machiavelli has in mind a young and 
proximate individual, drawn to the prospect of the 
imminent adulation of intimates. By addressing his 
young, aristocratic audience in the familiar, Machia- 
velli is not employing just another clever trick. Rather, 
he is displaying an honesty whose breadth is not always 
found in his scholarly tracts. Politics, for Machiavelli, is 
a delicate balance of acumen and boldness, of knowing 
and feeling. So it is hardly a trick when he moderates 
his scholarly inclinations for inspirational reasons. In so 
doing, he does not simplify or censor what he knows. It 
is neither ironic nor paradoxical that Machiavelli can 
claim to have put “all I know” about politics into such 
a short work.52 The princely ingredients derived from 
the lion may necessitate a shortening of his message, 
but that hardly prevents the complications often over- 
looked by his interpreters. There is more to Machia- 
velli than a debate about just how much he betrays 
Plato. 


52 Machiavelli has puzzled scholars by introducing both The Prince 
and the Discourses with some sort of statement that what follows is 
“all I know” about politics (Machiavelli 1960, 13, 121 [Prince 
Dedicatory, Discourses Dedicatory]). In The Prince the verb em- 
ployed is conoscere, rather than the sapere of the Discourses. Because 
The Prince contains the enhancements of the lion perspective, it is 
not surprising that Machiavelli employs the verb that encompasses 
sensual and personal perceptions as well as intellectual knowledge. 
He wrote to his friend Francesco Vettori, in his most famous letter, 
that in The Prince he was “as profound as I can be” (mi profondo 
quanto io posso) (Machiavelli 1961, 304 [Letters 140, 10 December 
1513]). 
Justice and care theories but little to the ideas of early feminist authors. I bring the political philosophy 
of Mary Wollstonecraft to bear on contemporary justice/care debates in order to highlight her 
unique contribution. Although usually interpreted as a classical liberal or republican thinker, Wollstonecraft 
is better understood as a feminist care theorist. She aimed at a revolutionary transformation of liberal society 
by emphasizing the importance of care-giving duties. Unlike some recent feminist scholars, however, she still 
recognized an important role for justice. She argued that before personal care-giving activities could 
transform the political, political justice had first to be extended to personal caring relationships. 
Wollstonecraft’s political philosophy thus provides a feminist model for synthesizing justice and care theories 
and represents an innovative reformulation of classical liberal and republican ideas that incorporates the 
care perspective. 


In recent years, feminist scholars have proposed 
political theories based upon an ethic of care as an 
alternative to liberal theories of justice.1 Although 
these scholars have paid little attention to the ideas of 
historical feminist authors, Mary Wollstonecraft 
(1759-97) provides important insights into the rela- 
tionship between liberal justice and feminist care. In 
this article, I explore Wollstonecraft’s views on the 
relationship between justice and care and outline her 
proposal for creating a nurturing form of liberalism 
based upon a synthesis of these concepts. 

While contemporary scholars continue to debate the 
precise nature of justice and care theories, and some 
even question whether the two concepts are analyti- 
cally distinct, most recognize certain distinguishing 
features between them (Gatens 1998; Held 1995a).2 
Justice theories are organized around formal and ab- 
stract rights and rules, whereas care theories empha- 
size the importance of nurturing activities, personal 
attentiveness, and the maintenance of human relation- 
ships. An important difference between the two is the 
high moral value that care theories place upon nurtur- 
ing activities and the realms in which they traditionally 
occur, especially family life, friendships, and sexual and 
other close personal relationships (Friedman 1995, 
147-8). Concomitant to this view is the belief that 
private and public spheres cannot be separated: The 
care provided in personal relationships and family life 
is essential for public activity and deserves public 
recognition and support (Katzenstein and Laitin 1987, 
262-3). 

Even if Okin (1989a, 1989b) is correct in arguing that 
justice theories can accommodate certain concerns of 
care theories, the latter may at least be distinguished 
from the former by their emphasis on care-giving 
activities.3 More distinctive is their commitment to 
taking others’ needs as the starting point for normative 
action (Ruddick 1990, 237; Tronto 1993, 105). In this 
respect, care theories are more flexible and contextual 
than theories of justice. Whereas justice theories me- 
diate human relationships by applying abstract moral 
principles to particular cases, care theories start from 
the particular needs of individuals and attempt to 
address these needs in context (Friedman 1993, 70-1). 
They also tend to focus on the inherent and often 
unchosen responsibilities and duties that stem from 
human interdependence and relationships, while jus- 
tice theories tend to stress individual autonomy and 
voluntary contractual obligations (Gatens 1998, xiv- 
xv). 

The first generation of care theorists were primarily 
interested in differentiating the concepts of justice and 
care, but many recent discussions center around “how 
justice and care can appropriately be combined from a 
feminist point of view” (Held 1995a, 2; see also Card 
1995; Clement 1996; Dillon 1992; Friedman 1993, 
1995; Held 1995b; Hirschmann and Di Stefano 1996; 
Jaggar 1995; James 1992; Koehn 1998; Okin 1989b; 
Robinson 1999; Tronto 1993). Interest in this issue has 
been piqued by awareness of potential shortcomings in 
the original care ideal (Jaggar 1995). The focus of the 
care ethic on the particular raises questions about its 
applicability to public affairs and large social structures 
(Mendus 1993). At the same time, care ethics has been 
criticized for ignoring the ways in which institutionally 
structured power inequities may compromise caring 
relationships (Friedman 1993). A number of theorists 
also point out that caring relationships can slide into 
paternalism or parochialism if not governed by some 
objective criterion of genuine care (Barry 1995, 252-6; 
Jaggar 1995; Koehn 1998; Narayan 1995; Sevenhuijsen 
1998; Tronto 1993). 

Wollstonecraft affords important insights into these 


3 For a very different view on the ability of justice theories to 
accommodate the concerns of care theorists, see Kittay 1997 and 
Sevenhuijsen 1998, 72-9. 
debates, yet her writings have been ignored by contem- 
porary care theorists. The neglect of Wollstonecraft’s 
writings is perhaps understandable given her reputa- 
tion as a conventional liberal feminist—one who sim- 
ply added women to the classical liberal tradition and 
stirred. Indeed, several scholars suggest that Wollsto- 
necraft’s philosophy embodies all the inadequacies of 
liberal theories of justice when applied to women 
(Eisenstein 1981, 90; Gatens 1991; Jacobus 1979, 10; 
Pateman 1989; Poovey 1984). The only thing that 
Wollstonecraft is said to have to offer “on the question 
of sexual equality is that women are entitled to be 
treated ‘like men’ or ‘as if they were men’” (Gatens 
1991, 126-7). It is held that she incorporated women 
into the liberal public sphere without taking account of 
their concerns and perspective. 

The problem with the liberal interpretation of Woll- 
stonecraft’s thought is that it disregards the important 
role she accords to caring relationships. Although she 
draws heavily upon the classical liberal tradition, she 
nonetheless identifies caring relationships as the foun- 
dation and end of society. She argues that the devel- 
opment of autonomous individuals begins in the family, 
with the particular attention of mothers and fathers to 
the needs of their children. In fact, family duties are 
central to her political philosophy. In her vision of 
society, the highest moral importance is given to the 
particular care-giving duties of husbands, wives, fa- 
thers, mothers, sons, daughters, and citizens. 

Within this framework of care Wollstonecraft does 
recognize an important role for justice. She asserts that 
equality and rights must be extended to women in 
political, social, and family affairs in order to foster the 
development of healthy caring relationships within the 
family and society at large. Her critique of society is 
intended to demonstrate that unjust family and social 
relations not only impede the development of care but 
actually promote uncaring and pathological relation- 
ships. Wollstonecraft is thus rightfully known for her 
strong advocacy of equality and rights, but it has been 
overlooked that she demanded equality and rights for 
women primarily to promote a more caring and dutiful 
citizen body. Her political philosophy is perhaps best 
described as a nurturing liberalism in which liberal 
justice provides the necessary backdrop for the devel- 
opment of virtuous care. 

Wollstonecraft offers important insights into con- 
temporary care/justice debates. Recent proposals for 
combining care and justice often minimize the moral 
and theoretical significance of care or limit its applica- 
bility to personal and private affairs (Held 1995b; 
Jaggar 1995). Wollstonecraft synthesizes justice and 
care without slighting either value. She embeds justice 
within the larger framework of care but demonstrates 
its importance for caring relationships. By her account 
justice is instrumental to care—an essential feature of 
caring relationships. She identifies caring duties, in 
turn, as the crux of a virtuous social order and the key 
to addressing many social problems. Wollstonecraft 
argues that many social problems are the result of 
unhealthy caring relationships that can only be cor- 
rected by extending justice to personal relationships 
and elevating the status of care-giving duties. 

My main purpose is to explore the neglected element 
of care within Wollstonecraft’s moral and political 
thought and to bring her ideas to bear on contempo- 
rary debates about care and justice. It may seem 
anachronistic to apply contemporary concepts to the 
ideas of an eighteenth-century thinker, but two points 
justify my interpretation. First, even if the concept of 
care is a recent discursive invention, the elements that 
define it (including a concern for nurturing relations, 
an emphasis on the particular needs of others, a desire 
to promote attentiveness, and responsibility among 
individuals) have long existed. Because Wollstonecraft 
was concerned with those elements, it is plausible to 
consider her an early care theorist even though she 
expressed her concerns in a different social and histor- 
ical context.4 

Second, and more important, Wollstonecraft stands 
at a critical historical juncture in the formation of the 
modern care ethic. In the late eighteenth century, 
philosophers began to shift notions of moral sentimen- 
tality and affection, which are quite similar to elements 
of the modern care ethic, from the public realm and 
humanity to the domestic sphere and women (Okin 
1981; Tronto 1993). According to Tronto (pp. 25-59), 
this shift was driven primarily by large-scale economic 
and social transformations that made political theories 
based upon contextual morality and affective personal 
relations seem less relevant for public affairs. Wollsto- 
necraft’s writings may be read as an early protest 
against the domestication and segregation of caring 
activities as well as a warning about the pathologies of 
care that would result from it. 

A secondary (albeit largely implicit) purpose of this 
article is to contribute to recent efforts to increase 
appreciation for Wollstonecraft’s works in the canon of 
modern political philosophy (Falco 1996; Gunther- 
Canada 1997; Sapiro 1992, 280-300; Weiss 1996). 
Wollstonecraft remains a peripheral figure among po- 
litical scientists. Too often her writings are assumed to 
advocate nothing more than the extension of liberal 
rights to women and are considered of historical but 
not theoretical interest. I suggest that Wollstonecraft 
articulated an innovative liberal theory that places 
nurturing and caring activities at the center of public 
affairs. She viewed liberal rights and freedoms as a 
means to promote the virtuous fulfillment of nurturing 
duties. That fulfillment, in turn, could protect against 
the selfishness and irresponsibility often associated 
with liberal society. Wollstonecraft aimed at the revo- 
lutionary transformation of society, not so much 
through restructuring macroeconomic and political in- 
stitutions as through improving the quality of personal 
and domestic care. But before personal care can trans- 
form the political, she argues, political justice first must 
be extended to personal caring relations. 





4 Sapiro (1992, 258-9) dubs Wollstonecraft a feminist on the 
grounds that she shared with nineteenth- and twentieth-century 
feminist theorists “a minimal and flexible set” of common ideas and 
concerns. See also Kelly 1992, 1-2. 
BURKEAN CARE AND A VINDICATION OF 
THE RIGHTS OF MEN 


It is useful to begin the discussion of Wollstonecraft’s 
philosophy with a brief discussion of Edmund Burke’s 
Reflections on the Revolution in France. Wollstonecraft 
dedicated her first major political work, A Vindication 
of the Rights of Men (1790), to the task of rebutting it. 
Scholars have proposed a variety of hypotheses to 
explain the vehemence of her response to Burke, but 
none has suggested that her reaction may have been 
provoked in part by the similarity of their ideas. Both 
wrote during a period when the contextual and affec- 
tive moral and political philosophies of the Scottish 
Enlightenment thinkers, among others, were being 
replaced by universalistic and abstract moral and po- 
litical philosophies, or at least were being pushed into 
the domestic sphere (Tronto 1993, 25-59). Both Burke 
and Wollstonecraft viewed this development with con- 
cern but addressed it in very different ways. 

In Reflections, Burke criticizes the uncaring nature of 
universalistic philosophies and defends the traditional 
and patriarchal social structure as necessary for the 
preservation of caring relationships and the fulfillment 
of duties. Wollstonecraft apparently read Reflections as 
a twisted or pathological description of her own con- 
cern with nurturance and care-giving duties. She 
counters Burke’s argument by claiming that universal 
rights are a necessary prerequisite for healthy caring 
relationships, and she dedicates much of her Rights of 
Men to highlighting the shortcomings of care ethics 
when treated in isolation from equality and rights. 

Burke formulates some of his central criticisms of 
the French Revolution in terms reminiscent of care 
ethics. One clear example is his criticism of Enlighten- 
ment liberalism’s commitment to abstract rights. The 
true rights of individuals, he declares, depend upon 
their particular wants and needs (Burke [1790] 1987, 
51-3). In professing his love for liberty, for example, 
Burke notes that he would not therefore congratulate a 
mentally ill individual for escaping (or being cast out) 
from a psychiatric hospital (p. 7). The abstract perfec- 
tion of natural rights is their practical defect. They 
ignore the special needs of different individuals. Burke 
asserts that the devotion to abstract rights can devolve 
into the most callous and uncaring sorts of policies and 
actions if not adapted to the particular and contextual 
needs of human beings: 


Though a pleasant writer said, liceat perire poetis [Let the 
poets perish], when one of them, in cold blood, is said to 
have leaped into the flames of a volcanic revolution, 
ardentem frigidus Aetnam insiluit [Coldly he jumped into 
burning Aetna], I consider such a frolic rather as an 
unjustifiable poetic license than as one of the franchises of 
Parnassus; and whether he was a poet, or divine, or 
politician that chose to exercise this kind of right, I think 
that more wise, because more charitable, thoughts would 
urge me rather to save the man than to preserve his brazen 
slippers as the monuments of his folly (p. 55). 


Burke claims it is the responsibility of each of us—and 
of governments more generally—to attend to the real 
and variable needs of individuals. The commitment to 
abstract rights is too simplistic and impersonal to treat 
all human beings in a truly humane manner. 

Burke’s concern with caring relationships is also 
evident in his criticism of the selfish individualism of 
Enlightenment society. Burke worried that liberal so- 
cieties would strip away “all the pleasing illusions which 
made power gentle and obedience liberal” and reduce 
all human relations to matters of power and self- 
interest (p. 67). “On the scheme of this barbarous 
philosophy, which is the offspring of cold hearts and 
muddy understandings ...laws are to be supported 
only by their own terrors and by the concern which 
each individual may find in them from his own private 
speculations or can spare to them from his own private 
interests” (p. 68). For Burke, the only way to preserve 
affectionate and attentive personal relationships is to 
maintain traditional feudal arrangements in which 
power differentials are mitigated by manners and cus- 
toms and each individual occupies a status and role 
that defines his or her duties toward others. Kings 
behave toward subjects as loving fathers; the nobility 
feels obliged to care for the lower classes; men show 
special concern for women. The whole of society is 
arranged “so as to create in us love, veneration, 
admiration, or attachment” (p. 68). 

Liberal theories of justice replace these affections 
with a base commitment to equality and freedom. They 
tear away all traditional and communal norms in favor 
of a calculating and isolated individualism. Burke 
foretells the coming of a society driven by “present 
convenience” and the “lust of selfish will” (pp. 77, 83). 
The French revolutionaries’ rude treatment of King 
Louis XVI and Marie Antoinette is emblematic of the 
new social ethos devoid of humaneness (pp. 62-5). 
Burke’s concern about the increasing equality and 
assertiveness of women, too, reflects his belief that 
universal morality will eventually degrade all relation- 
ships into mere contractual agreements (Gunther- 
Canada 1996, 65-6). “But the age of chivalry is gone,” 
Burke decries. “That of sophisters, economists and 
calculators has succeeded; and the glory of Europe is 
extinguished forever. Never, never more shall we be- 
hold that generous loyalty to rank and sex, that proud 
submission, that dignified obedience, that subordina- 
tion of the heart which kept alive, even in servitude 
itself, the spirit of an exalted freedom” (p. 66). 

Wollstonecraft devoted A Vindication of the Rights of 
Men to a critique of Burke’s Reflections. Probably more 
than any other work, it established her reputation as a 
traditional liberal theorist. She vehemently defends the 
Enlightenment ideals of freedom, equality, rights, and 
rationality against Burke’s conservative defense of per- 
sonal affection and moral sentimentality. Yet, it is 
misleading to characterize this work in terms of the 
simple contrast between affection and rationality, care 
and justice. Wollstonecraft expresses sympathy for 
many of Burke’s ideas and even observes that in 
different circumstances Burke might have been a “rev- 
olutionary” like herself (Wollstonecraft [1790] 1995, 5, 
45-6). Above all, she expresses sympathy with his 
concern for affective and caring relationships, but she 
finds his account of these virtues to be distorted by his 
attachment to unequal social relations. A Vindication of 
the Rights of Men is not only a vindication of justice and 
equality but also, more accurately, a first vindication of 
her own vision of a just and caring society. She wants to 
demonstrate that the French revolutionaries’ commit- 
ment to equality, rationality, and rights is not necessar- 
ily antithetical to the development of a caring and 
dutiful society but, indeed, the foundation of it. 

Wollstonecraft ([1790] 1995) begins by ridiculing 
Burke’s “pampered sensibility.” His emotions and pas- 
sions overwhelm his reason and subject him to pretty 
flights of imagination (pp. 6-7). Wollstonecraft por- 
trays her own argument as an attempt to reinsert 
reason into the discussion about the French Revolu- 
tion. “Quitting now the flowers of rhetoric, let us, Sir, 
reason together” (p. 7). Several scholars have com- 
mented upon the gender reversal implicit in Wollsto- 
necraft’s rhetoric (Gunther-Canada 1996; Johnson 
1995, 23-46; Kelly 1992, 84-106). Wollstonecraft 
adopts the masculine language of reason to rebuke 
Burke’s effeminate sentimentality and emotionalism.5 
This rhetorical strategy might appear antifeminine 
(indeed, Wollstonecraft has few good things to say 
about most of her female contemporaries), but her 
goal is to lead both women and men away from an 
artificial and conventional definition of the moral 
sentiments toward true human affection. 

According to Wollstonecraft, feelings that are “os- 
tentatiously displayed are often the cold declamation 
of the head, and not the effusions of the heart” (Rights 
of Men, p. 6). “When the heart speaks, we are seldom 
shocked by hyperbole, or dry raptures. I speak in this 
decided tone” (p. 29). She insists that her straightfor- 
ward, unemotional prose indicates her true feelings; in 
contrast, Burke’s fanciful and romantic prose is a sign 
of “false, or rather artificial, feelings” (p. 29). She even 
accuses Burke of “hard-hearted sophistry” (p. 59). He 
tries to manipulate his readers through fanciful images 
and appeals to the emotions. He sacrifices all true 
affection to stylized affectation. 

Wollstonecraft’s clearest statement on style appears 
in the opening pages of her Vindication of the Rights of 
Woman (1792), although a similar statement is found 
at the very beginning of Rights of Men: “I aim at being 
useful, and sincerity will render me unaffected; for, 
wishing rather to persuade by the force of my argu- 
ments, than dazzle by the elegance of my language, I 
shall not waste my time in rounding periods, or in 
fabricating the turgid bombast of artificial feelings, 
which, coming from the head, never reach the heart” 
(Wollstonecraft [1792] 1995, 77; VM, 5). We should 
not, of course, accept at face value Wollstonecraft’s 
claim to speak without affectation. Her rhetoric is 
based upon the Rousseauean conceit that there is a 
true and natural language of the heart beneath the 
distorting layers of convention. The important point is 
that her rhetoric is not based, as sometimes supposed, 


5 Because the first edition of A Vindication of the Rights of Men was 
published anonymously, Wollstonecraft probably did not initially
conceive of this gender reversal as part of her public rhetorical
strategy.




upon the sharp distinction between reason and affec- 
tion. She regards rational reflection as the truest road 
to uncovering the genuine feelings of the human heart. 

Behind the poetic images and flowery rhetoric of 
Burke’s Reflections, Wollstonecraft argues, there is a 
cold and uncaring philosophy. Burke exalts a constitu- 
tion “settled in the dark days of ignorance” (Rights of 
Men, p. 11), when members of the peasantry were put 
to death for hunting on the nobility’s lands or for 
defending their crops from the nobility’s game. “How 
many families,” she asks, “have been plunged, in the 
sporting countries, into misery and vice for some paltry 
transgression of these coercive laws, by the natural 
consequence of that anger which a man feels when he 
sees the reward of his industry laid waste by unfeeling 
luxury?—when his children’s bread is given to dogs” (p. 
16)! She notes that Burke’s empathy seems to extend 
only to the rich and wellborn. His lack of concern for 
the needs of the poor exposes the indifference at the 
core of his philosophy: 


Misery, to reach your heart, I perceive, must have its cap 
and bells; your tears are reserved, very naturally consider- 
ing your character, for the declamation of the theatre, or 
for the downfall of queens, whose rank alters the nature of 
folly, and throws a graceful veil over vices that degrade 
humanity; whilst the distress of many industrious mothers, 
whose helpmates have been torn from them, and the 
hungry cry of helpless babes, were vulgar sorrows that 
could not move your commiseration, though they might 
extort an alms. “The tears that are shed for fictitious 
sorrows are admirably adapted,” says Rousseau, “to make
us proud of all the virtues which we do not possess” (p. 14). 


Wollstonecraft adds that not only the poor suffer 
under Burke’s system. Feudalism stymies the develop- 
ment of healthy caring relationships. Burke assumes 
that “respect chills love” (Rights of Men, p. 6). Woll- 
stonecraft counters that “affection in the marriage state 
can only be founded on respect” (p. 22). She explains 
that equal respect is necessary to appreciate the par- 
ticular and unique characteristics of others and to love 
them as real individuals. Traditional social arrange- 
ments thwart genuine interpersonal relationships by 
encouraging people to take on false airs, to view others 
through artificial romantic ideals, and to marry accord- 
ing to status and wealth. “The respect paid to rank and 
fortune damps every generous purpose of the soul, and 
stifles the natural affections on which human content- 
ment ought to be built” (p. 24). 

In their efforts to emulate the manners of the 
nobility, the middle class similarly sacrifices personal 
intimacy to wealth and affectation. “The grand concern 
of three parts out of four is to contrive to live above 
their equals, and to appear to be richer than they are. 
How much domestic comfort and private satisfaction is 
sacrificed to this irrational ambition! It is a destructive 
mildew that blights the fairest virtues; benevolence, 
friendship, generosity, and all those endearing charities 
which bind human hearts together” (Rights of Men, p. 
23). Feudal arrangements further undermine nurturing 
family relations by inclining parents to care more for 
the perpetuation of family name and estates than the 
well-being of their children. “The younger children 
have been sacrificed to the eldest son; sent into exile, or
confined in convents, that they might not encroach on 
what was called, with shameful falsehood, the family 
estate. Will Mr. Burke call this parental affection 
reasonable and virtuous” (pp. 21-2)? In short, tradi- 
tional social arrangements distort close personal rela- 
tionships by subordinating them to romance, rank, and 
wealth. 

Wollstonecraft defends the rights of men as the 
solution to this problem. Like Burke, she wants to 
avoid the development of a cold and calculating soci- 
ety. But in contrast to Burke, she claims the extension 
of rights and equality is necessary to this end. She 
suggests that a liberal political order based upon 
abstract and universal rights does not necessarily entail 
the demise of personal affection; on the contrary, it 
provides the basis for the development of more caring 
and dutiful relationships. 


The civilization which has taken place in Europe has been 
very partial, and, like every custom that an arbitrary point 
of honour has established, refines the manners at the 
expense of morals, by making sentiments and opinions 
current in conversation that have no root in the heart, or 
weight in the cooler resolves of the mind.—And what has 
stopped its progress?—hereditary property—hereditary 
honours. The man has changed into an artificial monster 
by the station in which he was born, and the consequent
homage that benumbed his faculties like the torpedo’s 
touch;—or a being, with a capacity of reasoning, would not 
have failed to discover, as his faculties unfolded, that true 
happiness arose from the friendship and intimacy which 
can only be enjoyed by equals; and that charity is not a 
condescending distribution of alms, but an intercourse of 
good offices and mutual benefits, founded on respect for 
justice and humanity (Rights of Men, pp. 8-9). 


In this passage, Wollstonecraft touches upon a num- 
ber of themes that are central to both her Vindications.
Following Rousseau, she claims that European civili- 
zation is corrupt because it is based upon unequal 
social relationships. This inequality has created a civi- 
lization of monstrous and unfeeling characters. The 
only way to develop true friendship and mutual care is 
to extend equality and justice to all. While she commends
Burke’s concern for relationships and care, she
suggests that he has not explored deeply enough what 
is necessary to promote these ends. He falls back upon 
the feudal hierarchy as the foundation of caring rela- 
tionships and thus helps perpetuate the very thing he 
claims to detest: an impersonal and uncaring society. 

Burke and Wollstonecraft approach the problem of 
promoting caring relationships in quite different ways. 
Burke looks back wistfully to the feudal social order in 
which caring duties are prescribed by the particular 
roles and status of individuals. Wollstonecraft looks 
ahead to the development of a more caring society 
based upon egalitarian social arrangements. Although 
Burke’s concern for affection and care is more similar 
to the ideas of contemporary conservative theorists 
than to those of contemporary feminists, Wollstone- 
craft’s critique nonetheless yields valuable lessons for 
contemporary feminist discussions about justice and 
care. Rather than jettison justice for care or draw a 
sharp distinction between them, she argues that the
two concepts are integrally related. Without justice, the 
concern for care can justify paternalistic and oppressive 
social relationships, especially for women, children, 
and the poor. Indeed, she claims it is difficult if not 
impossible to actualize healthy caring relationships 
within an unjust social structure, since social inequality 
creates pathologies of care in the form of misplaced 
and artificial affections. It is this last point that Wollstonecraft
elaborates at length in A Vindication of the Rights of Woman.


A VINDICATION OF THE RIGHTS OF 
WOMAN 


After defending the principles of the French Revolu- 
tion against Burke’s conservative criticisms in her first 
Vindication, Wollstonecraft turns in A Vindication of 
the Rights of Woman to a criticism of the French 
revolutionaries for being too conservative. She charges 
them with betraying their ideals by reproducing ele- 
ments of the feudal social order within their new 
constitution. Specifically, they exclude women from 
political rights and thus maintain the old patriarchal 
social system. “Who made man the exclusive judge, if 
woman partake with him the gift of reason” (Wollsto- 
necraft [1792] 1995, 69)? Before the French Revolu- 
tion, kings and nobles justified their rule by arguing 
that the people lacked the reason necessary to govern 
themselves. “Do you not act a similar part, when you 
force all women, by denying them civil and political 
rights, to remain immured in their families groping in 
the dark” (p. 69)? Although the revolutionaries were 
free from belief in the divine right of kings, they held 
fast to the “divine right of husbands” (p. 112). De- 
nouncing both divine rights, Wollstonecraft demands 
the extension of civil and political rights to women. 

This demand for women’s rights solidified Wollsto- 
necraft’s reputation as a liberal feminist, but the liberal 
interpretation of her thought fails to capture the full 
complexity of her ideas. As noted above, in the Rights 
of Men Wollstonecraft criticizes not affection per se, 
but the heartless and unnatural affection that develops 
from unequal social relationships. In the Rights of 
Woman, she likewise defends the extension of rights to 
women primarily on the grounds that it will facilitate 
the development of more nurturing and caring rela- 
tionships. In fact, she approaches the whole issue of 
governance from a care perspective, reformulating 
classical liberal theory to emphasize the importance of 
the family and nurturing relationships for the well- 
being of society at large. 

Wollstonecraft’s concern with care distinguishes her 
thought from classical liberal theory in several ways. 
Rather than simply appeal to the abstract dignity of all 
individuals, Wollstonecraft supports women’s equality 
largely by emphasizing the ways in which it will facili- 
tate the delivery of care. She also focuses on the 
importance of healthy family relationships for the 
public good and rejects the classical liberal distinction 
between public and private spheres. She claims justice 
must be brought to bear on the private sphere of family 
and social life in order to foster the development of
care, and government must accord greater resources 
and respect to traditional nurturing activities, such as 
parenting and education. As Sapiro (1992, 183; 1996, 
37) has written, Wollstonecraft “saw no clear distinc- 
tion between public and private” but instead “saw 
‘public’ and ‘private’ as integral parts of the same
wholes, not just as different social spaces but, in some
ways, as occupying the same space.” Also distinct from
classical liberal thought is Wollstonecraft’s suggestion
that the proper goal of government is the creation of a 
more virtuous citizen body devoted to fulfilling partic- 
ular duties to one another and humanity at large. 

Wollstonecraft is most often interpreted as a liberal 
theorist, but some scholars place her thought within the 
classical republican tradition and interpret her as a 
virtue theorist (Barker-Benfield 1989; Johnson 1995;
Landes 1988; Sapiro 1992). There can be no doubt 
about the republican influences on her thinking. She 
argues that hierarchy and luxury invariably corrupt the 
morality of people, and she advocates an activist or 
participatory ideal of citizenship (Sapiro 1992, xix-xx,
77-116, 232-7). But her republicanism also is strongly
influenced by her concern for caring relationships. She 
argues that public virtue depends upon particularized 
care relationships, and these depend upon social and 
political equality. She thus extends classical republican 
arguments about the corrupting effects of hierarchy 
and inequality into the private sphere and makes 
egalitarian family relations a central focus of her theory 
of a virtuous republic. At the same time, she associates 
virtue for both men and women not with military or 
political glory but with the fulfillment of family and 
social nurturing duties. In fact, she positively reviles the 
traditional republican valorization of the military and 
war (Rights of Woman, pp. 92-3, 235-6; see Sapiro 
1992, 102-3). Hers is a republic of care in which duty to 
children, spouses, friends, citizens, and humanity re- 
places the traditional republican ideal of the swagger- 
ing vir virtutis. 

Wollstonecraft lays out the intellectual framework of 
her argument in the opening chapters of Rights of 
Woman. Criticizing Rousseau’s account of the original 
isolation of human beings, she claims that God created 
human beings to progress toward moral perfection 
within and through society (pp. 79-82). She defines 
moral perfection as the development of reason, knowl- 
edge, and virtue (p. 79). Reason and knowledge pro- 
vide human beings with the capacity to understand and 
control their social world and, more specifically, to 
comprehend their duties toward others. Virtue consists 
of the self-conscious fulfillment of these duties (Sapiro 
1992, 74-5). The primary duties of women are their 
care-giving responsibilities as mothers, daughters, sis- 
ters, and wives. The primary duties of men are their 
care-giving responsibilities as fathers, sons, brothers,
and husbands. Beyond these primary duties exist 
broader social responsibilities to neighbors, other citi- 
zens, and all human kind. 

For Wollstonecraft, the moral progress of society, 
which forms the entire backdrop of the argument in 
Rights of Woman, consists of progress toward a world in 
which men and women recognize the supreme importance
of their care-giving duties within the family and
society at large. Yet, she claims, everything depends 
upon women’s equality. Without that, neither gender 
can satisfactorily fulfill its most basic care-giving duties, 
and society will remain stalled in a morally underde- 
veloped state. The following important passage from 
the prefatory letter to Rights of Woman aptly summa- 
rizes the underlying premises of Wollstonecraft’s argu- 
ment: 


Contending for the rights of woman, my main argument is 
built on this simple principle, that if she be not prepared by 
education to become the companion of man, she will stop 
the progress of knowledge and virtue; for truth must be 
common to all, or it will be inefficacious with respect to its 
influence on general practice.... If children are to be 
educated to understand the true principle of patriotism, 
their mother must be a patriot; and the love of mankind, 
from which an orderly train of virtues spring, can only be 
produced by considering the moral and civil interest of 
mankind; but the education and situation of woman, at 
present, shuts her out from such investigations (p. 68). 


Wollstonecraft makes clear from the outset of Rights of 
Woman that her central concern is the development of 
a more caring and nurturing society. She defends 
women’s equality primarily on the grounds that it is 
prerequisite for the moral progress of humanity toward 
a more virtuous and dutiful state. 


INEQUALITY AND THE PATHOLOGIES OF 
CARE 


Wollstonecraft dedicates much of the argument in 
Rights of Woman to demonstrating how social and 
political inequality distorts women’s natural affections 
and prevents them from effectively fulfilling their care- 
giving duties. On the most basic level, the unequal 
education given to most women leaves their minds 
suspended in a state of perpetual childhood. As a result, they are
totally unfit for mature companionship and motherhood.


The education of women has, of late, been more attended 
to than formerly, yet they are still reckoned a frivolous sex, 
and ridiculed or pitied by the writers who endeavour by 
satire or instruction to improve them. It is acknowledged 
that they spend many of the first years of their lives by 
acquiring a smattering of accomplishments, meanwhile 
strength of body and mind are sacrificed to libertine
notions of beauty, to the desire of establishing them- 
selves,—the only way women can rise in the world—by 
marriage. And this desire making mere animals of them, 
when they marry they act as such children may be expected 
to act:—they dress; they paint, and nickname God's creatures.—Surely
these weak beings are only fit for a seraglio!—Can they be expected
to govern a family with
judgement, or take care of the poor babes whom they bring 
into the world (p. 77)? 


Wollstonecraft returns to this theme throughout 
Rights of Woman. Until women comprehend the im- 
portance of their care-giving activities for society and 
receive public recognition for them, they will not be 
fully attached to them or discharge them virtuously (p. 
69). Trained to think primarily about their own beauty 
and marriage, many girls grow up to be “cold-hearted,
narrow-minded,” and self-centered women who ne- 
glect their duties (p. 142). Alternatively, those who 
remain attached to their duties tend to fulfill them in 
overly particular and arbitrary ways. “Mankind seem to 
agree that children should be left under the manage- 
ment of women during their childhood. Now, from all 
the observation that I have been able to make, women 
of sensibility are the most unfit for this task, because 
they will infallibly, carried away by their feelings, spoil 
a child’s temper” (p. 145). Wollstonecraft is careful to 
note that her observations do not apply indiscrimi- 
nately to all women (p. 136). Working women, in 
particular, tend to display more virtue than “gentle- 
women” because they are less immersed within polite 
society and are more likely to engage in the sorts of 
practical employment that develop their “good sense” 
(pp. 154-5).6 Wollstonecraft mainly worries about middle- and
upper-class women, whose sentimental education renders them unfit
for adult duties.

Wollstonecraft argues that gender inequality also 
disrupts close heterosexual relationships by encourag- 
ing women to tyrannize over men (Rights of Woman, p. 
78). She explains that all human beings innately desire 
recognition and respect from others (pp. 132, 231). 
Yet, social inequalities prevent women from attaining 
respect through legitimate social avenues. They are 
provided only the most superficial sort of education 
and are shut out from political activity and most 
occupations. Consequently, they resort to the only 
means available to them to achieve some power and 
respect. They use their feminine wiles to manipulate 
men to their will. “[Women’s] exertion of cunning is 
only an instinct of nature to enable them to obtain 
indirectly a little of that power of which they are 
unjustly denied a share; for, if women are not permit- 
ted to enjoy legitimate rights, they will render both men 
and themselves vicious, to obtain illicit privileges” (p. 
70). Women attempt to gain power vicariously by 
putting on “infantine airs” in order to excite men’s 
desires (p. 78). Men respond with lavish and artificial 
displays of romantic affectation. As a result, marriages 
are formed upon the weakest of foundations. 

After the first flush of love passes, men usually 
become bored with their wives. “Their husbands acknowledge
that they are good managers, and chaste
wives; but leave home to seek more agreeable, may I be 
allowed to use a significant French word, piquant 
society” (Rights of Woman, p. 143). Women respond by 
redirecting their feminine arts toward new conquests or 
retiring into dreary bitterness. 


The woman who has only been taught to please will soon 
find that her charms are oblique sunbeams, and that they 
cannot have much effect on her husband’s heart when they 
are seen every day, when the summer is passed and gone. 
Will she then have sufficient native energy to look into 
herself for comfort, and cultivate her dormant faculties? 
Or, is it not more rational to expect that she will try to 


6 Tronto (1993, 61-97) argues in this vein that care is linked not only
with gender but also with class and race; lower-class individuals and 
people of color adopt this perspective more often than individuals 
with power and privilege. 
please other men; and, in the emotions raised by the
expectations of new conquests, endeavor to forget the 
mortification her love or pride has received? When the 
husband ceases to be a lover—and the time will inevitably 
come, her desire of pleasing will then grow languid, or 
become a spring of bitterness; and love, perhaps the most 
evanescent of all passions, gives place to jealousy or vanity (p. 97).

Wollstonecraft hardly even considers the possibility 
that women might also turn to one another for support 
and refuge from their unhappy relations with men. Her 
silence on this subject seems to reflect her pessimism 
about it. She indicates that it is very difficult for women 
who grow up submitting to and tyrannizing over men 
ever to trust one another as true friends. Most view 
other women as rivals and reproduce the oppressions 
of patriarchal society in their female relationships 
(Rights of Woman, pp. 142, 286). In her last novel, The 
Wrongs of Woman, or Maria, however, Wollstonecraft
(1989) does provide the story of two women, Maria and 
Jemima, who are able to develop bonds of friendship 
through their common oppression (Johnson 1995, 47- 
69).7 

The effects of inequality are even more pronounced 
in women’s relations with their children. Wollstone- 
craft outlines a variety of ways in which inequality 
contributes to dysfunctional maternal-child relations. 
Being subject to the unjust rule of men within society, 
women often reproduce this injustice within their 
home. “It will not be difficult to prove that such 
delegates will act like men subjected by fear, and make 
their children and servants endure their tyrannical 
oppression. As they submit without reason, they will, 
having no fixed rules to square their conduct by, be 
kind, or cruel, just as the whim of the moment directs; 
and we ought not to wonder if sometimes, galled by 
their heavy yoke, they take a malignant pleasure in 
resting it on weaker shoulders” (Rights of Woman, p. 
120). 

Inequality also leads women to neglect their children.
Because women associate their power and respect with
their ability to control men, many of them
show more concern for the arts of beauty and coquetry 
than the duties of motherhood (pp. 231-2). Alterna- 
tively, some women carry out a quiet revenge against 
their husband by “wanting their children to love them 
best, and take their part, in secret, against the father, 
who is held up as a scarecrow” (Rights of Woman, p. 
243). Furthermore, social inequality often poisons even 
the most well-intentioned maternal affection with “per- 
verse self-love” (p. 242). Women look to their children 
to be what they cannot be. They invest their children 
with their dreams and make them their source of power 
and pride. They forget that children, too, are human 
beings entitled to respect and independence. 

Woman, however, a slave in every situation to prejudice,
seldom exerts enlightened maternal affection; for she


7 Jemima is initially presented as a woman who, due to oppression 
and abuse throughout her life, is misanthropic and actually colludes
with the corrupt society by working as a guard in an asylum. Over the 
course of the novel, Maria and Jemima develop a friendship based 
largely upon their common experiences and oppression as women. 
either neglects her children, or spoils them by improper
indulgence. Besides, the affection of some women for their 
children is, as I have before termed it, frequently very 
brutish: for it eradicates every spark of humanity. Justice, 
truth, every thing is sacrificed by these Rebekah’s [sic], and 
for the sake of their own children they violate the most 
sacred duties, forgetting the common relationship that 
binds the whole family on earth together (p. 243, emphasis 
in original). 

Wollstonecraft recognizes far more clearly than 
many contemporary feminists the close links between 
social structures and the healthy delivery of care. 
Lacking a rational education and just social structure, 
women are likely to express their affection in self-serving
ways. They smother their children with affection but 
forget to recognize their humanity. Their love is just a 
more subtle form of their desire to tyrannize over
others. 


WOLLSTONECRAFT’S SYNTHESIS OF
CARE AND JUSTICE 


Wollstonecraft is well known for her defense of wom- 
en’s equality, but it is rarely noticed that the central 
reason she calls for equal rights is her desire to 
overcome the pathologies of care. She considers justice 
to be an integral aspect of a caring society and essential 
for encouraging women to fulfill their nurturing duties. 


Would men but generously snap our chains, and be 
content with rational fellowship instead of slavish obedience,
they would find us more observant daughters, more
affectionate sisters, more faithful wives, more reasonable 
mothers—in a word, better citizens. We should then love 
them with true affection, because we should learn to 
respect ourselves; and the peace of mind of a worthy man 
would not be interrupted by the idle vanity of his wife, nor 
the babes sent to nestle in a strange bosom, having never 
found a home in their mother’s (Rights of Woman, p. 241). 


Wollstonecraft invokes friendship as her model of a 
natural and pure affection (Rights of Woman, pp. 
98-101). Friendship is devoid of all coercion and 
romantic sentimentality and involves the respect of 
human beings for the unique and particular character- 
istics of one another, or what Wollstonecraft calls 
“mutual sympathy” (pp. 203, 241). She contrasts the
“calm tenderness of friendship, the confidence of respect”
with the “blind admiration, and the sensual
emotions of fondness” (p. 99). Unless women are given 
the opportunity to develop their full capacities, how- 
ever, they can never be the equal of their husband and 
never become their genuine friend. “The affection of 
husbands and wives cannot be pure when they have so 
few sentiments in common, and when so little confi- 
dence is established at home, as must be the case when 
their pursuits are so different. That intimacy from 
which tenderness should flow, will not, cannot subsist 
between the vicious” (p. 292). 

Without social equality, women and men can never 
attain the highest form of personal intimacy—friend- 
ship. The lack of friendship between parents, in turn, 
undermines healthy child care: “Children will never be 
properly educated till friendship subsists between parents.
Virtue flies from a house divided against itself—
and a whole legion of devils take up their residence 
there” (Rights of Woman, p. 292). Wollstonecraft traces 
the ills of society from the political order to the family 
and back again. Without political and social equality, 
men and women will never achieve friendship within 
the family or provide the proper care for their children. 
As a result, children will grow up ill-equipped for the 
duties of citizenship, and political society will remain 
nothing more than an assemblage of selfish and iso- 
lated individuals. 

Wollstonecraft outlines two sets of concrete proposals
for equalizing the condition of women within society.
First, she demands the extension of political and
civil rights to women, including the rights to life, 
liberty, and property as well as the right to vote (Rights 
of Woman, pp. 230-41). Unique to her defense of 
these rights is her attention to their effects on family 
life. She claims political rights will expand women’s 
understanding of their place within the web of social 
relations. They will come to see themselves as active
citizens with responsibilities to others outside their 
immediate social sphere. This enlarged social under- 
standing will make them more devoted and adept care 
givers because they will realize the importance of their 
nurturing activities for the public good. Lacking this 
enlarged social understanding, they will remain fixated 
on their petty concerns and will be incapable of fulfill- 
ing even their private family duties. “It is plain from the 
history of all nations, that women cannot be confined 
to merely domestic pursuits, for they will not fulfill 
family duties, unless their minds take a wider range, 
and whilst they are kept in ignorance they become in 
the same proportion the slaves of pleasure as they are 
the slaves of men” (p. 270). 

Wollstonecraft argues in a similar vein about civil 
rights for women. They should be given a legal exis- 
tence independent of their father, husband, and broth- 
ers and granted the right to divorce (Sapiro 1992, 
149-52, 237). They also should have the right to own 
and inherit property as autonomous citizens (Rights of 
Woman, pp. 230-41). Furthermore, the government 
should work to open careers to women. At present, 
Wollstonecraft complains, too many women waste 
their life in idleness who might otherwise have been 
doctors, nurses, midwives, politicians, merchants, 
teachers, store owners, farmers, and more (pp. 238-9). 
Free access to careers will not only allow women to 
contribute their talents to the public good but also help 
them achieve economic independence from men. As a 
result, “women would not then marry for a support ...
and neglect the implied duties” (p. 239). Open
careers, like engagement in the political realm, also will 
broaden women’s understanding. They will approach 
their nurturing relationships with a wider understand- 
ing of social affairs rather than through their own 
private idiosyncrasies and desires. 

Wollstonecraft claims that professional careers for 
women will strengthen family ties. People will marry 
from affection rather than convention, necessity, or 
passing romance. They will relate to one another as 
equals with common interests and aims rather than as 
master and servant. “In short, in whatever light I view
the subject, reason and experience convince me that 
the only method of leading women to fulfill their 
peculiar duties, is to free them from all restraint by 
allowing them to participate in the inherent rights of 
mankind” (Rights of Woman, p. 272). Political and civil 
rights are, by Wollstonecraft’s account, the foundation 
of a caring society. 

Wollstonecraft’s other major reform proposal is her 
plan for national education for girls. She claims the 
extension of equal rights and opportunities to women is 
inadequate for addressing the deep gender inequality 
within society, much of which stems from the manner 
in which women are socialized.8 They are taught to
feign weakness and innocence rather than to develop 
their reason. According to Wollstonecraft, women who 
are socialized in this manner would gain little from 
political and civil rights. She therefore argues that the 
government should assume direction of children’s ed- 
ucation in order to combat these social prejudices and 
especially the parochial views of parents. “For whilst 
schoolmasters are dependent on the caprice of parents, 
little exertion can be expected from them, more than is 
necessary to please ignorant people” (Rights of Woman, 
p. 257).

Wollstonecraft specifically calls for the establish- 
ment of public day schools to serve as a counterweight 
to the inequities within the family. Their purpose 
should be twofold: Educate the mind and heart, that is, 
cultivate both reason and care. She outlines several 
pedagogical reforms to meet this end. First, public 
schools should abolish all false distinctions of sex by 
educating girls and boys together in the same subjects 
and activities. In this way, they will be inculcated with 
a sense of mutual respect from a young age (Rights of 
Woman, p. 260). “I presuppose, that such a degree of 
equality should be established between the sexes as 
would shut out gallantry and coquetry, yet allow friend- 
ship and love to temper the heart for the discharge of 
higher duties” (p. 264). Second, pedagogical methods 
based upon obedience and hierarchy should be re- 
placed by a system of active learning carried out 
through examples and conversation and designed to 
create students who exercise their reason indepen- 
dently of their teachers (p. 251; Sapiro 1992, 240-1).9 
Third, attention should be given to an education of the 
heart. “I am, indeed, persuaded that the heart, as well 
as the understanding, is opened by cultivation; and by, 
which may not appear so clear, strengthening the 
organs; I am not now talking of momentary flashes of 
sensibility, but of affections” (p. 142). The best means 
to accomplish this goal is to broaden children’s under- 
standing of the intimate links between their nurturing 
duties and the public good. 

In discussing the education of women in particular, 
Wollstonecraft conjectures that they would not be 


8 Sapiro (1992, 237-9) emphasizes that Wollstonecraft defines edu- 
cation broadly in terms of socialization. 

9 In an interesting contemporary parallel, Koehn (1998) suggests that 
the Socratic method provides an appropriate model for promoting 
principled and healthy caring relationships. 

drawn to petty pursuits “if political and moral subjects 
were opened to them; and I will venture to affirm, that 
this is the only way to make them properly attentive to 
their domestic duties.—An active mind embraces the 
whole circle of its duties, and finds time enough for all” 
(Rights of Woman, p. 265). Starting from small practical 
examples within the classroom, children can be taught 
the importance of care and respect in interpersonal 
relations. Later these particular affections can be re- 
lated to larger political and moral concerns. Students 
then can be shown the important role of their care- 
giving duties for society at large. The point is to elevate 
the moral status of care-giving activities by teaching 
students about the central role of these activities in 
fostering a healthy and functioning society. 

Wollstonecraft’s fusion of justice and care is most 
evident in her account of healthy care-giving activities. 
Here she claims that justice is necessary not only in 
broad social affairs but also in the intimate relations 
among individuals. It is not only a prerequisite for the 
delivery of care but also provides an important check 
on it. The proper approach of the adult care giver to 
the child is one of respectful nurturing. Parents and 
teachers should devote their nurturing activities to 
transforming children into autonomous beings. They 
are to address their wards in the following manner: “It 
is your interest to obey me till you can judge for 
yourself; and the Almighty Father of all has implanted 
an affection in me to serve as a guard to you whilst your 
reason is unfolding; but when your mind arrives at 
maturity, you must only obey me, or rather respect my 
opinions, so far as they coincide with the light that is 
breaking in on your own mind” (Rights of Woman, p. 
247). This passage aptly summarizes Wollstonecraft’s 
vision of adult care. Parents and teachers are to direct 
their affections to the development of a child’s reason 
and autonomy. Even if they cannot treat children as 
equals and hence as friends, they should think of them 
as friends in the making. 

The following passage from Original Stories, as re- 
layed by Mrs. Mason, the tutor, highlights Wollstone- 
craft’s views.10 


[As] Mary had before convinced me that she could regu- 
late her appetites, I gave her leave to pluck as much fruit 
as she wished; and she did not abuse my indulgence. On 
the contrary, she spent [the] most part of the time gather- 
ing some for me, and her attention made it sweeter. 
Coming home I called her my friend, and she deserved the 
name, for she was no longer a child; a reasonable affection 
had conquered an appetite; her understanding took the 
lead, and she had practiced a virtue (Wollstonecraft [1788] 
1989, IV, 400-1). 


The test of healthy, as opposed to self-indulgent, 
parental care is whether it promotes the child’s auton- 
omy. Only after achieving autonomy can the child 
express genuine care for others. The goal of the parent 
or teacher is to graduate the child from a dependent to 
a friend. 

Wollstonecraft has no illusions about the short-term 


10 For the view that Mrs. Mason represents the ideas of Wollstone- 
craft, see Kelly 1992, 63-4. 

effectiveness of her proposed reforms. Since “men and 
women must be educated, in a great degree, by the 
opinions and manners of the society they live in,” she 
concludes that “till society be differently constituted, 
much cannot be expected from education” (Rights of 
Woman, p. 89). The first step toward reform is the 
extension of equal rights and education to women, but 
this is only a first step. A “revolution in female 
manners” must sweep through society in order to effect 
real change (p. 292). The success of this revolution 
depends not only on a change in women’s conscious- 
ness but also on the transformation of men’s conscious- 
ness. “For I will venture to assert, that all the causes of 
female weakness, as well as depravity, which I have 
already enlarged on, branch out of one grand cause— 
want of chastity in men” (p. 227). As long as men 
respond to coquetry, women will continue to play the 
flirt. “The two sexes mutually corrupt and improve 
each other. This I believe to be an indisputable truth, 
extending it to every virtue. Chastity, modesty, public 
spirit, and all the noble train of virtues, on which social 
virtue and happiness are built, should be understood 
and cultivated by all mankind, or they will be cultivated 
to little effect” (p. 229). 

Although Wollstonecraft’s revolution may start with 
equal rights and educational reform, it will not reach its 
end until the deep prejudices against women are 
rooted out of society. It likewise requires a revaluation 
of caring activities within society. These are to be 
viewed not as ancillary to business and political activ- 
ities but as primary human and social virtues to be 
embraced by both men and women. The reward Woll- 
stonecraft holds out for this revolution is the emer- 
gence of a more dutiful and virtuous society. “The 
conclusion which I wish to draw, is obvious; make 
women rational creatures, and free citizens, and they 
will quickly become good wives, and mothers; that 
is—if men do not neglect the duties of husbands and 
fathers” (Rights of Woman, p. 275). Wollstonecraft 
aims at the radical transformation of society not so 
much by effecting substantial change in political or 
economic institutions as by instigating a revolution in 
the status and nature of care. 


WOLLSTONECRAFT’S NURTURING 
LIBERALISM 


It should be evident from the foregoing just how 
misleading it is to characterize Wollstonecraft simply 
as a classical liberal feminist. She rejects the classical 
liberal distinction between public and private spheres, 
emphasizes the importance of care-giving activities, 
and sets her sights on creating a more dutiful and 
caring citizen body. On the one hand, she emphasizes 
the importance of involving women in public affairs for 
the sake of making them more virtuous care givers. 
“Females, in fact, denied all political privileges, and 
not allowed, as married women, excepting in criminal 
cases, a civil existence, have their attention naturally 
drawn from the interest of the whole community to that 
of the minute parts, though the private duty of any 
member of society must be very imperfectly performed 
when not connected with the general good” (Rights of 
Woman, pp. 281-2). 

On the other hand, she asserts the importance of 
private care for the development of public virtue. 


Public education, of every denomination, should be di- 
rected to form citizens; but if you wish to make good 
citizens, you must first exercise the affections of a son and 
a brother. This is the only way to expand the heart; for 
public affections, as well as public virtues, must ever grow 
out of the private character, or they are merely meteors 
that shoot athwart a dark sky, and disappear as they are 
gazed at and admired. Few, I believe, have had much 
affection for mankind, who did not first love their parents, 
their brothers, sisters, and even the domestic brutes, whom 
they first played with. The exercise of youthful sympathies 
forms the moral temperature; and it is the recollection of 
these first affections and pursuits that gives life to those 
that are afterwards more under the direction of reason 
(Rights of Woman, p. 256). 


Wollstonecraft ultimately aims at a massive and 
long-term reform of both public and private affairs. She 
calls upon the government to extend equal rights to 
women and to work for their equality in the family and 
careers. She also calls upon government to take a direct 
role in educating a more egalitarian and caring citizen 
body. Most generally, she claims the mission of gov- 
ernment goes beyond the abstract guarantee of equal- 
ity and rights to all; its task is more particularly to 
balance power differentials within society by taking the 
side of the weak against the strong (Sapiro 1992, 182): 
“Nature having made men unequal, by giving stronger 
bodily and mental powers to one than to another, the 
end of government ought to be, to destroy this inequal- 
ity by protecting the weak. Instead of which, it has 
always leaned to the opposite side, wearing itself out by 
disregarding the first principle of its organization” 
(Wollstonecraft [1794] 1989, VI, 17). In short, Wollsto- 
necraft reformulates the classical liberal definition of 
government by placing it within the larger framework 
of care. One might say she took Burke’s criticisms of 
Enlightenment liberalism seriously enough to recog- 
nize that liberal governments need to give special 
attention to caring activities if they are to avoid the 
selfish egoism and individualism associated with liberal 
justice. 

Wollstonecraft’s thought also does not fit neatly into 
the classical republican tradition. Landes (1988, 123- 
51), for example, has portrayed Wollstonecraft as an 
advocate of republican motherhood. Yet, most advo- 
cates of republican motherhood embrace a sharp dis- 
tinction between the public and private spheres (Sapiro 
1992, 178-9). They suggest that men have a civic role in 
the public realm, whereas women’s civic duties are 
confined to their domestic duties as mothers and 
housekeepers. Wollstonecraft, by contrast, argues that 
men should play a more active role within the family as 
fathers and husbands, since family affection is the 
foundation of public virtue. She likewise urges an 
independent civic existence for women as a means for 
broadening their private affections. 

Wollstonecraft thus breaks down the traditional 
republican distinction between public men and private 
women. She argues that the cultivation of virtue de- 
pends upon drawing more men into the home and 
more women into the public sphere. The inegalitarian 
division of society into public and private realms is just 
as corrupting as the hierarchical public world of sub- 
jects and kings. She also abandons the classical repub- 
lican association of human virtue with political and 
military glory. The main attribute of a virtuous individ- 
ual by her account is the self-conscious fulfillment of 
one’s responsibilities and duties to others. She calls 
upon both men and women to spend at least as much 
time and energy on domestic care as they devote to 
their political and business pursuits. 

Once the important role of care within Wollstone- 
craft’s philosophy is recognized, some of the contempo- 
rary criticisms of her thought are dispelled. Contempo- 
rary feminists claim, for example, that Wollstonecraft’s 
argument for women’s equality is contradictory (Gatens 
1991; Pateman 1989). She defines the duties of women as 
those of “mothers and wives” but fails to realize that these 
roles are at the root of women’s oppression. Pateman 
dubs this contradiction “Wollstonecraft’s dilemma.” That 
is, Wollstonecraft demands equal rights for women in the 
“gender-neutral” social world but simultaneously insists 
that women have special capacities and duties that differ- 
entiate them from men. The two demands are incompat- 
ible, according to Pateman (1989, 197), because they 
recognize “two alternatives only: either women become 
(like) men, and so full citizens; or they continue at 
women’s work, which is of no value for citizenship.” 

Although Pateman highlights an important problem 
for most classical liberal theories, her criticism does not 
apply to Wollstonecraft, who redefines the public 
sphere as a space where care is to be accorded special 
prominence. She calls for a “revolution” in social 
thinking whereby the caring duties of both men and 
women are to be given equal respect with and made 
essential to the obligations of citizenship. Both men 
and women are called upon to share equally in the 
care-giving and civic functions of citizenship. Pateman 
is correct to observe that Wollstonecraft never consid- 
ers the possibility of such radical policy initiatives as 
public subsidies for parenting activities, but this policy 
is not contrary to her philosophy, given the role she 
envisions for government in fostering care. She identi- 
fies the reforms she considers most necessary to de- 
velop a society more attuned to the value of care, while 
admitting that her proposals are only “sketches” 
(Rights of Woman, p. 71). She leaves it to others to 
develop her ideas. 

There is thus nothing contradictory about Wollsto- 
necraft demanding equal rights for women and exhort- 
ing them to fulfill domestic duties. In her vision of 
society, domestic duties are every bit as deserving of 
respect as public occupations, and men are similarly 
exhorted to fulfill their domestic duties while engaging 
in public life. If liberal society is still confronted with 
the dilemma of incorporating women into the public 
sphere as full citizens, it is only because it has failed to 
implement reforms necessary to complete Wollstone- 
craft’s revolutionary project. We still do not fully 
recognize the value of nurturing activities for public life 
or acknowledge the degree to which some of our most 
pressing social problems may stem from our neglect of 
care. 


CONCLUSION 


Wollstonecraft provides contemporary feminists with a 
model that combines justice and care. Throughout her 
writings, she identifies care as the foundation and end 
of social relations. She stresses the importance of 
nurturing family relations for creating autonomous 
individuals, and she defines moral progress as move- 
ment toward a more virtuous and dutiful society. At the 
same time, she argues that the development of nurtur- 
ing relations depends upon a just social structure, for 
only in such a structure can care find healthy expres- 
sion. Ultimately, she breaks down the dichotomy be- 
tween justice and care as well as between the public 
and private spheres by arguing that justice is an essen- 
tial feature of care. Justice facilitates the healthy 
expression of care and also provides a criterion for 
healthy care giving. 

Few contemporary feminists examine the influence 
of larger social forces on the delivery and expression of 
care, but Wollstonecraft makes this theme the focus of 
her political philosophy. She argues that under condi- 
tions of social or economic inequality, the delivery and 
expression of care are often distorted. Men neglect 
their wives and children, and women tyrannize over 
men and neglect or overindulge their children. Chil- 
dren then grow up without the capacity for sympathy or 
care. The implications for social policy are far-reach- 
ing. 

Wollstonecraft’s thought suggests that feminists (and 
others) concerned with care should focus more atten- 
tion on identifying and addressing the social inequali- 
ties that hinder the expression and delivery of care 
within society. Without a just social structure, care will 
be stymied, and duties will go unfulfilled. At the same 
time, a concerted effort should be made to elevate the 
status of caring activities within society. Wollstonecraft 
proposes public support for and direction of the edu- 
cation system, but her ultimate goal is to foster a 
revolution in thinking that will lead to equal respect for 
care-giving activities. In this regard, her thought also 
suggests that feminists (and others) concerned about 
care should focus more attention on identifying the 
institutions and conventions that diminish the impor- 
tance of care-giving functions within our society and 
should explore the possibilities for instituting public 
policies that will give more support and respect to 
nurturing activities. 

Wollstonecraft calls for a revolution in care but 
leaves it to others to identify the particular laws and 
policies that might be necessary to complete it. As 
women today struggle with the multiple demands of 
family, work, and civic activity, Wollstonecraft provides 
an inspirational vision of the interrelatedness of all 
these activities. Good mothers are, by her account, 
career women and politically active. The same is true of 
good fathers. The central tasks of government are to 
ensure that women and men have equal access to 
careers and politics and to encourage them to fulfill 
their care-giving duties. Although Wollstonecraft may 
not provide all the answers for promoting a nurturing 
liberal order, her writings direct us to consider impor- 
tant questions. In what ways might social injustices 
inhibit the delivery of care within our society, and how 
might government better support caring activities? If 
Wollstonecraft’s analysis is correct, the answers to 
these questions may hold the key to addressing some of 
the central pathologies—including selfishness, indiffer- 
ence, and irresponsibility—that afflict contemporary 
liberal societies. 

transition from protest to politics. I evaluate the link between black congressional representation and 
political engagement, as measured by voting participation. There are two related objectives: Construct 


the election of blacks to Congress negatively affects white political involvement and only rarely increases 


political engagement among African Americans. 


districts under the Voting Rights Act, a political 

world of white lawmakers and black activists has 
given way to one in which black legislators figure 
prominently. Voting rights advocates have long ex- 
pected minority officeholding to usher a small-scale 
revolution in electoral politics: Black congressional 
representation will lead not only to more progressive 
legislation but also to greater appreciation by African 
Americans of the instrumental value of political par- 
ticipation. At the core of these expectations is the 
presumption that political interest and engagement are 
as much a response to the political environment and 
the opportunities it is perceived to present as they are 
a function of individual resources, such as education. 
Black congressional representation, by contributing to 
new political optimism, could prime the pump of 
minority voter participation and pull the black commu- 
nity into the political process. 

The presumed link between black congressional 
officeholding and political engagement has not been 
examined directly by either political scientists who 
study participation or those concerned with minority 
representation. The latter emphasize legislative activ- 
ism over political behavior, and the former are slow to 
recognize minority officeholding as a salient external 
stimulus. In the absence of rigorous analysis, political 
observers are left to speculate about the participation 
effects. On the basis of low aggregate turnout rates in 
districts represented by black members of Congress, 
many conclude that black officeholding has failed to 
make voters out of unengaged minority Americans and 
may even depress turnout among these constituents 
(Donovan 1992; Duncan 1990, 1993a, 1993b; Swain 
1995). Morris (1992, 169, 172) dismisses the strategy of 
political office seeking as a “myopic quest” and argues 
that minority officeholding often degenerates into a 
“ritual of process” that fails to build interest, knowl- 
edge, or a sense of efficacy among blacks. The anec- 
dotal evidence is not encouraging, but more systematic 
analysis is required. The social and economic disloca- 
tion common to many of the congressional districts 
represented by blacks may be a factor in the low turn- 
out observed in these areas; the conventional wisdom 
about black officeholding fails to take this into account. 

I will evaluate the link between blacks in Congress 
and political engagement, as measured by voting par- 
ticipation. There are two related objectives: Construct 
a broader model of participation that takes into ac- 
count a key component of the political environment 
since the 1960s and more fully appreciate the political 
significance of minority officeholding by considering its 
nonpolicy consequences. The analysis will consider 
separately the political behavior of African Americans 
and whites, who often account for as much as 40% of 
the constituents represented by black House members. 
Using precinct data from midterm elections in eight 
states, I will estimate race-specific turnout rates and 
test a model that explains the variation in rates (across 
precincts) as a function of black officeholding, control- 
ling for differences in socioeconomic resources and 
electoral competitiveness. 
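
As a rough illustration of the kind of model described here, the sketch below regresses precinct turnout on an indicator for black representation plus controls for socioeconomic resources and electoral competitiveness. It is a minimal sketch that assumes ordinary least squares; the text at this point does not name an estimator, and the function name `ols`, the variable names, and all numbers are hypothetical values chosen only for illustration, not data from the study.

```python
def ols(rows, y):
    """Ordinary least squares via the normal equations (X'X) b = X'y.

    rows: one list of predictor values per precinct (an intercept
          column is added here).
    y:    observed turnout rate per precinct.
    Returns [intercept, b1, b2, ...].
    """
    X = [[1.0] + [float(v) for v in r] for r in rows]
    k = len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [
        [sum(xi[i] * xi[j] for xi in X) for j in range(k)]
        + [sum(xi[i] * yi for xi, yi in zip(X, y))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        if abs(A[pivot][col]) < 1e-12:
            raise ValueError("predictors are collinear")
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(k):
            if r != col:
                f = A[r][col] / A[col][col]
                A[r] = [a - f * b for a, b in zip(A[r], A[col])]
    return [A[i][k] / A[i][i] for i in range(k)]


# Hypothetical precincts: (black representative 0/1,
#                          socioeconomic index, competitiveness index)
precincts = [
    (0, 0.2, 0.1), (0, 0.8, 0.5), (1, 0.3, 0.0),
    (1, 0.9, 0.2), (0, 0.5, 0.9), (1, 0.6, 0.7),
]
# Hypothetical turnout generated from a known linear rule, so the
# fitted coefficients can be checked against the rule itself.
turnout = [0.5 - 0.1 * rep + 0.2 * ses + 0.05 * comp
           for rep, ses, comp in precincts]

coef = ols(precincts, turnout)
print([round(c, 3) for c in coef])
```

The coefficient on the first predictor is the quantity of interest in a model of this form: the difference in turnout associated with black representation once the two controls are held fixed. In the study this would be estimated separately for black and white turnout rates.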

I will demonstrate that the election of African 
Americans to Congress is accompanied by a lower level 
of political engagement among whites and only rarely 
contributes to greater political involvement among 
black constituents. Indeed, observers have misread the 
anecdotal evidence. The low turnout characteristic of 
districts with a black representative and uniformly 
attributed to African American constituents must be 
disaggregated. In reality, political participation among 
blacks is generally consistent with (and occasionally 
higher than) what one would expect for African Amer- 
icans who have limited resources and reside in districts 
with limited competition, and it is lower than what one 
would expect among whites. 

After a brief review of the context for this research, 
I will discuss the data, explain the ecological inference 
and causal models applied, and outline the primary 
hypotheses. The subsequent analysis finds support for 
the hypothesis that black congressional representation 
affects the level of participation among whites and 
African Americans. I then will examine the empirical 
results and alternative explanations for the political 
behavior observed. The article concludes with some 
thoughts on the implications of this research for the 
literature on participation and racial representation. 


LITERATURE REVIEW 


Social scientists have devoted decades of research to 
identifying the factors that influence an individual’s 
decision to participate in the political process (e.g., 
Conway 1991; Teixeira 1987; Verba and Nie 1972; 
Verba, Schlozman, and Brady 1995; Wolfinger and 
Rosenstone 1980). To a great extent, whether con- 
cerned exclusively with voting or with participation 
more generally, this research emphasizes the primacy 
of individual resources: age, education, and income. 
Some believe this traditional model falls short and 
introduce such variables as racial group consciousness 
to complement the baseline model and reinforce the 
focus on the individual (Conover 1984; Miller et al. 
1981; Shingles 1981). These scholars argue, in effect, 
that the decision to participate is primarily an internal 
process, a function of ability, knowledge, and interest, 
and is largely unresponsive to social and political cues. 

In the last ten years, a body of literature has emerged 
that recasts the participation issue, moving beyond the 
dominant socioeconomic framework to consider the 
role of political context in shaping individual political 
behavior (e.g., Leighley 1995; Mitchell and Wlezien 
1995; Rosenstone and Hansen 1993; Tate 1991, 1994). 
The aim is to reconstruct a compelling account of 
political life by recognizing that political choices are 
contingent on the surrounding environment, that is, 
reflect its constraints and opportunities (Huckfeldt and 
Sprague 1993). Rather than abstract individuals from 
time, place, and setting in an effort to make sense of 
their politics, contextual models argue for the necessity 
of multiple levels of analysis: In order to understand 
how people behave, we must understand where they 
live. In practice, the constraints and opportunities are 
as varied as registration laws that depress political 
participation (e.g., Nagler 1991; Timpone 1998), court 
rulings (e.g., Brown v. Board of Education), political 
campaigns (e.g., Jesse Jackson in 1984 and 1988) that 
inspire activism (McAdams 1982; Tate 1991), public 
policy reforms (e.g., more restrictive welfare provi- 
sions) that spur record rates of naturalization (Shaw, 
de la Garza, and Lee 2000), or economic heterogeneity 
that stimulates local political interest (Oliver 1999). As 
Rosenstone and Hansen (1993, 5) conclude, to under- 
stand electoral behavior we need to place individuals in 
society and identify the “aspects of political life that 
make people accessible and amenable to the appeals of 
political leaders.” 

Adopting the logic of this systemic perspective 
(Barker 1994), some scholars have asked whether local 
changes in the political status of African Americans 
affect political engagement (Bobo and Gilliam 1990; 
Gilliam 1996; Gilliam and Kaufmann 1998; Kaufmann 
1999). In the most complete research to date, Bobo 
and Gilliam (1990) find that African Americans in 
areas of high black empowerment—as indicated by 
control of the mayor’s office—are more active than 
either African Americans in low empowerment areas 
or whites of comparable socioeconomic status. Em- 
powerment, they conclude, influences black participa- 
tion by contributing to a more trusting and efficacious 
orientation toward politics (see also Abney and 
Hutcheson 1981; Howell and Fagan 1988) and by 
greatly increasing black attentiveness to political af- 
fairs. As for whites, the findings suggest that they pay 
less attention to local politics when blacks control local 
offices but do not become generally less trusting and 
efficacious as a result. In later research (Gilliam 1996) 
these conclusions are refined: The influence of minor- 
ity officeholding on white political behavior is contin- 
gent on ideological compatibility. That is, white resi- 
dents who might be considered members of the 
governing coalition (liberal Democrats in a city with a 
black Democratic mayor) remain engaged, but mem- 
bers of the outgroup retreat to the political margins. 

Despite the mounting evidence of a relationship 
between minority officeholding and political behavior, 
the study of participation effects has not made signifi- 
cant inroads into the scholarship on black congres- 
sional representation. (Also, minority empowerment 
scholars have not extended their research to include 
black members of Congress, arguably the most prom- 
inent class of black officeholders.) To date, analytical 
efforts have focused on the policy consequences of 
minority representation in Congress (Cameron, Ep- 
stein, and O’Halloran 1996; Canon 1999; Hutchings, 
McClerking, and Charles 1999; Lublin 1997; Swain 
1995; Whitby 1998). From this perspective, black elec- 
toral success is significant only insofar as it serves as a 
unique guarantor of continued influence on the legis- 
lative process. 

Two studies depart from the legislative bias manifest 
in much of the research on black members of Congress. 
Analyzing data from the 1996 congressional elections 
in Georgia (Bositis 1998; Voss and Lublin 2001) and 
Florida (Voss and Lublin 2001), the authors find 
evidence of black mobilization in districts represented 
by African Americans. In addition, Bositis (1998) finds 
that white registered Democrats may choose not to 
participate rather than support a black Democratic 
incumbent or the Republican challenger. Although 
these studies are a useful starting point, the exceptional 
circumstances surrounding the congressional elections 
in Georgia and Florida limit our ability to generalize 
from the findings. (The three African American incum- 
bents were running for reelection in majority-white 
districts, following a court order that eliminated the 
majority-black districts from which they initially were 
elected.) The observed behavior may be an artifact of 
unique electoral conditions and a poor indicator of 
what to expect in a more typical scenario (i.e., a black 
incumbent running in the district from which elected 
originally). Furthermore, Voss and Lublin (2001) and 
Bositis (1998) do not consider factors, such as socio- 
economic resources, that influence participation and 
distinguish the constituency of black representatives 
from other constituencies. 

I will extend the approach developed in the minority 
empowerment literature to evaluate the relationship 
between black congressional representation and polit- 
ical participation among whites and African Ameri- 
cans. I will use King’s (1997) solution to the ecological 
inference problem to estimate race-specific turnout 
rates from aggregate data, and I will take into account 
the effect of socioeconomic resources and electoral 
conditions on voting participation. 
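
The ecological inference problem can be made concrete with the deterministic bounds that aggregate data place on each group's turnout (the method of bounds, which King's approach incorporates). The sketch below computes only those bounds for a single precinct; it is not King's estimator, which adds a statistical model on top of them. The function name `turnout_bounds` and the sample figures are hypothetical.

```python
def turnout_bounds(black_share, turnout):
    """Bounds on black and white turnout rates in one precinct.

    Uses the accounting identity
        turnout = b * black_share + w * (1 - black_share)
    where b and w are the unknown black and white turnout rates,
    each constrained to lie between 0 and 1.

    black_share: black fraction of the voting-age population.
    turnout:     overall turnout as a fraction of voting-age population.
    Returns ((b_low, b_high), (w_low, w_high)).
    """
    x, t = float(black_share), float(turnout)
    if not (0.0 <= x <= 1.0 and 0.0 <= t <= 1.0):
        raise ValueError("shares must lie between 0 and 1")

    def bounds(group_share):
        # A group absent from the precinct is unconstrained by the data.
        if group_share == 0.0:
            return (0.0, 1.0)
        other_share = 1.0 - group_share
        # Lowest rate: the other group turns out completely.
        low = max(0.0, (t - other_share) / group_share)
        # Highest rate: the other group does not turn out at all.
        high = min(1.0, t / group_share)
        return (low, high)

    return bounds(x), bounds(1.0 - x)


# Hypothetical precinct: 90% black, 40% overall turnout.
black_bounds, white_bounds = turnout_bounds(0.9, 0.4)
print(black_bounds, white_bounds)
```

In a heavily black precinct such as this hypothetical one, the bounds on black turnout are narrow while the bounds on white turnout are uninformative, which is why precinct-level data with varied racial composition are valuable for this kind of estimation.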


THE DATA 


This article uses turnout data from midterm elections 
in eight states: Georgia, Michigan, New Jersey, and 
Pennsylvania in November 1990; Maryland, Missouri, 
Tennessee, and Virginia in November 1994.1 Together 
these states have 102 congressional districts. Ten were 
represented by blacks at the time of the midterm 
election, as shown in Table 1. These members of 
Congress represent a diversity of political styles, expe- 
riences, and constituencies. The sample includes virtual 
newcomers (e.g., Donald Payne and John Lewis in 
1990; Albert Wynn and Robert Scott in 1994) and 
veterans of elected office (e.g., John Conyers, Harold 
Ford, Bill Gray, George Crockett, Bill Clay); those who 
appealed to and enjoyed white support (e.g., Lewis, 
Payne, Gray, Mfume, Wynn, and Scott) and those who 
largely ignored the white constituency (Crockett, Con- 
yers, Clay); majority-black districts with a long history 


1 Since 1990, congressional districts in 21 states have elected African 
Americans. The eight states covered in this analysis were selected 
with attention to two criteria: geographic diversity and variation in 
district type. Southern and border states account for slightly more 
than half of the 21 states that have elected blacks to Congress, with 
the remainder drawn equally from the Northeast and Midwest. The 
eight states analyzed here reflect that regional variation. The sample 
includes historically black and newly black congressional districts. 
This borrows the terminology developed in Swain 1995 (p. 
47), which defines historically black districts as those with a black 
voting age population of more than 50% that have had black 
representation in Congress for ten years or more. The sample also 
includes one majority-white district (Missouri’s Fifth) represented by 
an African American incumbent, Alan Wheat. Because Wheat did 
not run for reelection in 1994, he is not discussed here. 


TABLE 1. Black Congressional Representatives in the Sample

Election Year    State          District   Representative    District Type
November 1990    Georgia        5          John Lewis        Newly black
                 Michigan       1          John Conyers      Historically black
                                13         George Crockett   Historically black
                 New Jersey     10         Donald Payne      Newly black
                 Pennsylvania   2          William Gray      Historically black
November 1994    Maryland       4          Albert Wynn       Newly black
                                7          Kweisi Mfume      Historically black
                 Missouri       1          William Clay      Historically black
                                5          Alan Wheat        Majority white
                 Tennessee      9          Harold Ford       Historically black
                 Virginia       3          Robert Scott      Newly black

Note: “Historically black” is defined (Swain 1996) as a district with black representation in Congress for at least ten years.
Alan Wheat, who served from 1982 to 1994, was the first and only black congressman to represent Missouri’s majority-white Fifth District. He gave up





of black representation (e.g., Michigan’s First) and two 
districts, Virginia’s Third and Maryland’s Fourth (the 
nation’s largest black suburban community), newly 
created in the 1990 redistricting. 

With the exception of George Crockett, all the black 
officeholders were running for reelection. Crockett 
announced his retirement in March 1990, and most of 
the activity for his seat, which represents downtown 
Detroit and Grosse Pointe, took place in the primary. 
Idelson (1990a, 3319), reporting for the Congressional 
Quarterly, noted in August 1990 that the race “crystal- 
lized early into a scramble to catch the front-runner 
[Barbara Rose] Collins,” who had an immediate base 
from her strong 1988 House campaign (Idelson 1990a). 
City Councilwoman Collins, with the crucial support of 
Mayor Coleman Young, defeated a crowded field to 
enter the November election with virtually no compe- 
tition, a position not unlike that enjoyed by strong 
incumbents nationwide (Idelson 1990b). 

The data for each state are aggregated at the pre- 
cinct level and consist of turnout results and Census 
demographics for every precinct in the state. (The data 
are not limited to the precincts represented by African 
Americans.) In total, there are more than 28,000 
precinct observations. The primary turnout measure is 
total ballots cast as a proportion of the total voting age 
population in the precinct.2 (Below, I will discuss how
this aggregate figure is used to derive race-specific 
turnout rates.) In addition, the data include informa- 
tion about the race, party affiliation, and tenure of each 
precinct’s congressional representative. Measures of 
electoral competitiveness are also included (from Bar- 
one and Ujifusa 1991, 1995; Duncan 1991, 1995). 

The data on electoral participation and political 


2 For the 1990 elections, the turnout data were assembled by the 
author as part of the Record of American Democracy Project (King et
al. 1997) at Harvard University. For 1994, the turnout data were
provided by the National Committee for an Effective Congress, an 
organization that offers consulting services to political candidates. 
context are supplemented by a number of demographic
indicators that measure the social and economic conditions
within each precinct. The demographic measures
differ slightly by election year but are similar
enough to be considered comparable. For November 
1990, the Census data were extracted from the state’s 
PL94-171 file, compiled by the Bureau of the Census 
(with the voluntary participation of the states) in 
preparation for the 1990 redistricting cycle, and from 
the state’s STF3A file. 

PL94-171 files provide information on the racial
composition of precincts but not detailed socioeconomic
information, which is taken instead from the
minor civil division (MCD) in which each precinct is
located. The loss of accuracy that results from the 
substitution of MCD data for precinct data is relatively 
minor, since in many cases MCDs consist of no more 
than a handful of precincts.4 The MCD surrogate 
allows for a sense of the socioeconomic context in 
which precinct residents live. The available MCD data 
are disaggregated by racial group, with separate socio- 
economic indicators for whites and for blacks. From 
the MCD data I extract several measures. For each 
racial group, Education is measured as the proportion 
of residents (over age 25) in the MCD with at least 
some college education, including those who complete 
college and graduate work. Per Capita Income for 
whites and for blacks in the MCD is included. Finally, 
I use as a measure of homeownership the proportion of 
white and black households that are renter occupied 
(Proportion Renters). 

As opposed to the 1990 data, which were assembled 
by the author, the November 1994 data were provided 
by the National Committee for an Effective Congress 
(NCEC). Rather than socioeconomic data at the MCD 
level, that information for 1994 was at the precinct 
level, based on the 1990 Census but updated to reflect 
population changes as of 1995. The one limitation is 
that the precinct information is not disaggregated by 
racial group. (The data do indicate the racial composition
of each precinct.) Education for the 1994 election
cycle is measured as the proportion of total
precinct residents (over age 25) with at least some 
college education. Income is measured as median 
household income rather than per capita. Neighbor- 
hood stability is measured as the proportion of total 
precinct households that are renter occupied. Despite 
clear differences in the precise measures employed 
across these election years, there is no reason to believe 
that these should significantly alter the estimates of the 
primary relationship of interest—that between black 
representation and voting participation. 





3 Each state’s 1990 Summary Tape File 3A (STF3A) (U.S. Bureau of
the Census 1990b) contains detailed population and housing characteristics
collected by the Bureau of the Census using the long form,
the extended questionnaire mailed to one in six American households.
The Public Law 94-171 file (U.S. Bureau of the Census 1990a)
also contains data compiled as part of the 1990 Census of Population
and Housing. The file contains only the selected decennial Census
tabulations (race and Hispanic origin) that the Bureau of the
Census is required by law to provide to the states by April 1 of the
year following the census.

4 See Appendix A for summary statistics on the distribution of MCDs 
and precincts. 
TABLE 2. The Ecological Inference Problem

Voting Age               Voting Decision
Population         Vote              No Vote
Black              $\beta_i^b$       $1 - \beta_i^b$       $X_i$
White              $\beta_i^w$       $1 - \beta_i^w$       $1 - X_i$
                   $T_i$             $1 - T_i$

THE MODEL 


Two Stages of Analysis 


Identifying the relationship between turnout and black 
congressional officeholding is a two-step process. The 
first stage applies a model of ecological inference to 
estimate the rates of white and black turnout for each 
precinct and congressional district in the sample. In the 
absence of individual-level data (e.g., survey data), 
these rates are calculated on the basis of aggregate 
statistics on the voting age population in each area. 
Table 2 illustrates what is involved in using these 
available aggregate statistics to infer back to the specific
quantities of interest. In Table 2, $X_i$ represents the
proportion of the voting age population who are black,
and $T_i$ represents the proportion of the voting age
population who turn out to vote in precinct $i$.5 The goal
of the first stage is to use these marginals, the only
observed data, to estimate the quantities within the
body of the table that are not observed directly.
Specifically, the quantities of interest are $\beta_i^b$, the
proportion of voting age blacks who vote, and $\beta_i^w$, the
proportion of voting age whites who vote.
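
The link between the observed marginals and the unobserved cell quantities can be written as an accounting identity. This is a standard restatement from the ecological inference literature rather than an equation reproduced from this article; the symbols follow the definitions above, with $\beta_i^b$ and $\beta_i^w$ denoting the black and white turnout rates in precinct $i$.

```latex
\[
T_i = \beta_i^b X_i + \beta_i^w (1 - X_i),
\qquad 0 \le \beta_i^b,\ \beta_i^w \le 1 .
\]
```

Because each precinct supplies one equation in two unknowns, the marginals alone cannot pin down both rates, which is why the first stage relies on a statistical model.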

In stage one, $\beta_i^b$ and $\beta_i^w$ (and accompanying standard
errors) are calculated for each precinct in each state
using the estimation procedure, EI, developed in King
(1997).6 In order to control for electoral effects, the
analysis first sorts the precincts by congressional district
and then independently analyzes each of the 102
district groups.7 The ecological inference model applied
here takes into account the multiple precinct- and
MCD-level factors that may influence the (unobserved)
precinct-level rates of black and white turnout.
Namely, the model posits that $\beta_i^b$ and $\beta_i^w$ are each


functions of four exogenous covariates: (median or per 


5 Latinos are included in this analysis on the basis of their racial


Edu. The programs were run in GAUSS 3.2 for Windows and 
GAUSS for Unix. 

7 The reason for grouping the geographic units in this way is fairly
straightforward. In estimating the parameters of interest in precinct
$i$, King’s (1997) method of ecological inference not only uses the
aggregate statistics available for precinct $i$ itself (i.e., $X_i$ and $T_i$) but
also “borrows” from the information available in the other precincts.
If the estimates of precinct i are partially determined by the 
information in the rest of the data set, then that information should 
be made as relevant as possible. The more that all these precincts 
have in common, the more relevant their information will be to 
estimating the quantities of interest in any one precinct. 
capita) income, homeownership, education, and black
population density.8 The literature on participation has
demonstrated the role of such factors in both motivat- 
ing and enabling political participation. 

These factors may be particularly important to un- 
derstanding differences in voter turnout between dis- 
tricts with and without black representation, in light of 
the social and economic dislocation characteristic of 
many of the majority-minority districts that elect African
Americans to the House.9 Furthermore, by explicitly
modeling the dependence of $\beta_i^b$ and $\beta_i^w$ on these
demographic and socioeconomic factors, the first-stage
analysis sidesteps the well-documented problems (Cho
1998; Cho and Gaines 2001; Freedman et al. 1998;
Freedman et al. 1999; Herron and Shotts 2000a)
associated with the basic application of King’s EI
technique (i.e., the problematic assumption of no
aggregation bias).10

There are two states for which precinct-level $\beta_i^b$ and
$\beta_i^w$ estimates are generated using a pared-down EI
model: Michigan and Pennsylvania. All the precincts in
Pennsylvania’s First, Second, and Third congressional
districts fall into a single MCD (Philadelphia), as do all
the precincts in Michigan’s First and Thirteenth (Detroit).
Because of the resulting constancy in MCD-level
covariates within each of these five district data sets,
the EI model used to generate $\beta_i^b$ and $\beta_i^w$ estimates for
the precincts in each data set could only include
precinct-level black population density as a covariate. 
Although it is possible to apply the full model to 
precincts in every other district in these two states, in 
the interest of comparability, I extend the pared down 
specification to the analysis of all the districts in 
Pennsylvania and Michigan. 

In the second stage of analysis, the precinct estimates
of $\beta_i^b$ and $\beta_i^w$ are pooled across districts for each
state and used as dependent variables in regressions
that model black turnout and white turnout as functions
of district-level political conditions, including
black congressional representation and electoral com- 


8 Black population density ($X_i$) is included separately as an exogenous
covariate, even though it primarily seems to affect turnout
indirectly through the other three socioeconomic covariates. (That is,
in ordinary least-squares regressions of $T_i$ on $X_i$, including the other
covariates often drives the coefficient on $X_i$ to statistical insignificance.)
There were enough cases, however, in which the relationship
between turnout and black population density extended beyond
these intervening variables to justify separate inclusion in the EI
model.


9 Taking these factors into account in explaining the low turnout in
congressional districts represented by blacks is a significant departure
from academic research and journalistic inquiries
(Bositis 1998; Donovan 1992; Duncan 1990, 1993a, 1993b; Voss and
Lublin 2001).

10 See Appendix B for further discussion of this issue and details on
EI model specification.

11 For precincts in all the districts with variation in the MCD-level
covariates (Pennsylvania Fourth through Twenty-third, Michigan
Second through Twelfth and Fourteenth through Eighteenth), I
tested the full array of seven models discussed in Appendix B. The
correlations observed across these specifications were similar to
those observed in other states. (The $\beta_i^b$ estimates in Pennsylvania
were generally more stable across specifications than the same
estimates in Michigan.) The final EI model adopted for these two
states includes only precinct-level black population density,
dichotomized at the district median.


petitiveness.12 This second stage regression uses 
Lewis’s (2000) feasible generalized least-squares 
(FGLS) estimator. As does the more common vari- 
ance-weighted least-squares approach, the estimator 
takes into account the uncertainty in the estimates of 
$\beta_i^b$ and $\beta_i^w$. Lewis demonstrates, however, that FGLS
also significantly improves on the inefficiency and overconfidence
that can result from the weighted least-squares
approach to models with estimated dependent
variables.14
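
The logic of a second-stage regression on estimated dependent variables can be sketched in a few lines. The block below is a minimal illustration, not the Stata code by Jeff Lewis used in this article, and the exact estimator in Lewis (2000) may differ. It assumes the regression error has two parts: a common variance `sigma2` and a known, observation-specific sampling variance `omega2` (the squared standard error of each EI estimate, assumed positive). The function name `fgls_estimated_dv` and its arguments are hypothetical.

```python
import numpy as np


def fgls_estimated_dv(y, X, omega2):
    """Sketch of FGLS when the dependent variable is itself an estimate.

    y      : (n,) estimated outcomes (e.g., precinct turnout estimates)
    X      : (n, k) design matrix that already includes a constant column
    omega2 : (n,) sampling variances of y (squared standard errors, > 0)
    Returns (coefficients, standard errors, estimated sigma2).
    """
    y = np.asarray(y, dtype=float)
    X = np.asarray(X, dtype=float)
    omega2 = np.asarray(omega2, dtype=float)
    n, k = X.shape

    # Step 1: ordinary least squares, to obtain residuals.
    xtx_inv = np.linalg.inv(X.T @ X)
    b_ols = xtx_inv @ X.T @ y
    resid = y - X @ b_ols

    # Step 2: method-of-moments estimate of the common error variance.
    # E[e'e] = sigma2 * (n - k) + sum(omega2) - tr((X'X)^-1 X' G X),
    # where G = diag(omega2); solve for sigma2 and floor it at zero.
    trace_term = np.trace(xtx_inv @ (X.T * omega2) @ X)
    sigma2 = (resid @ resid - omega2.sum() + trace_term) / (n - k)
    sigma2 = max(0.0, sigma2)

    # Step 3: weighted least squares with weights 1 / (sigma2 + omega2).
    w = 1.0 / (sigma2 + omega2)
    xtwx_inv = np.linalg.inv((X.T * w) @ X)
    beta = xtwx_inv @ (X.T * w) @ y
    se = np.sqrt(np.diag(xtwx_inv))
    return beta, se, sigma2
```

Weighting by the sampling variance alone would treat noisy estimates as the only source of error. Adding the estimated common variance keeps precisely estimated precincts from dominating the fit, which is the overconfidence problem described above.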


Hypotheses 


I will examine two hypotheses about the relationship 
between black congressional representation and polit- 
ical participation. First, African Americans residing in 
areas represented by a black House member are more 
likely to go to the polls than are similar blacks in 
districts with a white incumbent. Second, white constit- 
uents in these same black-represented districts are less 
likely than their counterparts in white-represented 
districts to go to the polls. 

In testing these hypotheses, I take into account 
elements of the electoral context that distinguish pre- 
cincts and districts in ways relevant to participation. 
These elements are whether an Incumbent is running, 
how long s/he has been in office (Tenure), and the 
competitiveness of the House race (Winning Vote Mar- 
gin). To the extent that black representation is corre- 
lated with these political factors, the variable serves as 
a proxy for both the racial and nonracial characteristics
unique to these members of Congress. In particular,
electoral competition is credited with increasing participation
(Barker, Jones, and Tate 1999; Gilliam 1985).
The lack of competition characteristic of politics in the 
congressional districts that most African Americans 
represent may itself be a major barrier to participation. 
Taking this factor into account allows us to evaluate 
whether political engagement is any higher or lower 
than what one would expect under a white representa- 
tive in an equally uncompetitive district (e.g., in a 
district drawn to preserve partisan advantage). 





12 In the case of Michigan and Pennsylvania, the MCD-level socioeconomic
covariates (income, education, and homeownership) that
could not be included in the first EI analysis were included as
controls in the second-stage regressions.

13 The FGLS procedure is implemented in Stata 6.0 using program
code written and provided by Jeff Lewis.

14 Lewis’s (2000) FGLS estimator does not, however, correct for the
statistical inconsistency identified by Herron and Shotts (2000b) in
their critique of EI-based second-stage regressions. As a result of EI
estimates that are themselves consistent, and measured with an
error negatively correlated with the parameter’s true value, Herron
and Shotts argue that coefficient estimates in regressions that use EI
estimates as dependent variables will have attenuation bias. In
practical terms, the regression results presented in tables 3 and 4 may
understate the effect of black representation on voting participation.
Herron and Shotts outline a potential solution that may correct for
the statistical inconsistency. At the time of this writing, that solution
has not been sufficiently tested, and the tradeoffs involved in its
implementation, such as the effect on variance, have not been fully
enumerated.

15 Recall that, with the exception of Pennsylvania and Michigan, the
socioeconomic influences on political participation are taken into
account in the first-stage EI model.
RESULTS


In the first stage of the analysis, I used ecological 
inference to estimate white and black turnout rates for 
each of the precincts and congressional districts in the 
eight states covered. Figure 1 summarizes the (weighted)
district-level estimates of black and white turnout,
allowing for an initial look at the variation in behavior
across political settings. Each plot has standard error
bars, which capture the uncertainty in the district
estimates. The horizontal bars measure uncertainty (±
1 standard error) in the black turnout estimates. The
vertical bars measure uncertainty in the white turnout
estimates. The overall pattern in the relative width and
height of the error bars reflects the far greater certainty
of estimates for white as compared to black turnout.

Because most congressional districts are dominated
by precincts that are overwhelmingly white, the data
contain more information on which to make
inferences about white as compared to black voting
behavior. For example, imagine a precinct that is 80%
white and 20% African American. A 40% voting age
turnout is consistent with a black turnout rate ranging
from 0 to 100% but a range for whites between 25%
and 50%. Thus, the data are sufficiently informative to
produce narrow error bars around the white estimate,
but there is great uncertainty in the black estimate. The
districts represented by black members of Congress— 
identified by name in each plot—are the exceptions, 
because they consist of precincts that are predomi- 
nantly black. In these districts, the estimates of black 
turnout have the narrow error bars more characteristic 
of the white turnout figures elsewhere in the plots. 
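
The ranges in the 80%/20% example follow from arithmetic alone: each group's turnout rate must lie between 0 and 1, and the two rates must reproduce the observed precinct turnout. The sketch below computes these deterministic bounds. It is an illustration of that arithmetic only, not the EI estimation procedure itself, and the function name `turnout_bounds` is hypothetical.

```python
def turnout_bounds(x_black, turnout):
    """Deterministic bounds on race-specific turnout in one precinct.

    x_black : black share of the voting age population (strictly between 0 and 1)
    turnout : ballots cast as a share of the voting age population
    Returns ((black_low, black_high), (white_low, white_high)).
    """
    if not 0.0 < x_black < 1.0:
        raise ValueError("x_black must be strictly between 0 and 1")
    if not 0.0 <= turnout <= 1.0:
        raise ValueError("turnout must be between 0 and 1")
    x_white = 1.0 - x_black

    # Lowest rate for one group: the other group turns out at 100%.
    # Highest rate for one group: the other group does not vote at all.
    black = (max(0.0, (turnout - x_white) / x_black),
             min(1.0, turnout / x_black))
    white = (max(0.0, (turnout - x_black) / x_white),
             min(1.0, turnout / x_white))
    return black, white


# The example from the text: 20% black, 80% white, 40% turnout.
black_range, white_range = turnout_bounds(0.20, 0.40)
print("black:", black_range, "white:", white_range)
```

For the precinct in the text, the black bounds span the full 0 to 100% range and the white bounds run from 25% to 50%. Reversing the racial composition (for example, a precinct that is 90% black) narrows the black bounds instead, which is why estimates in black-represented districts are more precise.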
With the exception of Tennessee, participation in
the November 1990 and 1994 midterm elections was
generally low; it rarely exceeded 45% of the voting age
population among either whites or blacks. In 75% of
the House districts, African-American turnout ranged
between 15% and 44%; it dipped below 15% in half a
dozen cases. White turnout, more tightly clustered in
the 20-40% range, never fell below 17% of age-eligible
adults. Within states, African-American turnout
tended to be considerably more variable across
districts than was white turnout. The white participation
gap between the lowest and highest turnout districts
in a state never exceeded 15-20 percentage
points, whereas the gap among black voters was never
less than 35 percentage points.16
In slightly more than half the congressional districts, 
white adults of voting age participated at higher rates 
than did African Americans in the same district. Black 
turnout in many of these districts trailed white turnout 
by more than 10 percentage points. (In Missouri, there 
were two cases in which white turnout exceeded black 
turnout by more than 25 percentage points.) Yet, 
there were a number of districts (46) in which African 
Americans participated more than whites. Most of the 
participation gaps were relatively small (e.g., more than
were less than 10 points), but in Georgia’s Tenth 


16 This is calculated on the basis of the point estimates. The gaps are
even wider when uncertainty is factored in.

17 In the aftermath of the 1994 midterms, many commentators cited
this participation gap between whites and blacks as critical to the
success of the Republican sweep.
the gap reached 29 percentage points. New Jersey is
the one state in which voting participation rates among
blacks and whites were roughly comparable (within 3
percentage points) in most districts. 

Tennessee, by far, had the highest turnout rates for 
both whites and African Americans. In the majority of 
districts, well more than 50% of voting age blacks cast 
ballots. In the Ninth, represented by ten-term black 
Democrat Harold Ford, African-American turnout was 
highest (77%). White participation was similarly high 
throughout the state, ranging from 51% to 69%. In 
three cases, including Ford’s district, white turnout 
trailed black turnout by about 10 points. 

The pattern observed in Ford’s Memphis district was 
similar to the dynamics witnessed in two other districts 
represented by black members of Congress. In Penn-
sylvania’s Second (represented by Bill Gray) and Mich- 
igan’s First (represented by John Conyers), black turn- 
out exceeded white turnout by 6 and 4 percentage 
points, respectively. Among the remaining seven con- 
gressional districts represented by blacks, African- 
American turnout was either comparable to (Virginia’s 
Third) or slightly trailed white turnout. Aside from 
Tennessee, white participation was often at its lowest in 
districts represented by blacks. For example, in New 
Jersey, where white voters turned out at rates as high as 
33%, in Newark only 18% of whites went to the polls. 

Even at the aggregate level, taking only socioeco- 
nomic covariates into account, there is some evidence 
that political behavior may differ under black represen- 
tation. The patterns in Figure 1 most strongly hint at 
demobilization among white constituents and show 
only limited signs of higher turnout among African 
Americans. As noted earlier, however, the districts and 
precincts represented by black members of Congress 
typically face electoral conditions, such as less compe- 
tition, that distinguish them from other areas in the 
state in ways relevant to participation. In order to 
isolate voting behavior unique to precincts with black 
representation, these areas must be evaluated in light 
of the patterns observed among other constituents 
under comparable political conditions. The multivari- 
ate analyses presented in tables 3 and 4 disentangle 
race from other aspects of the political environment, 
which allows a fuller appreciation of the political 
salience of black representation. 

Table 3 summarizes the results of multivariate mod- 
els that predict white turnout. Each model has controls 
for party affiliation, tenure, incumbency, and electoral 
competitiveness. In addition, the models for Pennsyl- 
vania and Michigan include the MCD-level socioeco- 
nomic covariates that could not be incorporated in the 
first-stage EI model due to constancy in the values of 
the covariates in several districts. The results are 
consistent with what was observed at the aggregate 
level. With only two exceptions, white turnout rates in 
precincts represented by black members of Congress 
trailed those observed in precincts represented by 
other (white) Democratic incumbents who faced lim- 
ited electoral competition. The difference between the 
number of eligible voters who went to the polls in 
precincts represented by black as compared to white 
FIGURE 1. Estimated Congressional District Turnout, November 1990 and November 1994
TABLE 3. Predicting White Turnout in 1990 and 1994 Midterm Elections
representatives ranged from as little as 4.5 (± 1.8)
percentage points in Virginia (Robert Scott) to 18.3 (±
1.9) points in New Jersey (Donald Payne). The white
constituents of Grosse Pointe and Grosse Pointe Park,
the wealthy enclaves in the far eastern corner of
Crockett’s Thirteenth, turned out at a rate 5.6 (± 1.7)
percentage points below that of their counterparts in
similar areas elsewhere in Michigan represented by
white legislators. In John Conyers’s neighboring district,
where the white population consists primarily of
middle-class ethnics in the northeastern and southwestern
corners of Detroit, white turnout in 1990 was 7.2
(± 1.5) percentage points lower than what one would
have expected. In states such as Georgia and Missouri,
where summary statistics suggest that white constituents
of black House members should behave no differently
from their counterparts elsewhere in the state
(recall Figure 1), turnout lagged by 11.5 (± 5.0) and
16.8 (± 15) points, respectively.
Whether the incumbent is fairly new to the political
stage or has more than a decade of service, white 
electoral participation is lower when the congressional 
representative is black. Even in Philadelphia, twelve 
years into the tenure of a man whose moderate style 
actively deemphasized race and racial issues, white 
constituents showed political reticence. This reticence 
is substantively significant when one considers the 
generally low level of participation characteristic of 
midterm elections, when rates rarely top 45%. In that 
context, the 4.6 (± 2.3) point lag among Kweisi
Mfume’s white Baltimore constituents is noteworthy. 

In two instances, white constituents departed from 
the norm of political disengagement. In Maryland, 
Albert Wynn, a young black congressman who—like 
many of his political generation—adopted a biracial 
approach to representing his newly created majority- 
black district, made his first bid for reelection in 1994. 
Wynn’s white constituents, concentrated in the middle 
Turnout 1990
Independent Variables      Georgia         Michigan        New Jersey
Constant                   .719 (.071)     .328 (.317)     .338 (.020)
Black Representatives
  John Lewis               .104 (.022)
  George Crockett                          -.216 (.176)
  John Conyers                             -.012 (.021)
  Donald Payne                                             .007 (.016)
  Bill Gray
  Bill Clay
  Harold Ford
  Robert Scott
  Kweisi Mfume
  Albert Wynn
SES Conditions*
  Per capita income                        .009 (.030)
  Proportion renters                       .036 (.060)
  Education                                -.099 (.057)
Electoral Conditions
  Incumbent candidate      -147 (17  -.093 013
  Party of incumbent       -.153 (.037)    .142 (.020)     .080 (.005)
  Tenure x incumbent       .037 (.007)     .001 (.003)     .001 (.002)

Number of cases            647             2,999           4,768

“Per Capita Income” is measured as the log of black per capita income.





and upper-middle income communities of Montgom- 
ery County, showed no signs of political reticence: 
Turnout rates were consistent with levels observed in 
similar precincts elsewhere in the state. Judging by this 
response, neither black electoral success nor the mech- 
anisms that secured it undermined the perceived utility 
of political engagement. In fact, the continued engage- 
ment of whites in the electoral process may reflect the 
wisdom of Wynn’s decision to “eschew expressions of 
militancy for pronouncements on national issues” 
(Barone and Ujifusa 1995).18 

In the second instance, black-represented precincts 
in Tennessee were the sites of political mobilization 
among whites (and, as will be discussed below, among 


18 By comparison, Robert Scott, who also practiced the “politics of
commonality” (Canon 1999) in his Virginia district, experienced
demobilization among his white constituents. This pattern may
reflect the “disenfranchisement” and “adverse” effect alleged by the
plaintiffs in Meadows v. Moon (1997), the U.S. district court decision
that struck down Scott’s district.


TABLE 4. Predicting Black Turnout In 1990 and 1994 Midterm Elections 
blacks). The “revolutionary” zeal that propelled the
state to dramatic political changes in 1994, evident in
the summary statistics in Figure 1, showed no signs of
moderation in Harold Ford’s Memphis district. In fact,
participation rates in that district were 7.3 (± 2.1)
percentage points higher than in precincts represented 
by other long-term Democratic incumbents with mod- 
erate competition. Ford is unique among African- 
American incumbents in that he represented a racially 
balanced district in which whites (as a sizable minority) 
posed a credible threat to black political dominance. 
Ford’s district has the smallest black majority of any in 
this sample: African Americans make up only 54% of 
the voting age population. Rather than withdraw from 
politics (the response in eight of the ten cases exam- 
ined), white constituents may have recognized and 
adapted to the opportunities made available by their 
sheer size. A history of relatively narrow margins (Ford 
won with 58% of the vote in 1990, 1992, and 1994) 
served as a constant reminder that black electoral
success is fragile in areas where racial polarization in
vote choice is the norm. Ford may have mobilized
white constituents because he symbolized the triumph
of a narrow black majority.19

Ford’s district was the site of heightened political 
engagement among African Americans as well and for 
that reason is somewhat unique: Only rarely does black 
representation favorably affect black turnout. Table 4 
summarizes the results of FGLS regressions that pre- 
dict black turnout as a function of black representation, 
controlling for the electoral conditions in the precincts 
under study. As evidenced by the coefficients in the top 
half of the table, in only four cases can one conclude 
that black House members experienced a level of 
African-American political activity that differs from 
what one would expect in similar districts represented 
by whites. In six cases, there appears to be no appre- 
ciable difference. A word of caution is in order here, 
however. As detailed in Appendix B, the black turnout 
results from Michigan (Crockett and Conyers), Vir- 
ginia (Scott), and Maryland (Wynn and Mfume) 
proved to be highly sensitive to changes in the EI 
model specification. The results of the second-stage 
regressions for these three states varied between a 
finding of no statistically significant effect, which is the 
result reported in Table 4, and a statistically significant 
positive effect. Because the results did not hold across 
multiple specifications, it is not possible to draw defin- 
itive conclusions about black political behavior under 
black representation in these three states. For the 
other five states, the results were robust across EI
model specifications, including the finding of no statis- 
tically significant effect in New Jersey (Payne). We can 
feel confident in that set of results. 

Although rare, when black congressional officehold- 
ing is associated with greater voter turnout, thousands 
of African Americans participate who otherwise would 
play no role in the electoral process. In Missouri, black 
voter participation was 25.5 (± 10.9) percentage points
higher in Bill Clay’s St. Louis district than in any other
district represented by long-term Democratic incumbents.
In Atlanta, where turnout among whites lagged
almost 12 points behind rates in similar districts in
Georgia, political engagement among African Americans
was up 10.4 (± 2.2) percentage points. Again,
these patterns of heightened mobilization under black 
representation hold up across multiple model specifications. 

19 The political dynamics in Memphis hint at what might be expected 
in any district with a delicate racial balance, such as a heterogeneous 
majority-minority district with significant numbers of Latinos as well 
as African Americans and non-Hispanic whites. In such a district, in 
which blacks and Latinos comprise a narrow and fragmented majority, 
whites may recognize the potential electoral influence of a highly 
mobilized minority. This potential influence may be enhanced by the 
high rate of noncitizenship that keeps many Latinos of voting age 
from the polls. In California, for example, it is estimated that 52% of 
Latino adults cannot vote because they are not citizens (Stiles et al. 
1998). There are very few Latinos in any of the black-represented 
districts covered here. More than half of the heterogeneous majority- 
minority districts created in the 1990 redistricting cycle are in 
California and Texas. Future research should focus on the multiracial 
dynamics in these districts. 

New Jersey stands out as a case in which the results 
of the second-stage regressions consistently indicate 
that black mobilization did not accompany black elec- 
toral success. Although the Tenth had been majority- 
black for more than a dozen years, Donald Payne, 
elected in 1988, was the first African American to 
represent the district. His election followed the reluc- 
tant departure of Peter Rodino, who was encouraged 
to retire by black leaders eager to send an African 
American to Congress. Perhaps the unresponsiveness 
of black constituents in Newark reflects the racial 
indifference that kept Rodino in office long after the 
makeup of his district had changed. In that respect, 
African Americans in Newark exhibited the same 
independent streak witnessed in the Atlanta area in the 
early 1980s. After court-ordered redistricting, Geor- 
gia’s Fifth was transformed from a majority-white 
district that elected an African American to Congress 
(Andrew Young in 1972, 1974, and 1976) to a majority- 
black district with a white incumbent (Wyche Fowler). 


DISCUSSION 


One limitation of aggregate data is that it does not 
provide a window into individual motivations. I can 
only speculate about the reasons for the political 
behavior observed under black representation, in par- 
ticular the striking asymmetry between its uniformly 
negative relationship to white voter participation and 
its much less consistent effect on black turnout. Future 
research should explore more fully the individual atti- 
tudes that animate the political dynamics in these 
districts. Bobo and Gilliam (1990) suggest that the 
influence of minority officeholding on trust and efficacy 
accounts for its effect on political engagement. To 
some extent, the patterns of participation documented 
in this research lend themselves to a similar interpre- 
tation. In Georgia, Pennsylvania, Missouri, and Ten- 
nessee, where greater political activity was exhibited 
among African Americans in black-represented dis- 
tricts, black officeholding may foster confidence in 
elected officials (and, possibly, in the institutions in 
which they serve) and the perception of greater minor- 
ity influence on the direction of politics. Conversely, 
and apparently more consistently, black officeholding 
may erode confidence among whites. 

The increase (among blacks) and the erosion 
(among whites) of trust and efficacy may be the by- 
products of purely symbolic politics, or they may reflect 
substantive concerns about policy responsiveness 
and/or constituency service. Abney and Hutcheson 
(1981, 100) argue that “leaders may be significant 
agents of opinion change simply as a result of the 
images they project,” regardless of the policies they 
pursue. After an era in which it was not uncommon for 
states and localities to discourage minority political 
participation, black representation may help eliminate 
the impression among African Americans that elec- 
tions are none of their business. Furthermore, blacks in 
Congress may build group pride, providing what Gilliam 
and Kaufmann (1998, 743) describe as “a reservoir 
of psychic benefits” that contributes to political 
engagement. From this “symbolic politics” perspective, 
it is African-American elected officials in their capacity 
as role models who pull the black community into the 
political process. Meanwhile, the message of black 
electoral success for whites may be that they play a 
diminished role in political life. 

Alternatively, the relationships may have more to do 
with the substantive consequences rather than symbolic 
significance of black congressional officeholding. Cony- 
ers and Wallace (1976, 115) observe that constituents 
tend to define black elected officials as the “specialized 
representatives of black people.” In a country in which 
politics historically has been an important vehicle in 
the mobility (and “mainstreaming”) of racial and eth- 
nic groups, white and black constituents alike may 
equate minority officeholding with the advancement of 
a minority public policy agenda. (In reality, the link is 
more ambiguous and, as Canon [1999] demonstrates, 
involves fewer tradeoffs.) For white constituents, the 
perception that a black officeholder’s chief priority is to 
“solidify, manage, and protect” (Morris 1992, 170) 
minority interests—and that minority interests and 
white interests are necessarily incompatible—may 
erode confidence in the utility of political involvement. 
We would expect African Americans, however, to 
embrace and be energized by these priorities. (Add to 
this the special efforts of black members of Congress, 
guided by an election calculus that defines African 
Americans as their “natural constituencies,” to main- 
tain the political involvement of this core of voters.) Of 
course, this only begs the question of why we do not 
observe a more consistent pattern of African-American 
mobilization in districts with a black representative. 
Perhaps there is an element of truth to Morris’s (1992, 
173) lament that, with the election of black politicians, 
the “black masses have tended to become politically 
quiescent, believing that their black elected officials 
would realize their interests.” 

Evidence from survey data suggests that a preoccu- 
pation with the substantive consequences of black 
congressional representation probably accounts for the 
behavior I observe. Research has found that white 
constituents perceive greater ideological and policy 
conflict with black members of Congress, tend not to 
assess them favorably, and are reluctant to contact 
them on issues of concern (Gay 1997, 2000). The 
political reticence evident in Table 3 may be another 
implication of these perceived failures in representa- 
tion. To African Americans, black members of Con- 
gress appear to hold policy preferences consistent with 
their own, although not necessarily more consistent 
than the priorities of liberal white Democrats. This 
ideological and policy congruence encourages particu- 
larized contact and may account for the more intense 
ballot box activity in Atlanta, St. Louis, Memphis, and 
Philadelphia. The fact that African Americans are 
inclined to see similarities between black Democrats 
and more liberal white Democrats also may help 
explain the inconsistent pattern of mobilization and 
restraint; in the absence of a categorical difference, 
there is no categorical effect. Future research should 
explicitly investigate the link between participation and 
perceptions of responsiveness. 

Finally, it is worthwhile to consider how these find- 
ings compare with earlier research on minority empow- 
erment in American cities. Recall that areas of high 
black empowerment have greater participation by Af- 
rican Americans; there is more limited evidence of 
white demobilization under black mayors. As com- 
pared to local officeholding, the influence of black 
congressional representation differs more in degree 
than in kind (limited evidence of black mobilization, 
consistent pattern of white demobilization). These 
parallels are interesting for two reasons. First, one 
might expect black congressional officeholding to have 
little or no significant effect on political participation 
because of the nature of the office. Unlike black 
mayors, whose activities can affect everything from 
neighborhood police patrols to the frequency of trash 
collection, members of Congress do not have the 
executive authority to make decisions that directly 
affect the daily lives of constituents. Despite this more 
limited relevance, there still seems to be an influence 
on political participation. Second, the more consistent 
pattern of white demobilization observed in this re- 
search suggests that earlier findings may have been 
confounded by “white flight from cities to suburbs.” To 
the extent that this phenomenon precipitates or comes 
on the heels of black empowerment in urban areas, the 
absence of pronounced participation effects tells us 
more about whites who choose to remain in the city 
than it does about white responses in general. The 
more fluid, less stable nature of congressional district 
boundaries makes white flight a less feasible alterna- 
tive; as a result, one would expect to see a more 
pronounced relationship between officeholding and 
participation. 


CONCLUSION 

In districts in which African Americans enjoy political 
prominence, white constituents are more likely to 
remain on the margins of the electoral process. Black 
congressional incumbents routinely experience white 
turnout rates that are 5-18 points lower than at polling 
places elsewhere in the state. This consistent pattern of 
white demobilization is not offset by an equally consis- 
tent pattern of black mobilization. The optimism of 
some who champion minority representation (and, by 
extension, the districting mechanism that ensures it) as 
a way to increase black voter participation may be 
misplaced. Only occasionally is there greater political 
involvement among African Americans represented in 
Congress by an African American. Those black office- 
holders who do succeed in making voters out of 
previously unengaged minority constituents may expe- 
rience black turnout that is 6-26 percentage points 
higher than rates in other districts. But, more often 
than not, African Americans represented by a black 
member of Congress display the same patterns of 
behavior as their counterparts in other districts. In 
short, black electoral success can have a measurable 
but strikingly asymmetric effect on political behavior, 
both engaging (some black) and disengaging (many 
white) constituents in the electoral process. 

This research bridges the literatures on political 
participation and minority congressional representa- 
tion and expands our understanding of both. The 
political significance of black congressional represen- 
tation cannot be reduced strictly to measures of policy 
activism or constituency service, and it cannot be 
understood by singular attention to its consequences 
for black Americans. To a great extent, the behavior of 
white constituents truly distinguishes the political dy- 
namics of black-represented districts. The findings of 
this study should alert us to the significant role that 
minority representation has played in compromising 
the appeal of politics for many white Americans, while 
fostering a more dynamic political life for only some 
African Americans. (This dichotomy puts into doubt 
any hope that black electoral success might lead to an 
attenuation in racial conflict.) If we look beyond indi- 
vidual resources, we realize that political participation 
today reflects the tangible successes of the post–civil 
rights era. 


APPENDIX A: DISTRIBUTION OF 
PRECINCTS AND MINOR CIVIL DIVISIONS 
IN THE 1990 DATA 


Table A-1 summarizes the distribution of precincts across 
MCDs, as well as the distribution of MCDs across congres- 
sional districts, for the subset of states whose November 1990 
elections are analyzed. (For the November 1994 elections, all 
the data are available at the precinct level.) In most cases, 
MCDs consist of no more than ten precincts. In Michigan 
and Georgia, 90% of MCDs contain fewer than five precincts. 
Two important exceptions should be noted. All the precincts 
in Pennsylvania’s First, Second, and Third congressional 
districts fall into one MCD (Philadelphia). Similarly, 
all the precincts in Michigan’s First and Thirteenth congres- 
sional districts fall into one MCD (Detroit). As a result, there 
is no variation in the MCD-level variables (e.g., education, 
income, homeownership) among the precincts in these dis- 
tricts. This has significant implications for the subsequent 
analysis, as is discussed in the text. 


APPENDIX TABLE A-1. 
APPENDIX B: EI MODEL SPECIFICATION 


Classifying by Race and Hispanic Origin 


Latinos are included in the EI analysis on the basis of their 
racial identification. All respondents to the 1990 Census of 
Population and Housing were required to indicate an ethnic 
origin (Hispanic, Not Hispanic) as well as a racial identification 
(white, black, American Indian, Asian, other). Nationwide, 
52% of people of Hispanic origin self-identified as 
white. Latinos who racially identified themselves as white in 
the 1990 Census are included in the “white” racial category in 
this analysis. For most of the states and congressional dis- 
tricts covered here, Latinos do not constitute a large segment 
of the population. In 79 of the 102 districts, they account for 
3% or less. The districts with larger numbers of Latinos 
(10-18%) are concentrated in the Northeast; only one of 
these “outlier” districts was represented by a black member 
of Congress, Donald Payne of Newark, New Jersey. Twelve 
percent of Payne’s constituents were of Hispanic origin. (New 
Jersey was also the home of a majority-minority district that 
was 41% Latino, represented by Robert Menendez.) The 
only southern black incumbent with an above-average pro- 
portion of Latino constituents (6%) was Albert Wynn. 


Sensitivity of Findings to EI Model 
Specification 


In the first-stage EI analysis, I explicitly modeled the dependence 
of β^b (proportion of black voting age population who 
vote) and β^w (proportion of white voting age population who 
vote) on a set of demographic and socioeconomic factors 
found by previous research to affect levels of political participation. 
This approach sidesteps the well-documented problems 
(Cho 1998; Cho and Gaines 2001; Freedman et al. 1998; 
Freedman et al. 1999; Herron and Shotts 2000a) associated 
with the basic application of King’s EI technique (i.e., the 
problematic assumption of no aggregation bias). Although 
King (1997, 1999b) maintains that EI is robust to violations of 
the aggregation bias assumption—in part, as a result of 
leveraging information contained in the deterministic 
bounds—other researchers argue theoretically and demonstrate 
empirically that in the presence of aggregation bias 
King’s (1997) technique produces biased and statistically 
inconsistent estimates of the quantities of interest (Cho 1998; 
Cho and Gaines 2001; Freedman et al. 1998; and Freedman 
et al. 1999). In addition, Herron and Shotts (2000a) argue 
that researchers risk logical inconsistency by assuming away 
aggregation bias in first-stage EI, only to include covariates in 
a second-stage regression. 

[Appendix Table A-1. Distribution of Precincts and Minor Civil Divisions. Columns: Number of Precincts per MCD; Number of MCDs per District; Median; 90th Percentile.] 

Correctly modeling the dependence of β^b and β^w on a set 
of covariates presents its own challenges. One important 
objection raised to King’s (1997) technique concerns the 
apparent variability in the parameter estimates across differ- 
ent model specifications and the lack of an objective test for 
whether one specification is superior to another (Cho and 
Gaines 2001). Concerned about the practical implications of 
this issue for this research, I tested seven different model 
specifications and generated seven sets of β^b and β^w estimates 
for each of the roughly 28,000 precincts in the eight 
states. The models were as follows: Model 1 (basic EI with no 
covariates); Model 2 (black population density covariate); 
Model 3 (income, homeownership, and education covari- 
ates); Model 4 (income, homeownership, education, and 
black population density covariates). In models 2 through 4, 
the covariates are coded as continuous variables. In three 
additional models, I include the same covariate combinations 
as models 2 through 4 but dichotomize these covariates at 
their district medians (e.g., coded 0 below the median, 1 at or 
above the median): Model 5 (black population density covariate, 
dichotomized); Model 6 (income, homeownership, and 
education covariates, dichotomized); Model 7 (income, 
homeownership, education, and black population density 
covariates, dichotomized). This more flexible functional form 
was meant to accommodate the House districts in which 
precinct-level black population density, X, is generally low 
and does not cover the unit interval. In those cases, the 
linearity assumption implied by the use of continuous covari- 
ates—especially black population density itself—forces EI to 
extrapolate to portions of the parameter space for which 
there are few observations (King 1997). I was concerned that 
the resulting estimates would be too sensitive to outliers. 
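
The dichotomized coding used in Models 5 through 7 can be illustrated with a short sketch. This is not the code used in the analysis; it is a minimal Python illustration, and the field names (`district`, `black_density`) and the sample values are hypothetical. It assumes only what the text states: each covariate is coded 0 below its district median and 1 at or above it.

```python
from collections import defaultdict
from statistics import median


def dichotomize_at_district_median(rows, covariate):
    """Code a covariate 0 below its district median, 1 at or above it."""
    values_by_district = defaultdict(list)
    for row in rows:
        values_by_district[row["district"]].append(row[covariate])
    # One median per district, computed over that district's precincts only.
    medians = {d: median(v) for d, v in values_by_district.items()}
    return [1 if row[covariate] >= medians[row["district"]] else 0
            for row in rows]


# Hypothetical precinct records (illustrative values, not data from the study).
precincts = [
    {"district": "A", "black_density": 0.02},
    {"district": "A", "black_density": 0.05},
    {"district": "A", "black_density": 0.40},
    {"district": "B", "black_density": 0.60},
    {"district": "B", "black_density": 0.90},
]
codes = dichotomize_at_district_median(precincts, "black_density")
```

Because the cut point is relative to each district, a precinct that is 40% black is coded 1 in a district where density is generally low, which is the sense in which the dichotomized form avoids extrapolating along a linear scale that the district's precincts do not cover.
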

For β^w, the estimates across specifications were remarkably 
stable, showing little evidence of the variability Cho and 
Gaines (2001) note in their critique of Burden and Kimball 
(1998). Most of the correlation coefficients fell between .90 
and .99, and they rarely dipped below .80, which is strong 
consistency for a simulation-based estimator. The estimation 
benefits from the high degree of residential segregation 
characteristic of most American cities (i.e., in the precincts 
with any whites there are many whites and thus a lot of 
information) as well as from the overall racial balance (i.e., 
whites account for 72% to 90.3% of the population for the 
states in my sample, so predominantly white precincts 
abound). The result of all this information is very tight 
bounds on β^w. The tight deterministic bounds on white 
turnout for most of the precincts help to avert extreme 
sensitivity to slight changes in model specification. The 
weakest correlations across alternative models can be found 
in the precincts with few whites. In these cases, not only are 
the estimates sensitive to model specification, but also the 
inclusion of covariates greatly increases the standard errors 
on these estimates. That said, in light of the overall stability 
in the parameter estimates, I felt comfortable choosing the 
EI model most consistent with the findings from earlier 
research on political participation (i.e., Model 7). For the 
record, however, all the models yielded the same conclusions. 
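
The role of the deterministic bounds can be made concrete with a small sketch. It assumes the standard accounting identity behind ecological inference, T = X·β^b + (1 − X)·β^w, where T is precinct turnout and X is the black share of the voting age population; the function names and the example values are hypothetical and are not taken from the analysis.

```python
def _group_bounds(turnout, share, other_share):
    """Bounds on one group's turnout rate implied by T = X*b + (1 - X)*w."""
    if share == 0:
        # The group is absent from the precinct, so the data say nothing about it.
        return (0.0, 1.0)
    lower = max(0.0, (turnout - other_share) / share)
    upper = min(1.0, turnout / share)
    return (lower, upper)


def turnout_bounds(turnout, black_share):
    """Return ((black_lo, black_hi), (white_lo, white_hi)) for one precinct."""
    if not (0.0 <= turnout <= 1.0 and 0.0 <= black_share <= 1.0):
        raise ValueError("turnout and black_share must lie in [0, 1]")
    white_share = 1.0 - black_share
    black = _group_bounds(turnout, black_share, white_share)
    white = _group_bounds(turnout, white_share, black_share)
    return black, white


# Hypothetical precinct: 5% black, 40% overall turnout.
black, white = turnout_bounds(0.40, 0.05)
```

In this hypothetical precinct the bounds on white turnout span only about five percentage points, while the bounds on black turnout are the full unit interval. That contrast is the mechanism described above: predominantly white precincts carry a great deal of information about white turnout and almost none about black turnout.
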

For β^b, the estimates across model specifications were 
considerably less stable. In some districts, correlations dipped 
below .10. Ecological inference on black turnout benefits 
from the high incidence of residential segregation, but it 
suffers from the simple fact that there are very few precincts 
with any appreciable numbers of black residents (e.g., in New 
Jersey, the median precinct-level black population density is 
2%). When I restrict the EI analysis to those precincts with at 
least 15% black population, all the correlation coefficients 
rise to levels similar to those observed for β^w. Only in the 
districts with majority-minority populations are there signif- 
icant numbers of precincts with a large black population and, 
thus, plenty of information with which to generate robust 
ecological inferences. For precincts elsewhere, not only are 
the estimates highly variable across specifications, but also 
the inclusion of covariates severely strains the limited data 
(i.e., the standard errors are substantial). 
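
The stability check described above, correlating precinct-level estimates from two specifications with and without a minimum black population share, might be sketched as follows. The function names and all numbers are hypothetical illustrations, not estimates from the study; the only element taken from the text is the 15% threshold.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)


def spec_correlation(est_a, est_b, black_share, min_share=0.0):
    """Correlate two specifications over precincts with black_share >= min_share."""
    keep = [i for i, x in enumerate(black_share) if x >= min_share]
    return pearson([est_a[i] for i in keep], [est_b[i] for i in keep])


# Hypothetical black-turnout estimates for six precincts under two specifications.
black_share = [0.01, 0.02, 0.03, 0.20, 0.50, 0.80]
model_a = [0.10, 0.60, 0.30, 0.35, 0.45, 0.55]
model_b = [0.70, 0.20, 0.50, 0.36, 0.44, 0.56]

r_all = spec_correlation(model_a, model_b, black_share)
r_restricted = spec_correlation(model_a, model_b, black_share, min_share=0.15)
```

In this constructed example the two specifications disagree sharply in the precincts with almost no black residents and agree closely elsewhere, so the correlation rises once the low-density precincts are dropped.
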

In the final analysis, there were five states (Georgia, 
Missouri, New Jersey, Pennsylvania, and Tennessee) whose 
substantive results and conclusions were unaffected by 
changes in EI model specifications. In the other three states 
(Maryland, Michigan, and Virginia), results varied across 
specifications. The measured effects of black representation 
estimated in the second-stage regressions for these three 
states varied between no statistically significant effect and a 
significant positive effect. As with β^w, I adopted Model 7, which 
is most in line with the literature on participation. As can be 
seen in Table 4, for the three specification-sensitive states, 
Model 7 yields statistically insignificant results on the relationship 
between β^b and black representation. Because these 
results do not hold across multiple specifications, however, it 
is not possible to draw definitive conclusions about black 
political behavior in those three states. 

Americans to office has not greatly improved the well-being of the black community. This study 
focuses on the white community, however, and finds that black leadership can have a profound 
effect. Under black mayors there is positive change in the white vote and in the racial sentiments expressed 
by members of the white electorate. Although white Republicans seem largely immune to the effects of black 
incumbency, for Democrats and independents an experience with a black mayoralty tends to decrease racial 
tension, increase racial sympathy, and increase support of black leadership. 


Two questions have commanded much attention 
from scholars interested in black political representation 
in the United States. The first is 
whether white Americans will vote for black candidates 
(and why or why not). The second is whether African 
Americans, once elected, can improve the economic 
standing of blacks. Answers to both questions have 
generally been discouraging. Given the choice, the vast 
majority of white Americans will vote for a white 
candidate, even if it means switching parties. Despite 
the success of a number of highly visible black candi- 
dates, such as former governor Douglas Wilder, former 
mayor Tom Bradley, and Congressman J.C. Watts, and 
despite the increasingly popular belief that race no 
longer plays a major role in the voting booth (Citrin, 
Green, and Sears 1990; Swain 1995; Thernstrom and 
Thernstrom 1997), 70% to 90% of white voters will 
choose the white candidate in a typical biracial contest 
(Henry 1987; Lieske and Hillard 1984; Loewen 1990; 
McCrary 1990). Race still matters when white voters 
are faced with a black office-seeker. 

Even when blacks are elected, their leadership does 
not greatly improve the economic well-being of African 
Americans at the city, regional, or state level. Studies 
suggest that black incumbents can modestly change 
local hiring policies and spending priorities (Eisinger 
1982; Mladenka 1989), but these and other changes are 
not dramatic. The overall substantive effect on most 
members of the black community is negligible (Brown- 
ing, Marshall, and Tabb 1990; Reed 1988; Singh 1998). 
According to Manning Marable, the election of Afri- 
can Americans “can be viewed as a psychological 
triumph, but they represent no qualitative resolution to 
the crises of black poverty, educational inequality, 
crime, and unemployment” (quoted in Perry 1996, 6). 

1 Although the vast majority of whites say in surveys that they will 
vote for a qualified black candidate (Schuman et al. 1997), experimental 
studies suggest that a candidate’s race can and does affect the 
white vote in subtle and important ways (Reeves 1997; Sigelman et 
al. 1995; Terkildsen 1993). 

Some scholars interpret these results as proof of the 
ineffectiveness of black leaders (Browning, Marshall, 
and Tabb 1997; Reed 1988; Smith 1996). This conclu- 
sion, however, ignores the potentially positive effect of 
black representation on the white community in terms 
of racial attitudes, tolerance, and future voting behavior.2 
That is the focus of this study. The election of 
blacks to public office may not improve conditions for 
African Americans to the degree many may wish, but it 
may have less visible but equally consequential effects 
on white voters: educating them about black leaders, 
reducing white fears regarding the types of policies 
these leaders will enact, and improving race relations 
as a result. Does experience with black leadership 
change the racial attitudes, policy preferences, and 
voting behavior of white Americans?3 


THREE HYPOTHESES ABOUT WHITE 
POLITICAL BEHAVIOR 


The Information Hypothesis 


I argue that black political representation should sig- 
nificantly improve white attitudes toward African 
Americans and increase the likelihood that whites will 
vote for the black incumbent in the next election (even 
if s/he runs against a white challenger) because they 
have learned critical information regarding the degree 
to which black leadership will affect their economic 
well-being. When a black challenger runs for office, 
many whites fear that s/he will favor the black commu- 
nity over the white by redistributing income, encour- 
aging integration, and generally channeling resources 
toward African Americans. As research shows, this 
does not happen. For the vast majority of whites, their 
world under black leadership is almost identical to 
their world under white leadership. Once blacks have 
the opportunity to prove that their election does not 
harm white interests, white fear should decline, opposition 
to black candidates should diminish, and white 
attitudes toward blacks and black leadership should 
improve. Black representation, therefore, serves an 
enormously important although difficult to observe 
informational role. The greater the number of African 
Americans elected to positions of power, the more 
whites will learn about the effects of their leadership, 
the less they will fear it, and the more likely they will be 
to vote for black candidates in the future. Thus, the 
limited effect of black representation on the economic 
well-being of the black and the white community 
should have a profound influence on white racial 
attitudes and behavior toward black incumbents. 

2 It also ignores the symbolic significance of black representation for 
the black community (Bobo and Gilliam 1990). 
3 Few systematic empirical studies directly address these questions. 
The most common view, based largely on the end of black leadership 
in New York, Chicago, and Los Angeles, is that whites continue to 
respond negatively to black incumbents (Abney and Hutcheson 1981; 
Browning, Marshall, and Tabb 1997; 1993; see also Gay 1999 
on congressional incumbents). Accounts from a few cities, however, 
contradict this negative view (Eisinger 1980; Pettigrew 1976; Watson 
1984). 

Why do white Americans fear black leadership? I 
argue that there are two reasons: limited first-hand 
experience with African Americans in positions of 
authority and a long history of racial conflict. Normally, 
voters have less information about a challenger than an 
incumbent, but when the challenger is black, informa- 
tion is even more sparse (Popkin 1995). Most whites 
have little first-hand experience with black representa- 
tion and do not know what the consequences are likely 
to be. In short, uncertainty is a central feature of black 
challenger elections. 

Because white voters have little or no personal 
experience with black leadership, they often rely on 
racial stereotypes for information about how African 
Americans are likely to behave once elected (Conover 
and Feldman 1989). This inevitably hurts black candi- 
dates’ chances for white support. When whites know 
little about a candidate other than race, they rate 
blacks worse than whites on nineteen out of twenty 
leadership and personality characteristics; they view a 
black candidate as less trustworthy, less able to get 
things done, and even less intelligent (Williams 1990). 

These stereotypes and the fears they produce are 
manifested in the white vote. When blacks first run for 
an office, the election tends to be more competitive 
than usual, turnout often exceeds normal levels, and 
partisan identification is often ignored (Lublin and 
Tate 1995; Watson 1984). The result is that most black 
challengers face a white population that votes in almost 
overwhelming numbers against them. 

Once an African American is elected, however, 
whites obtain important information about the effect of 
black representation on their lives. They can now base 
their assessments on an incumbent’s record rather than 
on stereotypes, exaggerated fears, or the incendiary 
predictions of white opponents. When whites do not 
lose their jobs, when blacks do not move into white 
neighborhoods in large numbers, and when black crime 
does not proliferate, voters learn their fears were 
groundless. A white resident of Los Angeles put it this 
way: “A lot of people were very suspicious and fearful 
before Bradley got in. But they never say anything now. 
I am sure they have changed their opinions” (U.S. 
News and World Report 1975).4 





4In some cases the black Incumbent may confirm white fears. 
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The election of an African American is 
important in the minds of white voters because it marks 
one of the first times that blacks have authority or 
control over the white community. When blacks have 
(or are perceived to have) the power to inflict harm on 
whites and choose not to do so, whites learn that black 
control does not mean their downfall. In many cases, 
the contrast between their fears and the reality is so 
stark that whites are forced to reevaluate blacks and 
black leadership. As a result, whites may vote for black 
incumbents even if they did not support them in the 
first election, 


The Racial Prejudice Hypothesis: 
Enduring Raclal Stereotypes 


It can be argued that black representation, no matter 
how positive its effect on the white community, will 
have no influence on white attitudes toward African 
Americans or white behavior toward black incumbents. 
If racial prejudice is the primary factor bebind white 
opposition to black empowerment, as many suggest, 
then there is little reason to suspect that white views or 
behavior will change (Adorno et al. 1950; Allport 1954; 
Hurwitz and Peffley 1998). Racial stereotypes are 
simply too deeply ingrained (Devine 1989; Fazio et al. 
1995) and too stable (Fiske 1998; Rothbart and John 
1993) to be swayed by a single black politician who wins 
public office. Even if the words and actions of black 
incumbents do not fit racial stereotypes, whites can use 
an array of tactics to try to maintain their stereotypes 
and create cognitive consistency (Hamilton 1981; Mac- 
rae, Hewstone, and Griffith 1993). They can ignore 
events that disconfirm their current views or discount 
contradictory evidence as an exception to the rule 
(Macrae, Hewstone, and Griffith 1993; Weber and 
Crocker 1983). Given the intractability of racial stereo- 
types and the enduring nature of racial prejudice, it is 
possible that a single black politician cannot change the 
way whites think about race or the way they vote in 
black incumbent elections. 


The Racial Threat Hypothesis: White 
Backlash 


Black leadership may have another and more alarming 
effect on white attitudes and behavior. If, as research- 
ers from Blumer (1958) to Bobo (1983) have suggested, 
white Americans identify as a group and feel threat- 
ened whenever blacks are in a position to endanger the 
wealth and political power of the white community, 
then whites are likely to respond negatively to black 
incumbency. Black electoral victories can be seen as a 
direct threat, disrupting the traditional balance of 
racial power. The election of African Americans to 
important leadership positions should, in this case, 





Marion Barry’s tenure as mayor of Washington, DC, probably did
not improve white views of black leadership. As long as the black
incumbent does significantly better than whites fear, however, views 
of black leadership should improve. 
heighten racial tension and result in widespread white
backlash. 

This prediction is supported by an array of white
responses to black empowerment in the past. An
increasing black presence in the local community has 
been linked to greater levels of racial conservatism 
(Glaser 1994), an increased sense of threat among 
whites (Fossett and Kiecolt 1989; Giles and Evans 
1986), and increased white-on-black violence (Alt 
1994). Similarly, challenges to white power and author- 
ity are associated with higher white voter turnout, 
widespread defection of whites from the Democratic 
Party (Lublin and Tate 1995; Watson 1984), and 
greater Republican Party identification among white 
voters (Huckfeldt and Kohfeld 1989).5 Given this fre- 
quent pattern of black empowerment and white back- 
lash, one might expect black representation to increase 
white racial intolerance and opposition to black incum- 
bents. 


DATA AND MEASURES 


To determine which of these hypotheses is accurate, I 
look at changes in white racial attitudes and policy 
preferences and white voting behavior before and after 
the transition from a white to a black mayoralty. 
Although each hypothesis should apply to any level of 
political leadership, I focus on the mayoralty for sev- 
eral reasons. First, the information hypothesis is most 
likely to be true at the mayoral level because most 
people know who their mayor is (Cole 1976), which is 
a necessary condition for it to hold. Furthermore, 
mayors often act unilaterally, whereas legislators must 
obtain majority support, which means that citizens can 
track a mayor’s performance more directly. If the 
streets are cleaned, the mayor has done her job. If 
garbage piles up, the mayor can be blamed.6 A black
mayor, therefore, provides more information to white 
residents than would a black legislator and should have 
a greater effect on white attitudes. Second, tests of 
these hypotheses require a sufficient number of cases of 
black and white mayors in cities with different racial 
makeups to allow for empirical analysis. I chose the 
mayoralty level because of the large number of black
mayors across the country. In 1992 there were 338, 37 
of whom served in cities with a population of more 
than 50,000 (Joint Center for Political Studies 1994). 


Data 


To assess changes in the racial attitudes and policy 
preferences of whites under black mayors, I use data 
from a pooled sample of the American National Elec- 
tion Study (ANES) from 1984 to 1992. The ANES 
contains an array of questions gauging white racial 


5 For exceptions to this pattern, see Green, Strolovitch, and Wong 
1998 and Kinder and Mendelberg 1995. 

6 A poll assessing views of mayoral power found that even in
Washington, DC, a city with limited local autonomy, a large majority
of residents believe the mayor “can control” or “exact influence” on
almost every issue of concern to the city (Washington Post, June 11, 
1978, A1). 


attitudes and political orientation. By pooling samples 
from 1984 to 1992, I obtained an ample number of 
white responses from a wide variety of cities with a 
black or white mayor.7 Details on the sampling, survey
instruments, and other methods can be found in Miller 
and NES (1994). Data on the race of mayors and 
council members are from Black Elected Officials: A 
National Roster (Joint Center for Political Studies 
1983-94). Other data on city characteristics, such as 
racial demographics and median income, are from the 
relevant Census publications (U.S. Bureau of the Cen- 
sus 1990, 1964, 1994). 

I also present data on the outcome of every black 
incumbent reelection bid in every city larger than 
50,000 for 1965 to 1999, which I compiled from Black 
Elected Officials: A National Roster (Joint Center for 
Political Studies 1965-97) and local newspaper reports. 


Measures of White Racial Attitudes 


I look for change in white views in three broad arenas 
of race relations: (1) perceptions of change in the 
status of blacks, (2) general feelings toward blacks, and 
(3) perceptions of racial group conflict. I chose these 
three because each addresses a critical aspect of the 
information hypothesis. 


Status of Blacks. For the information hypothesis to be
accurate, white residents must perceive the limited 
negative effect of black incumbents. Whites have to 
learn that racial change under a black mayor is not 
significantly faster or more threatening than racial 
change under previous white mayors. To measure the 
variable Status of Blacks, I examined white responses to 
the following question: “How much real change do you
think there has been in the position of black
people in the past few years?” (1 = a lot, .5 = some,
0 = not much at all.) The more whites feel that blacks
are doing better or gaining on whites, the higher they 
should score on this measure. 


General Feelings. If whites are not discounting the 
words and actions of black incumbents as isolated or 
anomalous events but are instead generalizing about 
the larger black community from their experiences, 
then this should be reflected in their overall views of 
the black community. When whites perceive that Afri- 
can Americans do not want to harm the white commu- 
nity, their feelings toward blacks should improve. In 
order to measure general feelings toward blacks, I 
examined white responses to all ANES questions that 
tapped views about the black community but did not 
relate to specific policy debates or ask directly about 
black-white conflict. There were five items: a feeling 
thermometer and four other statements. 

7 From this pooled sample I drew only respondents from cities or
primary areas larger than 25,000. Among this subsample, 1,605 white
respondents lived in 18 cities with black mayors. The racial compo-
sition of these 18 cities ranges from 10% to 76% black. Across the
whole pooled sample, 6,543 respondents from 70 cities are repre-
sented.

The most general measure was a feeling thermometer,
that is, how warmly or favorably whites felt toward
blacks as a group, on a scale from 0 to 100. I reversed
the scale to get a measure of Antiblack Affect. Since
different respondents can assign very different meaning
to the same value on a scale, I controlled for each
respondent’s feelings toward whites on a similar reversed
feeling thermometer.8 The result is a measure of
how negatively respondents felt toward blacks as a
group, relative to their feelings toward whites.9
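The reversal described above can be sketched as follows, assuming thermometer ratings on the 0 to 100 scale. The function names and the example ratings are hypothetical. In the models the reversed rating of whites enters as a control variable; the simple difference computed here is only one illustrative way to express a relative rating, not the estimation procedure itself.

```python
def reverse_thermometer(rating):
    """Reverse a 0-100 feeling thermometer so higher values mean colder feelings."""
    if not 0 <= rating <= 100:
        raise ValueError("thermometer ratings run from 0 to 100")
    return 100 - rating


def relative_antiblack_affect(therm_blacks, therm_whites):
    """Reversed rating of blacks minus reversed rating of whites (illustrative only)."""
    return reverse_thermometer(therm_blacks) - reverse_thermometer(therm_whites)


# Hypothetical respondent who rates blacks at 60 and whites at 75.
print(reverse_thermometer(60))            # 40
print(relative_antiblack_affect(60, 75))  # 15
```

A positive value means the respondent rates blacks more coldly than whites; zero means the two groups receive the same rating.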

There were four other general race questions in the 
surveys from which the pooled sample was drawn. 


“It’s really a matter of some people not trying hard
enough; if blacks would only try harder they could be just
as well off as whites.”

“Irish, Italian, Jewish and many other minorities over-
come prejudice and worked their way up. Blacks should do 
the same without special favors.” 

“Over the past few years, blacks have gotten less than 
they deserve.” 

“Generations of slavery and discrimination have created 
conditions that make it difficult for blacks to work their 
way out of the lower class.” 


In each case, respondents were asked their degree of 
agreement with the statement (disagree strongly, dis- 
agree somewhat, neither agree nor disagree, agree 
somewhat, and agree strongly). 

On the surface, each question addresses a different 
issue (e.g., whether blacks work hard, whether blacks 
are distinct from other minorities), but all focus on a 
central element of race relations: the extent to which 
blacks face barriers in American society. The four 
questions are linked not only conceptually but also 
empirically. Because answers are highly correlated,
with interitem correlation ranging from a low of .39 to
a high of .59, I was able to create a Racial Resentment
scale.10 Responses to each question were ordered from
least (5) to most (1) sympathetic to blacks.11 Individual
responses were then added together, and the scale was 
normalized to a 0-1 range. The reliability of the scale 


8 Some respondents tend to be high raters and others low raters, so
I also standardized responses by controlling for each respondent's
mean score across an array of six feeling thermometer measures. The
following results are almost identical with or without feelings toward
whites controlled and with or without standardization by the mean. The
scale has a mean response of .55 (std. dev. = .10).

9 This scale can also be viewed as a measure of prejudice (Allport
1954; Hurwitz and Peffley 1998). The more whites prefer whites as a 
group over blacks as a group, the more they can be seen as 
prejudiced. 

10 It is almost identical to a racial resentment scale developed by
Kinder and Sanders (1996). 

11 Respondents who agreed with the last two statements were coded
as racially sympathetic. Respondents who agreed with the first two 
statements were coded as unsympathetic. 

12 I tried two alternate tests to ensure the robustness of results. First,
I repeated the following analysis with each individual question rather
than the whole scale. Although statistical significance usually de-
clined, there were few substantive changes to the results. Second, I
used maximum likelihood estimation confirmatory factor analysis to
develop a latent factor representing the main theme of these four
questions. When I substituted the latent factor into the following
analysis, the results were almost identical. Black leadership improves
white views of the black community, no matter how those views are
measured.
is high, with a Cronbach’s alpha of .77. If black
leadership does change white views about African 
Americans as a group, then this should be reflected on 
both the antiblack affect and the racial resentment 
scale.
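The scale construction described above can be sketched as follows. This is a minimal illustration, assuming agreement is coded 1 (disagree strongly) to 5 (agree strongly) and that the last two statements are reverse-coded so that 5 always marks the least sympathetic answer. The function names and the toy responses are hypothetical, and the alpha formula is the standard textbook one rather than anything specific to the ANES files.

```python
from statistics import variance

# Agreeing with statements 1-2 is unsympathetic; agreeing with statements 3-4
# is sympathetic, so statements 3-4 are reverse-coded before summing.
REVERSED_ITEMS = (2, 3)  # zero-based positions of the last two statements


def code_items(agreement):
    """Recode four 1-5 agreement answers so that 5 = least sympathetic on every item."""
    if len(agreement) != 4 or any(a not in (1, 2, 3, 4, 5) for a in agreement):
        raise ValueError("expected four responses coded 1-5")
    return [6 - a if i in REVERSED_ITEMS else a for i, a in enumerate(agreement)]


def resentment_score(agreement):
    """Sum the recoded items (range 4-20) and rescale to 0-1 (1 = most resentful)."""
    return (sum(code_items(agreement)) - 4) / 16


def cronbach_alpha(rows):
    """Cronbach's alpha for respondents' recoded item scores (one list per respondent)."""
    k = len(rows[0])
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


print(resentment_score([5, 5, 1, 1]))  # 1.0
print(resentment_score([3, 3, 3, 3]))  # 0.5
```

`cronbach_alpha` should be applied to the recoded items (the output of `code_items`), since reliability is only meaningful once all items point in the same direction.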


Racial Conflict. I also looked for change in percep- 
tions of Racial Group Conflict. If black incumbents do 
not target the white community but instead serve the 
interests of many whites, then it is likely that white 
perceptions of racial conflict will decline. As a measure 
of this variable, I examined responses to the following 
question: 
Some say that the civil rights people have been trying to 
push too fast. Others feel they haven’t pushed fast enough. 
How about you: Do you think that civil rights leaders are 
trying to push too fast, are going too slowly, or are they 
moving about the right speed? (1 = too fast, .5 = about the
right speed, 0 = too slowly.)14


The Model 


To determine which of the three hypotheses of white 
political behavior (information, racial prejudice, or 
racial threat) is most accurate, I compare the attitudes 
of a representative sample of white respondents in 
cities with a white mayor to the attitudes of a similar 
sample of white respondents during the tenure of a
black mayor, and I also look at the effect of the number 
of years during which black mayors served. 

To ensure that any differences between black-run 
and white-run cities are a function of black leadership, 
I control for demographic and socioeconomic factors 
that are known or suspected to affect white racial
attitudes. The model includes the following personal
socioeconomic characteristics as controls: age, education,
income, gender, employment status, home
ownership, and the number of years the respondent has
lived in the city. The contextual variables are: percent- 
age of the city population that is black, level of 
urbanism, year of the interview, and residence in the 
South. All independent variables are coded 0 to 1 for 
ease of interpretation. I also include in each model a 
dummy variable for cities that are within twelve months 
after the transition to the first black mayor. I do so for 
two reasons. First, because there is usually heightened 
racial tension during the transition year, white respon- 
dents are likely to be less rather than more racially 
tolerant in the period surrounding the election. Sec- 
ond, whites have had little time to learn about the 
consequences of black leadership. 
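As an illustration of the coding described above, the sketch below rescales a variable to the 0 to 1 range and builds the transition-year dummy. The article states that independent variables are coded 0 to 1 but does not say how; min-max rescaling is an assumption here, and the function names, the months-since-transition input, and the example ages are all hypothetical.

```python
def rescale_01(values):
    """Min-max rescale a list of numbers to the 0-1 range (assumed coding scheme)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("cannot rescale a constant variable")
    return [(v - lo) / (hi - lo) for v in values]


def first_year_dummy(months_since_transition):
    """1 if the interview falls within twelve months after the first black mayor
    took office, else 0. None marks a city that has had no such transition."""
    if months_since_transition is None:
        return 0
    return 1 if 0 <= months_since_transition <= 12 else 0


ages = [20, 40, 60, 100]  # hypothetical respondent ages
print(rescale_01(ages))   # [0.0, 0.25, 0.5, 1.0]
print(first_year_dummy(6), first_year_dummy(30))  # 1 0
```

With every predictor on a 0 to 1 scale, each coefficient can be read as the estimated difference between the lowest and highest observed values of that variable.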

13 The scale is centered on a value of .60, which indicates that most
white respondents were slightly more resentful than sympathetic on
these questions. Scores are distributed normally. Standard deviation
is .24.

14 Although a broader array of questions might be preferred, this
single item has real advantages. Bobo (1983) argues that this
question is a key indicator of the state of racial conflict and shows
that responses are significantly, if weakly, correlated with recent or
imminent changes in local racial policymaking.

A possible confounding factor is white flight after the
election of a black mayor. The rapid outmigration of a
large number of whites could clearly affect the remaining
mix of white racial attitudes under black incumbents.
Several factors suggest that this is not a significant
problem. First, the rapid outmigration of whites
was largely stemmed by the time these surveys were 
taken (Massey and Hajnal 1995). Between 1980 and 
1990 in the cities included in this survey, the proportion 
of blacks increased only 2.3% in cities with black 
mayors and a relatively similar rate of 1.4% in cities 
without black mayors. Thus, attitude changes between 
1982 and 1992 are not likely to be due primarily to 
white flight from black-run cities. Second, studies sug- 
gest that because of high moving costs, whites who 
leave cities tend to be younger and better educated 
(South and Deane 1993). The poor, older, and less 
educated whites who remain are more likely to be 
racially intolerant (Bobo 1983; Rieder 1985). Third, 
although race is often very important for interneigh- 
borhood moves, it is less relevant for moves into and 
out of metropolitan areas (Long 1988). Fourth, I 
(Hajnal 1998) show that increases in aggregate white
support for incumbent black mayors in 23 cities are not 
correlated with changes in the racial demographics of 
the area. Finally, as the tables below indicate, length of 
residence in the community is generally not signifi- 
cantly related to respondents’ racial attitudes. 


RESULTS 


Black officeholders do matter. As demonstrated by 
Table 1, on two different measures, whites who live in 
cities governed by black mayors have significantly more 
positive racial attitudes than whites who live under 
white mayors. First, under black mayoral leadership, 
both antiblack affect and racial resentment are lower 
(p < .01). Second, as predicted by the information 
hypothesis, white perceptions of racial group conflict 









TABLE 1. The Effect of a Black Mayor on White Racial Attitudes

                              Status of Blacks  Antiblack Affect  Racial Resentment  Racial Group Conflict
                              (Ordered Logit)   (OLS)             (OLS)              (Ordered Logit)
Black mayor
  (1 = yes, 0 = no)           .04 (.12)         -.02 (.01)**      -.04 (.01)**       -.21 (.12)
Education                     -1.1 (.14)***     -.05 (.01)***     -.24 (.02)***      -1.5 (.16)***
Income                        .05 (.18)         -.00 (.01)        -.01 (.02)         .04 (.17)
Age                           1.7 (.20)***      .04 (.01)***      -.01 (.02)         1.2 (.21)***
Gender (1 = male)             .03 (.08)         .00 (.01)         .02 (.01)***       .28 (.08)***
Ideology (1 = liberal)        -1.0 (.20)***     -.06 (.01)***     -.27 (.02)***      -2.1 (.21)***
Party ID (1 = Democrat)       -.30 (.13)*       .01 (.01)         -.02 (.01)         -.55 (.14)***
Employment status
  (1 = unemployed)            .04 (.22)         -.00 (.01)        .01 (.02)          -.62 (.23)**
Years living in city          .23 (.12)         .01 (.01)         .03 (.01)*         .14 (.13)
Percentage black in city      .31 (.28)         .04 (.01)*        .13 (.03)***       .86 (.28)**
Level of urbanism             -.20 (.15)        .01 (.01)         -.07 (.02)***      .05 (.16)
South (1 = yes)               .55 (.10)***      .01 (.00)**       .03 (.01)*         .30 (.10)**
1986                          -.23 (.14)                                             -.24 (.15)
1988                          -.62 (.13)***     .02 (.01)**       .04 (.01)**        -.30 (.14)*
1990                          -.84 (.15)***     -.04 (.01)**                         -.24 (.16)
1992                          -.84 (.12)***     -.02 (.01)***     .01 (.01)          -.38 (.13)**
First year of black
  mayoralty (1 = yes)         .14 (.18)         .01 (.01)         .01 (.02)          -.13 (.19)
Constant                                        .59 (.01)***      .88 (.02)***
Intercept 1                   -3.0 (.23)***                                          -4.1 (.25)***
Intercept 2                   -.56 (.22)*                                            -.64 (.23)**
Adj. R²/pseudo R²             .14               .09               .21                .15
χ²                            349                                                    424
N                             2,855             2,914             2,481              2,597

Source: American National Election Study, Cumulative Data File, 1984-92.
Note: Figures are unstandardized coefficients with standard errors shown in parentheses. *p < .05, **p < .01, ***p < .001.
also appear to be lower under black mayors (although
the change is not quite significant). In short, white 
residents appear to learn something positive through 
their experiences with black leadership. The only area 
in which there is no apparent difference is in white 
perceptions of the pace of racial change, but that is 
exactly what one would expect from the information 
hypothesis. Whites should recognize that racial change 
under a black mayor is not significantly greater than 
under previous white mayors. Overall, the results in 
Table 1 strongly support the information hypothesis 
and largely contradict the racial stereotype and white 
backlash hypotheses.


Alternate Specifications 


Table 1 overlooks an important issue. Given the cross- 
sectional nature of the data, it is possible that the 
causal arrow is reversed. Cities with more tolerant 
white residents may be more likely to elect a black 
mayor. Although the available evidence seems to sug- 
gest that the presence of a black mayor depends much 
more on the size and resources of the black community 
than on the characteristics of the white community 
(Karnig and Welch 1980), I undertook several different 
tests to assess the link between a black mayoralty and 
white views. In each case, the results were the same: A 
black mayoralty results in less negative views of blacks 
and black leadership. The first test is a two-stage 
least-squares model, described in detail in Appendix A, 
which uses instrumental variables to address the ques- 
tion of causality. The overwhelming conclusion from 
this alternate test is that white views on race are 
significantly more positive under a black mayor than 
under a white mayor, even after considering the possi- 
bility of reverse causality. 
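The logic of a two-stage least-squares test can be sketched on toy data. This is not the Appendix A model, whose instruments and specification are not reproduced here. It is a minimal example with one endogenous regressor and one instrument; the variable names and data are hypothetical and were constructed so that ordinary least squares is biased while the instrumented estimate recovers the true effect.

```python
def mean(xs):
    return sum(xs) / len(xs)


def ols(x, y):
    """Return (intercept, slope) from a bivariate least-squares fit of y on x."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope


def two_stage_least_squares(z, x, y):
    """2SLS slope of y on x using a single instrument z."""
    a, b = ols(z, x)                  # first stage: predict x from the instrument
    x_hat = [a + b * zi for zi in z]
    _, slope = ols(x_hat, y)          # second stage: regress y on the fitted x
    return slope


# Toy data: u drives both x and y (so OLS is biased) but is unrelated to z.
z = [0, 0, 1, 1]
u = [1, -1, 1, -1]
x = [zi + ui for zi, ui in zip(z, u)]          # [1, -1, 2, 0]
y = [2 * xi + 3 * ui for xi, ui in zip(x, u)]  # true effect of x on y is 2

print(ols(x, y)[1])                      # 4.4
print(two_stage_least_squares(z, x, y))  # 2.0
```

Running the second stage by hand gives the correct point estimate, but its naive standard errors are not valid; dedicated instrumental-variables routines correct them.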

As an additional check, I reanalyzed the data with a 
different sample of respondents. I included only whites 
from cities that have had a black mayor at any point in 
their history. Since all cities in this new subsample elect 
black mayors, any positive change in white attitudes 
under black leadership is less likely to result from 
intercity differences in racial attitudes and is more 
likely to be directly related to experience with an 
incumbent black mayor. This analysis reconfirms the 
results from Table 1. Using this more select group of 
cities, the analysis reveals once again that under black 
mayors antiblack affect declines, racial resentment 
wanes, and whites perceive less racial group conflict 
(analysis available from the author). 

One last confirmation is to look at changes over time 
during the course of a black mayoralty in a city. On the 
one hand, if learning is the result of experience with 
black leadership, then white racial attitudes should not 
change overnight; instead, they should improve over 
time under a black mayor. On the other hand, if blacks 
are elected simply because whites in some cities are 
more racially tolerant than whites in other cities, then 
there is no reason to expect a positive change in white 
racial attitudes as the years under black leadership go 
by. The analysis of change over time is presented in the 
next section. 


When Do White Attitudes Change?


The results so far strongly support the information 
hypothesis. Black leadership appears to have a positive 
influence on the way whites view the black community. 
Yet, all the previous specifications compared respon- 
dents living in either black mayoral cities or white 
mayoral cities. If the information hypothesis is correct 
about experience with black leadership, then as years 
pass under a black mayor, whites should feel less and 
less threatened and should express increasingly positive 
racial attitudes. 

I test this proposition by looking at the effect of a 
black mayoralty over time. The dependent variables 
are the same: the status of blacks, antiblack affect, the 
racial resentment scale, and perceptions of group 
conflict. The independent variable of interest in this 
case is the length of time a city has experienced a black 
mayoralty. In order to ensure that the results are not 
skewed by racially tolerant respondents from one or 
two cities with decades of black mayoral leadership, I 
normalize the years of black leadership in each city. 
Thus, in each city the length of time under black 
leadership is measured on a scale from 0 to 1; 0 is the 
first year a black mayor was elected, and 1 is the last 
year the city had a black mayor.15 This allows me to 
compare respondents who have experienced varying 
numbers of years under black leadership.16
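The normalization can be illustrated with the Los Angeles example given in note 15. This is a sketch under stated assumptions: the function name is hypothetical, and the list of years is an illustrative stand-in chosen so that 1984 is the 11th of 20 years, as the note states, rather than a record of actual terms of office.

```python
def tenure_share(black_mayor_years, survey_year):
    """Share of a city's total black-mayoral years that had elapsed by the survey year."""
    total = len(black_mayor_years)
    elapsed = sum(1 for year in black_mayor_years if year <= survey_year)
    if total == 0 or elapsed == 0:
        raise ValueError("city had no black mayor by the survey year")
    return elapsed / total


# Hypothetical 20-year span in which 1984 is the 11th year and 1992 the 19th.
los_angeles = list(range(1974, 1994))
print(tenure_share(los_angeles, 1984))  # 0.55
print(tenure_share(los_angeles, 1992))  # 0.95
```

Years in which a white mayor held office in the interim are simply left out of the list, which matches the exclusion described in the note. Under this formula the first year scores 1/total rather than exactly 0.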

As can be seen in Table 2, the results offer further 
support for the information hypothesis. First, white 
attitudes toward the black community improve over 
time under a black mayoralty. As the number of years 
under a black mayor increases, both antiblack affect 
and racial resentment decline significantly (p < .01). 
Second, there appears to be some change in white 
views of the pace of racial change. While the change is 
not quite significant, whites in cities that have more 
experience with a black mayoralty tend to think black 
status is changing more slowly. In other words, white 
residents slowly realize that black mayoral leadership 
has not had a dramatic, negative effect on the well- 
being of the white community.17 


15 To create the scale, I added up the number of years the city has
had a black mayor (excluding any interim years in which a white
mayor held office). I then calculated for each respondent the number
of years the city had had a black mayor at the time of the survey. I
then divided the latter number by the former. Thus, a respondent from
Los Angeles who was surveyed in 1984 would receive a value of .55
(the 11th year of 20 total years of black leadership in Los Angeles).
A Los Angeles resident surveyed in 1992 would receive a value of .95
(the 19th of 20 years of black leadership).

16 As a secondary test, I repeated the analysis with a nonnormalized
scale (the number of years under black leadership). The results were 
almost identical, but the level of significance of the effects of black 
leadership on white racial attitudes declined in some cases. 

17 In fact, white views closely mirror events surrounding the mayoral
transition. White respondents who live in cities experiencing the
transition from a white mayor to the city’s first black mayor believe
that African Americans have made significant gains. Over time, as
the real effect of a black mayoralty becomes clear, whites come to
realize that black gains have not been very significant. By the end of
black leadership in a city, whites believe that the status of blacks is
improving more slowly than it was before the onset of black mayoral
leadership (not shown in Table 2).
TABLE 2. How Time under Black Leadership Affects White Racial Views

                                    Status of Blacks  Antiblack Affect  Racial Resentment  Racial Group Conflict
                                    (Ordered Logit)   (OLS)             (OLS)              (Ordered Logit)
Years under a black mayoralty
  (1 = last year, 0 = first year)   -.88 (.48)        -.05 (.02)**      -.15 (.04)**       .75 (.52)
Education                           -1.3 (.29)***     -.05 (.01)***     -.22 (.03)***      -1.8 (.33)***
Income                              -.48 (.38)        .00 (.01)         -.00 (.03)         -.05 (.39)
Age                                 2.2 (.44)***      .04 (.02)*        -.00 (.04)         1.4 (.48)**
Gender (1 = male)                   .14 (.15)         .00 (.01)         .02 (.02)          .16 (.17)
Ideology (1 = liberal)              -.89 (.41)*       -.06 (.02)***     -.28 (.04)***      -2.5 (.47)***
Party ID (1 = Democrat)             -1.0 (.54)        .01 (.01)         -.02 (.03)         -.61 (.30)*
Employment status
  (1 = unemployed)                  .09 (.54)         -.00 (.02)        .02 (.04)          .05 (.59)
Years living in city                -.07 (.27)        .01 (.01)         .03 (.02)          -.03 (.31)
Percentage black in city            .01 (.01)         .02 (.02)         .08 (.04)*         .77 (.52)
Level of urbanism                   .19 (.35)         -.01 (.01)        -.07 (.03)*        .38 (.40)
South (1 = yes)                     .34 (.19)         .01 (.01)         .03 (.02)          .39 (.22)
1986                                .04 (.36)                                              -.34 (.39)
1988                                -.28 (.34)        -.02 (.01)*       .05 (.02)*         -.73 (.38)
1990                                -.33 (.39)        -.02 (.02)                           -.94 (.44)*
1992                                -.35 (.42)        -.01 (.01)        .08 (.03)**        -1.6 (.47)***
Constant                                              .62 (.02)***      .96 (.04)***
Intercept 1                         -3.5 (.59)***                                          -4.3 (.65)***
Intercept 2                         -1.2 (.57)*                                            -.61 (.61)
Adj. R²/pseudo R²                   .14               .08               .21                .20
χ²                                  101                                                    138
N                                   689               800               800                622

Source: American National Election Study, Cumulative Data File, 1984-92.
Note: Figures are unstandardized coefficients with their standard errors shown in parentheses. *p < .05, **p < .01, ***p < .001.





There is one area in which white perceptions do not 
change. As Table 2 shows, there is no alteration in 
white perceptions of racial group conflict. It may be 
that despite positive improvements in white racial 
attitudes, racial divisions remain quite large outside the 
arena of mayoral politics. Just how much or how little 
the racial divide over policy alters under black mayors 
is discussed later. 

Overall, these results closely match anecdotal evi- 
dence about the transition in many of these cities. 
Maynard Jackson’s mayoral victory in Atlanta, for 
example, was followed by a period of bitter racial 
confrontation. Only after the mayor played a pivotal 
role in breaking a strike of low-paid, mostly black 
garbage workers did he begin to receive more support 
from the white community. According to one advisor, 
this action helped make whites “less paranoid” (Scott 
1977). Similarly, survey data in Birmingham suggest 
that racial tension increased immediately after the 
election of Richard Arrington as mayor (Russakof 


1983). By the time he ran for reelection, however, 
Arrington and other local politicians were complaining 
that the national news media no longer covered the 
campaign because the city’s mayoral politics lacked the 
racial animosity of old (Russakof 1983). In short, 
experience with black leadership matters. 


A Broader Effect: The White Vote


The onset of black leadership appears to signal the 
beginning of a real transformation in white racial views. 
This change is important in and of itself, but it raises a
second question. Does the information that whites get 
from black incumbency translate into concrete changes 
in their behavior? If experience with black mayors 
teaches whites to be less fearful of blacks and black 
leadership, this should be reflected in the white vote. 
The heated white opposition that black challengers 
typically face should diminish after they become in- 
cumbents. 







Several studies indirectly address this question at the 
local level. Each points to the same conclusion: White 
opposition to black challengers is much more pro- 
nounced than white opposition to black incumbents 
(Bullock 1984; Hajnal 1998; Vanderleeuw 1989; 
Watson 1984). An analysis of mayoral elections in 23 
cities found that, on average, white support for the 
same black candidate increased by 25% when s/he 
became an incumbent (Hajnal 1998). Bullock’s (1984) 
study of 52 elections in the Atlanta area found that 
black incumbents received more than twice as many 
white votes as black challengers. Watson’s (1984) study 
of the transition from a white to a black mayoralty in 
eight major cities found the same pattern: Whites 
intensely opposed black challengers but supported 
black incumbents in large numbers. Based on 42 local 
elections in New Orleans, Vanderleeuw (1991) con- 
cluded that racially polarized voting in black incumbent 
elections was substantially lower than in black chal- 
lenger elections. In short, it appears that many white 
voters do change their minds about black leadership.18

The reelection rate of black incumbents also can tell 
us a fair amount about how widespread is white 
acceptance of black leadership. The data displayed in 
Table 3 break down these contests by both the race of 
opponents and the racial demographics of the city. The 
table clearly indicates two important facts about in- 
cumbent black mayors. First, they are generally very 
successful in retaining office: 78% win their reelection
bid.19 Second, and more important, even when
white voters can control the election outcome and have 
the option of choosing a white challenger, black incum- 
bents still do extremely well. In majority white cities 
against white opponents, they win reelection 74% of 
the time.20 If white voters did not support black
incumbents in large numbers, the figures in Table 3 
would be very different.2! More research on white 


18 More descriptive studies reveal an equally dramatic and equally
telling change in the tone of these elections as well (Eisinger 1980;
Pettigrew 1976). Black challenger elections are often characterized
as ugly, racialized affairs in which the choice is literally black versus
white. Phrases such as “racist tactics,” “race dominated,” “highly
” and “race conscious” are common. In contrast, when
black incumbents run, the election is more likely to be about their
records, their styles, and the issue preferences of voters. Bids by black
incumbents tend to be described as “surprisingly uneventful” races,
with “ho-hum voters,” “low-key amity among the candidates,” and
“almost dignified politicking” (quotations from newspaper accounts
of elections in Flint, Michigan (1983), Memphis, Tennessee (1991),
Los Angeles, California (1969), Chicago, Illinois (1983), and Atlantic
City, New Jersey (1984)).
19 This figure is similar to the 84% reelection rate of all white
mayoral incumbents between 1970 and 1985 (Wolman, Page, and
Reavley 1996).
20 Even when whites make up more than 70% of the population and
should totally dominate electoral outcomes, they reelect black
incumbents 78% of the time (80% of the time when they face white
opponents).
21 The same overall pattern can be observed at the congressional
level. Although black challengers rarely win election in minority
black districts (only 25 different black challengers have done so),
once elected, black House members are nearly invincible. In minority
black districts, these 25 incumbents have won reelection 97% of the
time (82 out of 85 cases). Data from majority white and mixed
districts also indicate that black House members win reelection with
TABLE 3. Incumbent Black Mayor Reelection Rates, 1965-99

                                          All Contests   Against White Opponents
All cities with incumbent black mayors    78% (126)      83% (58)
Majority white cities                     71% (45)       74% (27)
Minority white cities                     83% (80)       90% (28)

Sources: National Roster of Black Elected Officials, local newspapers.
The data include all cities with a population over 50,000.


voting patterns needs to be done, but the evidence 
presented here supports the conclusion that black 
mayoral leadership makes a difference in the views and 
actions of white voters. 


An Even Broader Effect? Racial Policy 
Preferences of Whites 


The information hypothesis does not predict whether 
experience with black leadership will change white 
policy preferences. It simply claims that black control 
of the political process should allay white fears and 
improve white views of blacks and black leadership. 
The implication for policy preferences is unclear. 
Whites may feel less threatened by black leadership 
and be quite willing to vote for a black incumbent, but 
this may not affect how they feel about affirmative 
action programs that could cost them a job. 

Despite this ambiguity, it is important to look at 
white policy preferences. The well-being of the black 
community is, in many ways, tied to the policy preferences 
of whites, and the greater willingness of whites to 
support special assistance could have a dramatic effect 
on the status of blacks.23 

To see whether white policy positions change under 
black mayoral leadership, I looked at three questions 
on racially explicit policy matters included in the survey 
for the ANES pooled sample. (1) Do you think the 
government in Washington should see to it that white 
and black children go to the same schools? (2) Should 
the government in Washington see to it that black 
people get fair treatment in jobs? (3) Should the 
government in Washington make every possible effort 
to improve the social and economic position of 
blacks?24 The wording of the entire questions and 
descriptive statistics are given in Appendix B. 


more white support than they received as challengers (Bullock and 
Dunn 1997; Swain 1995). 

22 If whites oppose race-based policies because of the costs and 
benefits involved, then black representation should have little influ- 
ence on white preferences since it does nothing to alter the costs and 
benefits. If white policy preferences are driven by racial antagonism, 
then black representation may have an influence. 

23 An examination of policy preferences is a demanding test of just 
how much white views change under black leadership. Over the past 
few decades, surveys have shown progress in terms of 
white support for the principle of racial equality but a minimal 
increase in the willingness to support special programs to assist 
blacks (Schuman et al. 1997). 

24 Because these questions explicitly mention federal action, it is 
Source: American National Election Study, Cumulative Data File, 1984-92. 


Table 4 suggests that learning does not extend to the 
arena of racial policy. Black mayoral leadership ap- 
pears to have no clear influence on white policy 
preferences. There is no link between a black mayor- 
alty and white views on either affirmative action in 
employment or general assistance to blacks. The only 
issue for which there is some relationship is school 
integration, but even that is not significant (p < .10). 
Moreover, I found no positive change over time in 
white views on school integration. In short, under black 
mayors, white views of blacks may improve, and whites 
may feel less threatened, but there is no greater 
willingness to support programs specifically designed to 
aid the black community. 

A review of the policy debate in black-run cities 
seems to confirm this view. Despite years and often 


unclear whether answers reflect racial preferences or attitudes 
toward government. As at least a partial correction, I included as a 
control variable a scale that measures attitudes toward limited 
government. The results were little affected. A description of the 
scale is given in Appendix B. 





TABLE 4. The Effect of a Black Mayor on White Policy Preferences 

School Integration (Ordered Logit)  Fair Treatment in Jobs (Ordered Logit)  Government Assistance to Blacks (OLS) 

Black mayor (1 = yes, 0 = no)  .26 (.15)  −.20 (.19)  .02 (.01) 
Education  -49 (.19)*  .99 (.23)"*  .16 (.02)"* 
Income  −.26 (.22)  .57 (.26)*  −.01 (.02) 
Age  −.41 (.27)  −1.4 (.33)""  .08 (.02)"™* 
Gender (1 = male)  .05 (.10)  .40 (.12)""*  .01 (.01) 
Ideology (1 = liberal)  .88 (.28)"  .47 (.32)  .12 (.02)"" 
Party ID (1 = Democrat)  .34 (.18)  .50 (.21)*  .06 (.01)"* 
Employment status (1 = unemployed)  -10 (.28)  .92 (.41)*  −.01 (.01) 
Years living in city  -16 (.18)  −.25 (.21)  −.01 (.01) 
Percentage black in city  −.75 (.34)*  .08 (.42)  −.10 (.02)"" 
Level of urbanism  .22 (.19)  .20 (.24)  .04 (.02)* 
South (1 = yes)  −.13 (.18)  −.01 (.15)  −.02 (.01)* 
First year of black mayoralty (1 = yes)  −.31 (.24)  .01 (.27)  −.02 (.02) 
1986  −.01 (.01) 
1988  −.05 (.16)  −.07 (.01)*™ 
1990  .10 (.15)  −.07 (.01)""* 
1992  -13 (.12)  −.03 (.16)  −.07 (.01)"" 
Limited government scale (1 = pro-govt.)  1.4 (.30)""  2.2 (.36)**  .39 (.02)*** 
Constant  .08 (.03)** 
Intercept 1  .99 (.30)""*  1.9 (.38)*** 
Intercept 2  2.4 (.31)""  2.1 (.38)*"* 
Adj. R²/pseudo R²  .08  .13  .21 
χ²  118  168 
N  1,414  1,242  2,912 

Note: Figures are unstandardized coefficients with their standard errors shown in parentheses. *p < .05, **p < .01, ***p < .001. 


decades of black mayoral leadership, whites and blacks 
in most of these cities continue to clash over minority 
contract set-asides, police behavior, public schools, and 
downtown versus neighborhood development (Rivlin 
1992; Stone 1997). Tom Bradley may have won the 
majority of the white vote in Los Angeles, and he may 
have demonstrated that black leadership does not 
harm white residents, but his tenure did little to change 
white opposition to school busing or affirmative action 
(Sonnenshein 1993). As the riots in Los Angeles in 
1992 so vividly attest, racial conflict can be reignited 
quickly, with or without a black mayor. In other words, 
there has been real, positive change in terms of the 
white vote and white racial attitudes, but there is still a 
long way to go before whites and blacks agree on what 
policies to enact. 


Racial Threat 


A review of Tables 1 and 4 reveals another interesting 
finding. In line with substantial research on racial 
threat (e.g., Giles and Evans 1986; Glaser 1994), the 
size of the black population has a strong, negative 
effect on both white racial attitudes and white policy 
preferences. A higher percentage of blacks in the city 
means greater antiblack affect, increased racial resent- 
ment, and a greater sense of racial group conflict as 
well as less willingness to support school integration or 
special assistance to blacks. The contrast between the 
effects of black mayoral leadership and a large black 
population is striking. Whereas the actions of blacks in 
positions of authority seem to quell white fears, a large 
black presence seems to increase racial tensions. This 
supports the view that the nature of interracial contact 
matters (Jackman and Crane 1986). 


Which Whites Change Their Mind? 


To say that black leadership results in a significant 
positive change in white racial attitudes and voting 
behavior does not mean that all whites change their 
mind about blacks and black leadership. Despite their 
experiences with black mayors, many whites resist 
black leadership. 

Given the different racial agendas of the Republican 
and Democratic parties (Carmines and Stimson 1989; 
Huckfeldt and Kohfeld 1989) and the possible link 
between conservatism and antiblack affect (Kinder and 
Sanders 1996; Kinder and Sears 1981), one might 
predict that white Republicans will be more resistant to 
the information that black leadership provides, 
whereas white Democrats may be much more recep- 
tive. If so, one would expect that most if not all positive 
change in racial attitudes and voting behavior that 
occurs under a black mayor would be confined to white 
Democrats. 

I tested this proposition by separating out the responses 
of white Democrats, Republicans, and party 
moderates to black mayoral leadership.25 The dependent 
variables are the same racial attitude questions 
examined earlier. The only change is that in Table 5 I 
interact the years under a black mayor variable with 
dummy variables for Democratic and Republican party 
identification. Moderates become the baseline group. 
The coefficient for Years Black Mayor indicates how 
white moderates respond to time under black leadership. 
To determine how Democrats (Republicans) respond 
to time under black leadership, I add the 
coefficient for the Years Black Mayor-Democrat 
(Republican) interaction term to the coefficient for 
Years Black Mayor. 
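
The arithmetic behind this comparison can be sketched in a few lines of Python. This is only an illustration of how a baseline coefficient and an interaction term combine; the coefficient values below are hypothetical placeholders chosen for readability and are not taken from Table 5, and the function name `net_effect` is an assumption of this sketch.

```python
def net_effect(base: float, interaction: float = 0.0) -> float:
    """Net effect of years under a black mayor for one partisan group.

    base is the Years Black Mayor coefficient (the moderate baseline);
    interaction is the group's interaction coefficient (0 for moderates).
    """
    return base + interaction


# Hypothetical coefficients, for illustration only (NOT values from Table 5).
years_black_mayor = -0.10   # baseline: change for white moderates
x_democrat = -0.05          # Years Black Mayor x Democrat
x_republican = 0.09         # Years Black Mayor x Republican

effects = {
    "moderates": net_effect(years_black_mayor),
    "democrats": net_effect(years_black_mayor, x_democrat),
    "republicans": net_effect(years_black_mayor, x_republican),
}

for group, effect in effects.items():
    print(f"{group}: {effect:+.2f}")
```

With placeholder values like these, an interaction term of opposite sign and similar size to the baseline leaves a net effect close to zero for that group, which is the kind of pattern the text describes for Republicans.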

Table 5 suggests that white Democrats, white Re- 
publicans, and white moderates respond differently to 
black leadership. First, the more time white moderates 
spend under black mayoral leadership, the more posi- 
tive are their racial views. As the years go by, white 
moderates feel less antiblack affect and are less racially 
resentful. Second, these changes are as positive or 
more positive for white Democrats. Third, Republicans 
respond less positively to time under a black mayoralty. 





25 Moderates are self-identified independents or respondents who 
listed no party affiliation. 
On two of the four attitudinal measures, Republican 
views improve significantly less than those of moderates 
as the years under black leadership go by. When 
one adds the Years Black Mayor-Republican interaction 
term to the Years Black Mayor term, the net result is 
almost zero in two cases, which suggests little to no 
change in the level of racial resentment and antiblack 
affect among white Republicans. In other words, Re- 
publicans’ racial views do not change nearly as much as 
Democrats’ views under black leadership over time. 

The same pattern emerges if one simply compares 
whites in cities with either a black or white mayor, as 
was done in Table 1. White Democrats who have a 
black mayor have more positive views of blacks than do 
white Democrats who have a white mayor (data not 
shown). White moderates also have marginally more 
positive racial views under black mayors, but white 
Republicans’ racial views change only marginally. 
The same pattern is evident in the actual vote in 
mayoral elections that involve black incumbents. In the 
few cases in which voting data are broken down by both 
race and party identification, the evidence suggests that 
the bulk of the increase in white support for black 
incumbents comes from Democrats (Pettigrew 1976; 
Sonnenshein 1993). White Republicans, in contrast, 
tend to vote against black candidates, whether they are 
challengers or incumbents.27 

The information hypothesis does not apply to all 
white voters equally. White Democrats and moderates 
appear willing to incorporate the information they 
receive from their experience with black leadership. 
For these two groups, the information hypothesis 
seems to account for their response. In contrast, white 
Republicans tend to be less open to change. The results 
in Table 5 suggest that some either ignore or discount 
the words and actions of black incumbents. Thus, for 
some white Republicans a racial prejudice or racial 
stereotype hypothesis of political behavior may be 
more appropriate. 

In sum, black leadership has a polarizing effect on 
the white community. Democrats become more and 
more racially liberal, and Republican views stay largely 
the same. Indeed, the racial gap between white Dem- 
ocrats and white Republicans more than doubles in 
size under black mayoral leadership. Table 6 illustrates 
this growing gap by presenting the difference between 
mean Democratic views and mean Republican views 





26 This pattern could be the result of “racist” white Democrats 
defecting from the party, but two factors make this highly unlikely. 
First, since most Democratic defectors became independents rather 
than Republicans, any negative change could be expected to occur 
among independents, not Republicans. Second, the gradual defection 
of whites from the Democratic Party over time is unlikely to 
account for both the changes in white views under a black mayor and 
the changes over time in white views under black leadership, 
especially since each regression model includes a control for the year 
of the survey. 

27 Since most black candidates to date have been Democrats, this is 
not surprising. Nevertheless, it seems ironic that racial considerations 
are most consequential for white Democrats (see Hurwitz and Peffley 
1998 on this point). Whether they support a black candidate seems to 
be greatly affected by their views of blacks and black leadership. In 
contrast, white Republicans are likely to vote against liberal, black 
candidates, no matter what their racial views. 
TABLE 5. How Time under Black Leadership Affects the Views of Democrats, Republicans, and 
Moderates 

Status of Blacks  Antiblack Affect  Racial Resentment  Racial Group Conflict 
(Ordered Logit) 

Years black mayor  −.88 (.48) 
Years black mayor x Democrat  .26 (.34) 
Years black mayor x Republican  −.14 (.33) 
Education  −1.3 (.29)"" 
Income  −.04 (.39) 
Age  2.2 (.44)"™ 
Gender (1 = male)  .14 (.15) 
Ideology (1 = liberal)  −.89 (.41) 
Party (1 = Democrat)  −1.0 (.54) 
Employment status (1 = unemployed)  -10 (.54) 
Years living in city  −.07 (.27) 
Percentage black in city  -12 (.50) 
Level of urbanism  -19 (.35) 
South (1 = yes)  .34 (.20) 
1986  .04 (.38) 
1988  −.28 (.34) 
1990  −.33 (.39) 
1992  −.35 (.42) 
Constant 
Intercept 1  −3.5 (.59)*™ 
Intercept 2  -1.2 (.57)" 
Adj. R²  .14 
χ²  101 
N  669 
Source: American National Election Study, Cumulative Data File, 1984-92. 
Note: Figures are 





across the same series of racial questions. As column 
one reveals, the gap between white Democrats and 
white Republicans in cities with a white mayor seems 
rather small. On two of the four questions, the differ- 
ence is not even statistically significant (p < .05). But 
the gap increases dramatically under black mayors, and 
it more than doubles in all but one case. Black leader- 
ship means even greater division between Democrats 
and Republicans. 


IMPLICATIONS 


Four lessons can be drawn from this study. The first 
and most obvious is that black representation does 
matter. Many have bemoaned the fact that black 
political leaders have not resolved the crises of black 
poverty, educational inequality, crime, and unemploy- 


(OLS) (OLS) (Ordered Logit) 
—08 (02)" T18 (04)" 57 (52) 
~.08 (.02)*** —.07 (.04)" 01 (.37) 

42 (.02)"** 14 (03) —.42 (.38) 
—.08 (.01)** — 22 (.03)"" -14.7 (31) 
01 (.02) 01 (.03) —.04 (.38) 

04 (.02)* 01 (.04) 1.5 48)" 
01 (.01) 02 (.02) 10 (17) 
—.06 (.02)"** —.28 (,04)"** ~2.6 (.48) 
15 (.02)"** 14 (.05)"* —.95 (.58) 
~.00 (.02) 01 (.04) 20 (59) 
01 (.01) 02 (.02) 13 (29) 
03 (.02) 09 (.04)" 01 (.01) 
01 (.01) —.05 (.03) 41 (39) 
01 (.01) 03 (.02) 38 (21) 
—.46 (.38) 
02 (.01)* 05 (.02)" —.80 (.37)" 
~.02 (.02) — 94 (.42)* 
02 (.01) 09 (.02)"* ~1.6 (.4)""* 
54 (.02)"* 88 (.05)"* 
—4.7 (85) 
—.95 (.61) 
14 2 21 
154 
800 800 658 





coefficients with their standard errors shown in parentheses. *p < .05, **p < .01, ***p < .001. 
ment, and scholars interpret this as a sign of ineffec- 
tiveness (Marable 1992; Reed 1988; Singh 1998; Smith 
1996). This study shows, however, that politics as usual 
under black representation can have a very positive 
effect on whites. Officeholding by African Americans 
may not lead directly to racial equality or a change in 
white policy preferences, but it appears to result in a 
fundamental change in the white vote and in the racial 
sentiments expressed by at least part of the white 
electorate. 

Second, this transformation suggests that racial atti- 
tudes are not fixed in the minds of many whites but are 
apt to change as new information emerges. This is not 
to say that whites do not hold inaccurate racial stereotypes, 
only that a significant number are capable of 
change. 


The third lesson is the critical role information can 
TABLE 6. Black Mayoralty and Polarization 
of the White Community 

Difference between Mean Democratic and Mean Republican View 

Cities without a Black Mayor (a)  Cities with a Black Mayor (b) 

Status of blacks  −.01  −.08" 
Antiblack affect  −.01  −.03" 
Racial resentment  −.04"  −.11% 
Sense of black threat  −.06""  era eb 

Source: American National Election Study, Cumulative Data File, 1984-92. 

Note: The difference between the Democratic and Republican mean is 
significant at *p < .05, **p < .01, ***p < .001. All dependent variables 
coded 0-1. 

(a) Ns for Democrats range from 744 to 962; Republicans range from 654 
to 848. 

(b) Ns for Democrats range from 243 to 284; Republicans range from 273 
to 208. 





play in the racial arena. Whites often oppose a black 
challenger because they have little experience with 
blacks in office and are uncertain about how such 
leadership will affect their lives. Black incumbents who 
allay their fears are far more likely to receive white 
support. This suggests that black challengers should 
point to the past success of black officeholders. For 
example, if a black challenger can clearly demonstrate 
to white voters that election of a black mayor in 
Atlanta, Los Angeles, and other cities led to economic 
prosperity rather than decline, it may be possible to 
increase greatly the number of African Americans 
elected across the country. 

The fourth lesson is that black challengers and 
incumbents do not face the same racial concerns and 
obstacles to electoral success. Certain studies find little 
evidence of antiblack voting (Citrin, Green, and Sears 
1990; Swain 1995), but it is important to note that they 
focus on black incumbents. Others find widespread 
antiblack voting (Loewen 1990; McCrary 1990), and it 
is equally important to note that they focus on black 
challengers. In short, researchers must understand that 
white support of black candidates may depend less 
upon racial tolerance than on whether the candidate is 
an incumbent or challenger. 

Much of the debate on racial politics in America is 
too narrowly focused. On the one hand, pessimists 
repeatedly claim that there has been little or no 
progress in reducing racial divisions. They consider 
race and racial prejudice as the primary factor in 
American politics in general and in white voting pref- 
erences in particular (Bell 1992; Huckfeldt and Koh- 
feld 1989; McCrary 1990; Reeves 1997). Robert Starks 
(1991, 217), for example, insists that “race is such an 
overriding factor in American life that to support its 
elimination or diffusion as a factor in elections through 
deracialization is folly.” According to pessimists, racialized 
voting patterns and the reason so few black 
candidates are elected outside of majority black places 
can be attributed to the fact that white voters cannot 
overlook race. On the other hand, optimists see tremendous 
progress in the arena of racial politics (Swain 
1995; Thernstrom and Thernstrom 1997). “Whites not 
only say they will vote for black candidates; they do so” 
(Thernstrom 1995, A15). If racist voting occurs, it is 
prevalent only in isolated contests. A black candidate 
loses for much the same reasons as a white candidate 
loses. 

The changes in white racial views that occur under 
incumbent black mayors suggest that the debate is 
addressing the wrong issue. The key question is not 
whether but when race is central in the mind of white 
voters. The arguments about progress on the racial 
front do not, in the end, help us understand how race 
operates in American politics. This research shows that 
race is a complex factor, and we need to find out when 
and why both “racist” and “color-blind” voting occur. 

The overarching message of this study is one of 
caution and hope. On the one hand, we should not 
overestimate the effect of black representation. Racial 
policy still divides white and black America, as the data 
on policy preferences suggest. Racial conflict does not 
end with the election of African Americans, as recent 
riots in several major cities attest. All whites do not 
alter their views on blacks and black leadership, as the 
data on white Republicans show. Above all, the effect 
will remain limited unless more blacks are elected, a 
problem that is only slowly being rectified (Joint Cen- 
ter for Political Studies 1994). 

On the other hand, there are signs of real progress. 
Every new black leader provides additional informa- 
tion to the white community and reduces fears of black 
representation in general. It is no longer possible to 
claim, as Sam Yorty did in Los Angeles in 1969, that 
the city’s police force will quit en masse if a black man 
is elected mayor. That did not happen when Tom 
Bradley was elected or in numerous other cases, so this 
type of threat no longer rings true. The positive trend 
in white racial attitudes over the past few decades is 
clear (Schuman et al. 1997). Black representation may 
not be responsible for all this change, but it can account 
for some of it. 

What does this research imply about minority rela- 
tions in the future? What will happen when the first 
black U.S. president is elected? What effect will more 
female, gay, Latino, or Asian-American incumbents 
have on attitudes toward these groups? In the end, 
expanded representation is unlikely to address all our 
ills, but if it can foster even slightly better understanding 
among groups, it is a goal well worth pursuing. 


APPENDIX A 


In order to perform the two-stage least-squares analysis, I 
first devised a treatment or first-stage equation that modeled 
the presence of a black mayor in the city. In the first stage, 
five exogenous variables served as instrumental variables 
(black income per capita in the city, the percentage of black 
adults with a college degree in the city, the percentage of the 
city that voted Republican in the 1988 presidential election, 
the median income in the city in 1989, and the percentage of 
adults in the city with a college degree in 1990). These 
instrumental variables fit the criteria by Bartels 
(1991). First, according to Karnig and Welch (1980), they are 
among the strongest predictors of a black presence 
(aside from percentage black, which is clearly related to white 
attitudes). In the present data set they explain an additional 
4% of the variation in the first stage. Second, they are, at least 
to a certain extent, exogenous. None of the instrumental 
variables is highly correlated with individual white racial 
attitudes in the survey (r < .15), and omitted-variable 
Hausman tests suggest that the instrumental variables are, 
with one exception, not significantly related to white racial 
attitudes in the second-stage equation (analysis not shown). 
It is also important to note that there is little theoretical 
reason to expect that the five instrumental variables have a 
direct effect on white racial attitudes. Because I control for 
income, education, partisanship, and political ideology at the 
individual level, it seems unlikely that citywide measures of 
income, education, and partisanship would have any addi- 
tional influence on white racial attitudes. The two-stage least- 
squares analysis confirms the results of the OLS analysis. As 
predicted by the information hypothesis, under black mayors 
white residents do not perceive any additional change in the 
status of blacks. Also, under black mayors antiblack affect, 
racial resentment, and sense of racial conflict all decline 
significantly (p < .05). The results of the overall two-stage 
least-squares analysis are available from the author. 
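
The general two-stage procedure described in this appendix can be sketched as follows. This is a minimal illustration on simulated data, not the author's estimation code: the variable names, the simulated data-generating process, and the helper functions `ols` and `two_stage_least_squares` are all assumptions of the sketch, and a linear first stage is used even though the presence of a black mayor is a binary variable. Only point estimates are produced; standard errors taken from a hand-run second stage would not be correct without adjustment.

```python
import numpy as np


def ols(X, y):
    """Least-squares coefficients for y = X b + error."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef


def two_stage_least_squares(y, endog, instruments, controls):
    """Return second-stage coefficients: [constant, endog effect, controls...]."""
    n = len(y)
    const = np.ones((n, 1))
    # Stage 1: predict the endogenous regressor from instruments and controls.
    Z = np.hstack([const, instruments, controls])
    fitted = Z @ ols(Z, endog)
    # Stage 2: replace the endogenous regressor with its fitted values.
    X2 = np.hstack([const, fitted[:, None], controls])
    return ols(X2, y)


# Simulated data (hypothetical): an unobserved confounder biases plain OLS.
rng = np.random.default_rng(0)
n = 20_000
instruments = rng.normal(size=(n, 2))   # affect the outcome only through endog
controls = rng.normal(size=(n, 1))
confounder = rng.normal(size=n)
endog = (instruments @ np.array([1.0, 0.5]) + 0.3 * controls[:, 0]
         + confounder + rng.normal(size=n))
y = 1.0 + 0.5 * endog + 0.2 * controls[:, 0] + confounder + rng.normal(size=n)

naive = ols(np.hstack([np.ones((n, 1)), endog[:, None], controls]), y)[1]
iv = two_stage_least_squares(y, endog, instruments, controls)[1]
print(f"OLS estimate:  {naive:.3f}")
print(f"2SLS estimate: {iv:.3f}  (true effect in the simulation is 0.5)")
```

In the simulation, the plain OLS estimate is pulled away from the true effect by the confounder, while the two-stage estimate recovers it because the instruments are unrelated to the confounder by construction. Whether real instruments satisfy that condition is exactly what the exogeneity checks discussed above are meant to assess.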


APPENDIX B: CODING AND DESCRIPTIVE 
STATISTICS, INDEPENDENT VARIABLES 


Education. Coded as a six-category variable: 0 = completed 
less than 9th grade, .2 = 9-12 years, .4 = high school 
diploma, .6 = 1-3 years college, .8 = bachelor’s degree, 1.0 = 
graduate degree. Mean = .61, Std. Dev. = .31. 


Age. Age in years normalized to 0-1. Mean = .34, Std. 
Dev. = .22. 


Gender. Coded as 1 = male. Mean = .46, Std. Dev. = .50. 


Employment Status. Coded as 1 = unemployed, 0 otherwise. 
Mean = .04, Std. Dev. = .19. 


Ideology. Coded as a seven-category variable: 0 = very 
conservative, .17 = conservative, .33 = somewhat conservative, 
.50 = moderate, .67 = somewhat liberal, .83 = liberal, 
1 = very liberal. Mean = .46, Std. Dev. = .22. 


Partisan Identification. Coded as a seven-category variable: 
0 = strong Republican, .17 = weak Republican, .33 = 
independent/Republican, .50 = independent, .67 = 
independent/Democrat, .83 = weak Democrat, 1.0 = strong 
Democrat. Mean = .51, Std. Dev. = .34. 


Years Living in City. Years of residence in municipality 
normalized. Mean = .29, Std. Dev. = .31. 


Percentage Black in City. Percentage black normalized. 
Mean = .26, Std. Dev. = .19. 


South. Coded as 1 = south, 0 otherwise. Mean = .26, Std. 
Dev. = .44. 


Urbanism. Coded as a three-category variable: 0 = central 
city of 50 largest metropolitan areas, .50 = central city of 
other metropolitan areas, 1.0 = suburb of metropolitan area. 
Mean = .61, Std. Dev. = .28. 


Limited Government Scale. Should government in Washington 
provide fewer services, even in areas such as health and 
education, in order to reduce spending? Should government 
in Washington see to it that every person has a job and a good 
standard of living? Mean = .48, Std. Dev. = .21. The reliability 
of the scale was reasonable, with a Cronbach’s alpha of .48. 
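
The evenly spaced 0-1 codings listed above for the ordered categorical variables can be reproduced with a short sketch. The function name `unit_interval_codes` is hypothetical, and the assumption that each scale is spaced evenly and rounded to two decimals is inferred from the listed values rather than stated in the appendix.

```python
def unit_interval_codes(categories):
    """Map ordered categories to evenly spaced codes on the 0-1 interval."""
    k = len(categories)
    return {name: round(i / (k - 1), 2) for i, name in enumerate(categories)}


ideology = unit_interval_codes([
    "very conservative", "conservative", "somewhat conservative",
    "moderate", "somewhat liberal", "liberal", "very liberal",
])

education = unit_interval_codes([
    "less than 9th grade", "9-12 years", "high school diploma",
    "1-3 years college", "bachelor's degree", "graduate degree",
])

for name, code in ideology.items():
    print(f"{code:.2f} = {name}")
```

Coding every predictor on the same 0-1 range means each coefficient can be read as the estimated difference between the lowest and highest category of that variable.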
Resurgent Mass Partisanship: The Role of Elite Polarization 
MARC J. HETHERINGTON Bowdoin College 


characterize them as weak and in decline. Parties on the elite level, however, have experienced a 
resurgence over the last two decades. Such a divergence between elite behavior and mass opinion is 
curious, given that most models of public opinion place the behavior of elites at their core. In fact, I find that 
parties in the electorate have experienced a noteworthy resurgence over the last two decades. Greater partisan 
polarization in Congress has clarified the parties’ ideological positions for ordinary Americans, which in turn 
has increased party importance and salience on the mass level. Although parties in the 1990s are not as 
central to Americans as they were in the 1950s, they are far more important today than in the 1970s and 
1980s. The party decline thesis is in need of revision. 


With few exceptions (see Keith et al. 1992), the 
scholarly consensus on contemporary American 
political parties in the electorate centers 
on party decline. There is disagreement about its 
sources, whether people are more negative (Nie, 
Verba, and Petrocik 1979) or neutral (Wattenberg 
1984), and about its abruptness, whether precipitous 
(Wattenberg 1984) or less steep but still meaningful 
(Konda and Sigelman 1987), but the conventional 
wisdom is that parties have long been irrelevant to 
many. Bartels (2000) cites a litany of scholarly work 
that suggests party decline in the electorate will persist 
into the new century.1 According to some, that trend is 
potentially irreversible because of the antiparty electoral 
changes implemented in the 1960s and 1970s 
(e.g., Aldrich 1995, 245-53; Beck 1997, 385). 

The centrality of party decline in the thinking of 
public opinion scholars is curious, especially because 
Congress scholars discovered years ago that parties are 
resurgent on the elite level (e.g., Poole and Rosenthal 
1997; Rohde 1991). Since most theories of public 
opinion change focus on the behavior of elites (Brody 
1991; Carmines and Stimson 1989; Page and Shapiro 
1992; Zaller 1992), party resurgence in Congress 
should be consequential in understanding mass atti- 
tudes toward parties. Mass behavior should reflect, at 
least to some degree, elite behavior. Therefore, mass 
party strength should have increased as a result of 
greater partisanship at the elite level. 

I will demonstrate that the measures scholars have 
used as evidence of mass party decline now point to 
party resurgence. In most cases the movement has 


Poole, Daron Shaw, and Jonathan Weiler for their sound advice. I 
also wish to recognize the top-notch research assistance provided by 
Rob Ford and Joah Helfat as well as the computing expertise of Eric 
Chown. The data were obtained from the Inter-university Consor- 
tium for Political and Social Research, which bears no responsibility 
for their use. 

1 Bartels (2000) demonstrates that party identification has an 
increasingly large effect on presidential and congressional vote choice 
and is largely alone in challenging the party decline thesis, although 
opinion scholars note the resurgence on the elite level 
(e.g., Aldrich 1995; Beck 1997). 


been extraordinary, especially in view of the glacial 
pace characteristic of most public opinion change. 
Moreover, it can best be explained by the increase in 
ideological polarization along congressional party 
lines. I will show that elite polarization has clarified 
public perceptions of the parties’ ideological differ- 
ences, which has led to a resurgence of parties in the 
electorate. 


INDIVIDUAL-LEVEL EVIDENCE FOR PARTY 
DECLINE 


Scholars have detailed party decline using data at both 
the aggregate and individual level, but I confine my 
analysis to the latter, using data collected by the 
National Election Study (NES). To public opinion 
scholars, the most familiar evidence of party decline is 
the rapid increase in political independence and the 
accompanying decrease in strong partisanship after the 
1950s. The percentage of independent leaners nearly 
doubled between 1960 and 1980, and the percentage of 
strong partisans dipped by more than one-third 
(Wattenberg 1984).2 One prominent explanation for 
party decline is that, in a candidate-centered era, 
parties have become irrelevant to many people. As 
evidence, Wattenberg (1984) cites a rapid increase in 
the percentage of Americans who are neutral toward 
both parties, as tapped by likes/dislikes questions in the 
NES survey. 

Although Konda and Sigelman (1987) express con- 
cerns about Wattenberg’s measures, they find further 
support, albeit muted, for the neutrality thesis. They 
measured party engagement as the total number of 
party likes and dislikes provided by respondents and 
discovered that engagement declined substantially be- 
tween 1952 and 1984. In later work, Wattenberg (1994, 
1996, 1998) focuses on the Perot candidacies and 
argues that the parties are still in decline. An apparent 
indicator of major party failure is the fact that Ross 
Perot received more votes in 1992 than any third party 
candidate since Theodore Roosevelt eighty years ear- 


2 Bartels (2000) finds that these trends have reversed recently, 
especially among the politically active. The proportion of party 
identifiers among voters was higher in 1996 than in any election since 
1964. 
lier. In addition, split-ticket voting reached an apex for 
the NES survey era. Of course, Perot’s historically 
strong showing absent congressional Reform Party 
candidates potentially explains the increase in ticket 
splitting, although this phenomenon is often consid- 
ered a symptom of party decline (e.g., Beck 1997; 
Keefe 1998). 


INDIVIDUAL-LEVEL EVIDENCE OF PARTY 
RESURGENCE 


Because the conventional wisdom has a strong hold, I 
must demonstrate that a resurgence of party at the 
mass level has occurred. I do so by relying on many of 
the measures developed in the 1980s to show party 
decline. Key to Wattenberg’s (1984) argument is an 
increase in the percentage of people with neutral 
feelings toward both parties, as measured by the net 
number of likes and dislikes offered by NES respon- 
dents. If people give more reasons for liking a party 
than disliking it, they are considered positive toward 
that party. If they provide more dislikes than likes, they 
are considered negative. An equal number of likes and 
dislikes or no responses at all indicate neutrality. 
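
The classification rule just described can be sketched in Python. This is an illustration only: the function and argument names are hypothetical, the like and dislike counts are assumed to be already tallied for each respondent, and nothing here reproduces the NES coding files.

```python
AFFECT_ORDER = ["positive", "neutral", "negative"]


def party_affect(likes: int, dislikes: int) -> str:
    """Classify affect toward one party from counts of likes and dislikes."""
    if likes > dislikes:
        return "positive"
    if dislikes > likes:
        return "negative"
    # Equal counts, including no responses at all, indicate neutrality.
    return "neutral"


def affect_category(dem_likes: int, dem_dislikes: int,
                    rep_likes: int, rep_dislikes: int) -> str:
    """Combine the two party classifications into an unordered pair label."""
    pair = sorted(
        [party_affect(dem_likes, dem_dislikes),
         party_affect(rep_likes, rep_dislikes)],
        key=AFFECT_ORDER.index,
    )
    return "-".join(pair)
```

Under these assumptions, a respondent who offers two likes about the Democrats and one dislike about the Republicans falls into the positive-negative category, whereas a respondent who offers nothing about either party is neutral-neutral.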

The solid lines in Figure 1 track changes in the most 
partisan and most neutral categories. The percentage 
of those neutral toward both parties declined by 6 
points between 1980 and 1996, and the percentage of 
those positive toward one party and negative toward 
the other increased by the same amount. Positive- 
negative replaced neutral-neutral as the modal cate- 
gory in 1988 and continued as the mode through the 
rest of the time series. Although the proportion of 
positive-negatives in 1996 does not approach that of 
1952, a movement toward greater partisanship is still 
evident. 
This use of the likes-dislikes measure has several 
problems. In addition to obscuring differences between 
categories and overstating neutrality (see DeSart 1995; 
Konda and Sigelman 1987; Stanga and Sheffield 1987), 
the measure lacks a stated neutral point. People are 
classified as neutral if they unwittingly balance the 
number of likes and dislikes or, perhaps more prob- 
lematically, provide no answers at all. Feeling ther- 
mometers are more attractive because they have an 
explicit neutral point, 50 degrees, and almost all re- 
spondents provide valid answers (Craig 1985).3 

I can use the NES party thermometers to construct a 
measure of affect similar to Wattenberg’s. I classify 
those who answer 50 degrees to both thermometers as 
neutral-neutral, those who answer above 50 degrees to 
one party and below 50 degrees to the other as 
positive-negative, and so forth. The results appear as 
the broken lines in Figure 1. The most noteworthy 
finding is the recent upsurge in positive-negatives. Only 
about 35% fell into this most partisan category in 1980, 
compared to nearly half in 1996, which represents an 
increase of 40%. 
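
A minimal sketch of the thermometer-based classification follows, assuming each respondent is represented by a pair of ratings (Democratic, Republican). The function names and the small sample are hypothetical, and the value 0.49 merely stands in for "nearly half"; it is not a figure taken from the data.

```python
NEUTRAL_POINT = 50  # explicit neutral point of the feeling thermometer, in degrees


def thermometer_affect(rating: float) -> str:
    """Classify one party thermometer rating relative to the neutral point."""
    if rating > NEUTRAL_POINT:
        return "positive"
    if rating < NEUTRAL_POINT:
        return "negative"
    return "neutral"


def is_positive_negative(dem_rating: float, rep_rating: float) -> bool:
    """True when one party is rated above 50 degrees and the other below."""
    affects = {thermometer_affect(dem_rating), thermometer_affect(rep_rating)}
    return affects == {"positive", "negative"}


def share_positive_negative(ratings) -> float:
    """Share of (Democratic, Republican) rating pairs that are positive-negative."""
    pairs = list(ratings)
    return sum(is_positive_negative(d, r) for d, r in pairs) / len(pairs)


def relative_change(before: float, after: float) -> float:
    """Proportional change, as opposed to the percentage-point difference."""
    return (after - before) / before


# Hypothetical sample of four respondents; two of them are positive-negative.
sample = [(85, 30), (50, 50), (40, 70), (60, 60)]
print(share_positive_negative(sample))        # 0.5
print(round(relative_change(0.35, 0.49), 2))  # 0.4
```

The last line shows why a move from about 35% to nearly half is described as an increase of 40%: the change is expressed relative to the starting share rather than in percentage points.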

Konda and Sigelman (1987) measure party engage- 
ment as the total number of likes and dislikes that 
respondents provide about the parties. Figure 2 reveals 
that parties are far more salient in the 1990s than in the 
1970s and 1980s; the mean number of responses in 
1996 was higher than in any year except 1952 and 1968. 
By this measure, the salience of party has increased by 
45% since 1980. 


3 The NES changed the phrasing of the party thermometer questions 
in 1978. Previously, 


quite large. Therefore, I cannot extend the analysis back any farther. 
FIGURE 2. Mean Total Number of Likes and Dislikes about the Parties, 1952-96 

Another indicator of party-centric attitudes is 
straight ticket voting. Wattenberg (1994) notes that 
voting for a presidential candidate and House member 
of different parties reached a high for the NES era in 
1992, but Figure 3 demonstrates that 1992 was an 
anomaly. Even if third party voters are included, a 
higher percentage of Americans reported voting for a 
presidential and House candidate of the same party in 
1996 than in any year since 1964. The tendency is much 
clearer among major party presidential voters. Straight 
ticket voting for president and House in this group has 
increased progressively since 1980. Even in 1992, when 

partisanship was supposedly at a low point, Bush and 
Clinton voters were more inclined to vote for the same 
party in their House election than were Bush and 
Dukakis or Reagan and Mondale voters. 

Although these data suggest a dramatic resurgence 
in party, Perot’s historically large vote share in 1992 
appears to suggest the opposite. Scholars have shown, 
however, that such factors as the third party candi- 
date’s personal characteristics (Rosenstone, Behr, and 
Lazarus 1996) and respondents’ trust in government 
(Hetherington 1999) better explain third party voting 
than does strength of partisanship. Moreover, Perot’s 


4 It can be argued that third party presidential voters are the least 
partisan and hence least likely to vote a straight ticket. Therefore, by 
focusing only on major party presidential voters, I may overstate the 
increase in party loyalty. If this were true, however, a higher level of 
straight ticket voting among major party presidential voters should 
have occurred in 1992, when Perot received 19% of the vote, than in 
1996, when he received just 9%. Instead, of major party presidential 
voters in the two years, 78% voted for a House candidate of the same 
party in 1992, whereas 82% did so in 1996. If the focus on major party 
voters merely eliminated those who might have been least likely to 
vote a straight ticket had Perot not run, I would have found more 
straight ticket voters in 1992 than 1996 because twice as many of the 
supposedly least partisan third party voters were eliminated. 







personal fortune allowed him to overcome many of the 
handicaps—such as ballot access laws, small advertis- 
ing budgets, and dismissive news reporting—faced by 
most third party candidacies (Rosenstone, Behr, and 
Lazarus 1996). In short, Perot’s showing resulted pri- 
marily from factors other than party decline. 


WHAT CAUSES MASS OPINION TO 
CHANGE? 


Mass opinion in the aggregate tends to move glacially 
if at all (Page and Shapiro 1992). When it does move, 
it usually responds to changes in the information 
environment provided by elites. Although the authors 
of The American Voter partially blamed cognitive limita- 
tions for Americans’ lack of ideological sophistication, 
they also recognized the importance of elite-level cues: 
“There are periods in which the heat of partisan debate 
slackens and becomes almost perfunctory, and the 
positions of the parties become relatively indistinct on 
basic issues. In times such as these, even the person 
sensitive to a range of political philosophies may not 
feel this knowledge to be helpful in an evaluation of 
current politics” (Campbell et al. 1960, 256). V.O. 
Key’s (1966) echo chamber analogy further suggests 
that elite behavior will set the terms by which the 
masses think about politics (see also Nie, Verba, and 
Petrocik 1979; Page 1978). If politicians provide party- 
oriented or issue-oriented cues, then the public will 
respond in a party-centric or issue-centric manner. 
They are unlikely to do so without such cues. 

Indeed, the most sophisticated recent theories of 
public opinion place elite behavior at the center of 
individual opinion change (Brody 1991; Carmines and 
Stimson 1989; Zaller 1992). For example, Carmines 
and Stimson (1989) identify changes in the behavior of 
Republican and Democratic elites as the engine for an 
issue evolution on race in the 1960s. Similarly, Brody 
(1991) argues that we can best understand presidential 
approval by observing elite behavior. He maintains that 
elite consensus generally predicts higher approval rat- 
ings, and elite division usually means lower approval 
(see also Mermin 1999). 

FIGURE 3. Percentage of Voters Casting a Straight Ticket for President and House, 1952-96 

Zaller (1992, 311), who develops the connection 
between elite behavior and mass opinion most com- 
pletely, concludes that even those most attentive to 
politics “respond to new issues mainly on the basis of 
the partisanship and ideology of the elite sources in the 
messages.” If people are exposed to a heavily partisan 
stream of information, which will be more likely if 
elites are behaving in a partisan manner, then it follows 
that respondents will express opinions that reflect the 
heavily partisan stream. Because greater ideological 
differences between the parties on the elite level should 
produce a more partisan information stream, elite 
polarization should produce a more partisan mass 
response. 


PARTY RESURGENCE ON THE ELITE 
LEVEL 


One measure that taps changing elite behavior is Poole 
and Rosenthal’s DW-NOMINATE scores for mem- 
bers of Congress.5 These scores allow for both be- 
tween-member and between-year comparisons. An in- 
creasing ideological distance between Democratic and 
Republican elites should produce a more partisan 


5 DW-NOMINATE scores are the most commonly used estimate of 
the ideological position of members of Congress. Members’ ideal 
points are derived using a dynamic, weighted, nominal three-step 
estimation procedure based on all nonunanimous roll call votes 
taken in each Congress (see Poole and Rosenthal 1997 for details). 
information environment for ordinary Americans, es- 
pecially in view of the media’s well-known bias toward 
framing politics in terms of conflict (Graber 1997, chap. 
4). 

To measure party polarization in the House, I di- 
vided members by party, calculated the mean DW- 
NOMINATE score on each dimension for each cau- 
cus, and calculated the weighted Euclidean distance 
between them.6 Figure 4 tracks the distance between 
the House Republican and Democratic caucuses from 
the 81st Congress, which began a few days into 1949, to 
the 104th Congress, which ended a few days into 1997. 
Polarization declined steadily from the late 1940s into 
the late 1960s and remained relatively constant until 
the late 1970s, a trough that coincides with the decline 
of party in the electorate. Congressional behavior then 
changed in the late 1970s. With the 95th Congress, 
ideological polarization between the parties began a 
steady rise.7 
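
The distance calculation described above can be sketched as follows. This is a hedged illustration rather than the procedure actually used: the function names are hypothetical, each member is assumed to be represented by a (first dimension, second dimension) pair, and I assume the weight multiplies the second-dimension difference before it is squared. The weight is left as a required argument instead of being hard-coded.

```python
from math import sqrt
from statistics import mean


def caucus_mean(scores):
    """Mean position of a caucus on each of the two dimensions."""
    members = list(scores)
    return (mean(m[0] for m in members), mean(m[1] for m in members))


def weighted_distance(dem_scores, rep_scores, dim2_weight: float) -> float:
    """Weighted Euclidean distance between the two caucus means.

    dim2_weight scales the second-dimension difference; its value is an
    input here, not something this sketch asserts.
    """
    d1, d2 = caucus_mean(dem_scores)
    r1, r2 = caucus_mean(rep_scores)
    return sqrt((r1 - d1) ** 2 + (dim2_weight * (r2 - d2)) ** 2)


# Made-up scores for two members per caucus, for illustration only.
democrats = [(-0.3, 0.1), (-0.5, -0.1)]
republicans = [(0.2, 0.4), (0.4, 0.4)]
print(round(weighted_distance(democrats, republicans, dim2_weight=0.5), 3))  # 0.728
```

Repeating the calculation for each Congress would yield a series of the kind tracked in Figure 4.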

These changes in congressional behavior correspond 
closely with, but slightly precede, the increases in 
measures of mass partisanship described above. For 
example, half the growth in elite polarization occurred 


6 Because Poole and Rosenthal compute the coordinates with a 
weighted utility model, any use of the DW-NOMINATE scores to 
calculate a distance requires that the second dimension be weighted 
by 3 (see Poole and Rosenthal 1997). Scores for the Senate, 
although less polarized, follow much the same pattern as those for 
the House (Poole 1998). 

7 Scholars suggest several alternatives to the measure of polarization 
used here. For example, Aldrich, Berger, and Rohde (1999) employ 
a number of measures of both polarization and homogeneity, 
including the median distance between the parties, the intraparty 
homogeneity along the NOMINATE score’s first dimension, and the 
proportion of members of one party who ideologically overlap the 
other party on the first dimension. It is worth noting that the mean 
Euclidean distance measure that I employ here is correlated with 
these three measures at .99, .99, and .97, respectively, when I use data 
from the 85th to the 103d Congress. 


American Political Science Review 


Vol. 95, No. 3 




FIGURE 4. Mean Euclidean Distance between Republican and Democratic Party DW-NOMINATE 

between the last Congress in the Carter administration 
and the first Congress in the second Reagan adminis- 
tration. The start of the substantial increases in the 
total number of likes and dislikes and the trend toward 
straight ticket voting took hold in the election cycles 
that followed. On the heels of the second spike in elite 
polarization, which occurred during the second Con- 
gress of the Bush administration (the 102d), all mea- 
sures of party strength responded in kind, with the rise 
in the percentage of positive-negatives the most dra- 
matic. Elite polarization, therefore, appears to be a 
potential engine for change at the mass level. 


INCREASED CLARITY OF PARTY IMAGES 
AS AN INTERMEDIATE STEP 


I place elite polarization at the heart of the explanation 
for party resurgence and hypothesize a set of causal 
dynamics between elites and ordinary Americans sim- 
ilar to those posited and demonstrated by Carmines 
and Stimson (1989, 160) regarding racial issue posi- 
tions. More partisan elite behavior caused by polariza- 
tion should clarify party positions for the public, which 
in turn should influence the importance and salience of 
parties.8 

One way to test whether clarity has increased is 
simply to ask people whether they see important 
differences between what the parties represent. The 
NES does so, and the solid line in Figure 5 tracks this 
trend. From 1960 to 1976, the percentage who per- 


8 In exploring the influence of party activists on party ideologies, 
Aldrich (1995, chap. 6) employed variations on several of these 
measures of clarity and reached results consistent with mine. He does 
not suggest, however, that greater clarity reinvigorated partisanship 
at the mass level. 





ceived important differences ranged from the high 40s 
to the middle 50s. A marked upturn began in 1980, and 
58% or more have seen important differences every 
year since. The percentage reached 63% in 1996, the 
highest level in the series.9 

The “important differences” responses suggest 
greater clarity has occurred but not why. Since the 
polarization in Congress has been ideological, ideolog- 
ical differences are a likely reason. One measure of this 
is whether the public can array the parties correctly on 
a liberal-conservative scale. If ideological clarity has 
increased, then people should be both better able to 
place the Democrats to the left of the Republicans and 
more likely to perceive a larger distance between them. 

The broken line in Figure 5 demonstrates that 
people in the 1990s are better able to array the parties 
ideologically. From 1984 until 1990, only about 50% of 
the public did so correctly, but this figure reached 63% 
in 1996.10 In addition to arraying the parties correctly, 
respondents perceive a widening ideological gulf be- 
tween them. According to data from the NES Cumulative 
File (Sapiro et al. 1997), the mean signed ideo- 
logical distance between the parties rose from 1.52 
points in 1984 to 1.94 points in 1992 and 1996, an 
increase of 28%.11 I use the signed rather than absolute 
ideological distance because elite polarization should 
also help people array the parties correctly. The use of 
absolute distance would make equivalent the place- 
ment of Democrats one unit to the left or to the right 
of Republicans, which would obscure the increasing 
proportion of correct placements. 


between 1980 and 1984. 

FIGURE 5. Percentage of Respondents Who Perceive Important Differences between the Parties 
and Correctly Place the Parties Ideologically, 1960-96 

Similar to the pattern revealed by the indicators of 
party resurgence, the increases in party clarity occurred 
soon after increases in elite polarization. Apparently, 
as party elites began to clarify ideological cues, citizens 
became less inclined to see the parties as Tweedledee 
and Tweedledum. When people perceive that who wins 
and loses will lead to distinct futures, they should 
develop more partisan feelings and become more 
inclined to organize politics in partisan terms. 


WHY PARTY CLARITY HAS INCREASED 


As parties in Congress have become more polarized 
along party lines, people have become more inclined to 


11 To compare data from 1996 with other years in the NES Cumu- 
lative File, the NES provides a weight to correct for a too highly 
educated 1996 sample. I do this for all other descriptive analyses, but 
I use the unweighted data for perceived ideological distance because 


without weighting the data for perceived ideological distance, the 
mean for education in 1984 (2.52) is still too high relative to that of 
1996 (2.54), which likely accounts for the leveling off of perceived 
ideological distance between 1992 and 1996. 
see important differences between the parties, place 
them correctly in an ideological space, and perceive a 
wider ideological distance between them. I will test 
whether there is a causal connection between elite 
polarization and these mass responses. 

I employ a pooled cross-sectional design, using data 
gathered by the NES in both presidential and off-year 
elections between 1960 and 1996. This design allows 
me to merge contextual information over time, namely, 
the aggregate measures of ideological polarization in 
the House, with the survey data. Due to data limita- 
tions, I am often confined to the seven surveys taken 
between 1984 and 1996. The results are consistent, and 
often stronger, when I include appropriate dummy 
variables to confront these data limitations and extend 
the analysis back farther. 

The first dependent variable is whether a respondent 
Sees Important Differences between what the parties 
represent. It is coded 1 if the respondent claims to see 
important differences, 0 otherwise. The second depen- 
dent variable is Correct Ideological View of the Parties, 
which is coded 1 if the respondent places the Demo- 
cratic Party to the left of the Republican Party, 0 
otherwise.12 The third dependent variable is Perceived 
Ideological Distance between the Parties. It is measured 
as the signed difference between where respondents 
place the Republican and Democratic parties on the 
NES’s seven-point liberal-conservative scale. 
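
The coding of the three dependent variables can be sketched as below. The function names and inputs are hypothetical. I assume party placements are integers on the seven-point scale running from 1 (most liberal) to 7 (most conservative), with `None` standing for a missing placement, and that the signed distance is the Republican placement minus the Democratic placement.

```python
from typing import Optional


def sees_important_differences(claims_differences: bool) -> int:
    """1 if the respondent claims to see important differences, 0 otherwise."""
    return int(claims_differences)


def correct_ideological_view(dem_place: Optional[int],
                             rep_place: Optional[int]) -> int:
    """1 if the Democratic Party is placed to the left of the Republican Party.

    A missing placement is coded 0 in this sketch.
    """
    if dem_place is None or rep_place is None:
        return 0
    return int(dem_place < rep_place)


def perceived_ideological_distance(dem_place: int, rep_place: int) -> int:
    """Signed distance: Republican placement minus Democratic placement."""
    return rep_place - dem_place
```

With these definitions, a respondent who places the Democrats at 3 and the Republicans at 5 receives a distance of 2, whereas one who reverses the placements receives -2. An absolute measure would treat the two respondents as identical.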

These three measures should be a function of a 
number of different attitudinal and contextual vari- 
ables. Most important for my purposes is Elite Polar- 
ization. I tap this as the mean Euclidean distance in the 
DW-NOMINATE scores between the Democratic and 


12 To conserve cases, I include in the analysis both those who placed 
themselves ideologically and those who did not. This means that all 
those who failed to place themselves and thus were not asked to 
place the parties are coded 0. 
Republican House caucuses, which I lag by one Con- 
gress for two reasons. First, a Congress officially ends 
after most postelection surveys have been completed, 
so using a contemporaneous term would suggest that, 
for example, congressional behavior in 1993 affects 
1992 attitudes, which makes no temporal sense. Sec- 
ond, time elapses before the public perceives changes 
in elite behavior. Public opinion on race, for instance, 
did not react immediately to the parties’ change in 
position (Carmines and Stimson 1989). In merging the 
contextual with the individual-level data, I give each 
1996 respondent the mean Euclidean distance from the 
1993-95 session of Congress, each 1994 respondent the 
mean Euclidean distance from the 1991-93 session, 
and so forth.13 
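
The one-Congress lag and the merge of contextual with individual-level data can be sketched as follows. The helper names and the record layout are hypothetical. The numbering rule relies only on the fact that each Congress spans two years and that the 1st Congress convened in 1789; the two polarization values are the House distances reported in the text for the 97th and 103d Congresses.

```python
FIRST_CONGRESS_START = 1789  # the 1st Congress convened in 1789


def congress_in_session(year: int) -> int:
    """Number of the Congress sitting for most of the given calendar year."""
    return (year - FIRST_CONGRESS_START) // 2 + 1


def lagged_congress(survey_year: int) -> int:
    """Congress immediately preceding the one in session in the survey year."""
    return congress_in_session(survey_year) - 1


def attach_polarization(respondents, polarization):
    """Return copies of respondent records with lagged elite polarization added."""
    merged = []
    for record in respondents:
        congress = lagged_congress(record["year"])
        merged.append({**record, "elite_polarization": polarization.get(congress)})
    return merged


# Mean Euclidean distances reported in the text for two Congresses.
polarization_by_congress = {97: 0.530, 103: 0.690}

survey = [{"id": 1, "year": 1984}, {"id": 2, "year": 1996}]
for row in attach_polarization(survey, polarization_by_congress):
    print(row["year"], lagged_congress(row["year"]), row["elite_polarization"])
```

Under this rule a 1996 respondent is matched to the 103d Congress (1993-95) and a 1994 respondent to the 102d (1991-93), which is the pairing described above.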

A number of attitudinal measures also may affect the 
dependent variables, so they are added as controls. 
Strength of Ideology and Strength of Partisanship should 
play important roles. Those who place themselves near 
the poles of the seven-point scales demonstrate an 
understanding of ideology and partisanship and hence 
should be more inclined to see differences than those 
who place themselves at mid-scale. In addition, several 
social characteristics are relevant. Those with more 
Education will be less inclined to provide mid-scale 
responses than those with less (Delli Carpini and 
Keeter 1996), which increases both the probability that 
they will see differences between the parties and the 
extent of distance they see. Age should have a similar 
effect; older respondents, who have more political 
experience than younger ones, should be more inclined 
to see important differences, array the parties correctly, 
and see a wider gulf between them. In contrast, Women 
and African Americans exhibit less political expertise 
than males and whites, respectively (see, e.g., Mondak 
1999), so they should be less likely to see party 
differences. 

I also must account for contextual factors. People 
pay less attention and vote less in Off-Year Elections, so 
they should tend to see the parties as less distinct in 
nonpresidential years. Years characterized by Divided 
Government may make a difference. On the one hand, 
both parties have a prominent voice in government, 
which may increase people’s ability to identify the 
parties ideologically. On the other hand, people may 
have a harder time deciding whether a president of one 
party or a Congress of the other is driving the ideological 
direction of the country. Indeed, in 1990, fewer 
than half of Americans could even identify which party 
controlled the House, despite the fact that the Demo- 
crats had done so for nearly 40 years, so two voices may 
only serve to confuse citizens. 


13 I specify a model in which causation runs from the elite to mass 
level, but Rohde (1991) suggests the reverse. Our goals differ. Rohde 
explores changes in the direction of white southerners’ party identi- 
fication, whereas my concern is the strength of partisan attitudes. It 


and Stimson 1989). 

14 I would have liked to include an objective measure of political 
Knowledge, but the NES did not provide a consistent battery of 
knowledge items until 1988. Scholars often use formal education, 
which I include in the model, as a proxy for political knowledge (e.g., 
Sniderman, Brody, and Tetlock 1991). 

In sum, I estimate the following models to analyze 
the influence of elite polarization on three measures of 
mass-level clarity about the parties. 


Pr(sees important differences) 
= f (elite polarization, strength of partisanship, 
education, age, black, female, off-year 
election, divided government).15 (1) 


Pr(correctly places the parties ideologically) 
= f (elite polarization, strength of ideology, 
strength of partisanship, education, age, 
black, female, off-year election, 
divided government). (2) 


Perceived Ideological Distance 
= f (elite polarization, strength of ideology, 
strength of partisanship, education, age, 
black, female, off-year election, divided 
government). (3) 


The first two dependent variables are binary, so ordi- 
nary least-squares (OLS) estimates will be biased. 
Hence, I use logistic regression to estimate these 
models and use OLS to estimate the third. 

The results in the first column of Table 1 suggest that 
elite polarization has a significant effect on whether 
people see important differences between the parties.16 
In fact, all variables perform as expected, except for 
age, which is insignificant, and race, which is positive. 
That the Democrats have been much friendlier to the 
interests of African Americans appears to matter to 
this group in identifying important differences. 

Achieving statistical significance in a sample of 
19,000 is no great feat. More important, the effect of 
elite polarization is substantively important as well. If I 
account for the 1960 context of divided government in 
a presidential year and set the other variables to their 
1960 mean values, the predicted probability of seeing a 
difference between the parties is .512, which is almost 
identical to the 50.9% of respondents who reported 
seeing a difference in 1960. If I hold all variables 
constant at their 1960 means, again account for divided 
government and election context, but increase elite 
polarization to its 1996 level, the predicted probability 
of seeing important differences rises to .611, an increase 
of .099, and is not much different from the 
63.0% who reported seeing a difference in 1996. 


15 Because the NES only started to ask people to place themselves 
ideologically in 1972, I drop strength of ideology from the important 
differences equation, so I can include data from 1960, 1964, and 1968. 
Dropping strength of ideology should not affect the results unduly, 
given that the partial correlation between elite polarization and the 
percentage who see important differences between the parties is an 
extremely robust .87, controlling for off-year election years. 

16 Because my measure of elite polarization is not independent from 
year to year, there may be some concern about autocorrelation. 
revealed no such problems. For instance, the 
Durbin-Watson statistics for the OLS models presented below are 
1.99 and 1.97, respectively, which indicates not even a hint of 
autocorrelation. 


Number of cases 19,206 14,109 

Source: American National Election Studies, Cumulative File, 1948-96. 
*p < .05, **p < .01, ***p < .001; one-tailed tests. 

Among the other independent variables, between 
1960 and 1996 only education changed such that it 
would increase the probability of seeing important 
differences between the parties. In a simulation similar 
to the one above, holding elite polarization and all 
other variables at their 1960 means, and accounting 
only for the increase in education, the predicted prob- 
ability of seeing important differences rises by .058. 
Although both factors are important, increased elite 
polarization is better than 50% more important than 
increased education in explaining change over time. 
The results in the second column of Table 1 suggest 
that elite polarization has clarified mass perceptions of 
the parties’ ideological differences.17 Between the 97th 
17 The same pattern of results was obtained when I reestimated the 


models by successively dropping each year from the which 
suggests that the results are not a function of a single observation. In 




Ideological Clarity and Measures of Party Strength as a Function of 
Attitudes, Social Characteristics, and Contextual Factors 


984-06 1 Full Model Reduced Form Full Model Reduced Form 
Param. Param. Est. Param. Est. Param. Est Param. Est Param. Est. Param. Est. 
Variable (Std. Err.) (Std. Err.) (Std. Err.) (Std. Err.) (Std. Err.) (Std. Err.) (Std. Err.) 
Elite polarization 1.707" 2.065" 2.017" 3.103°* 3.087** 2.144" 2.918" 
(0.189) (0.444) (0.491) (0.488) (0.434) (0.8165) (0.722) 
Perceived Ideological — = 0.136% = 0.280 Es 
(0.009) 0.015) 
Education 0.383" 0.803"* 0.626"" 0.206% 0.266"" 0.g928"" 1 101°" 
(0.016) (0.022) (0.023) (0.023) (0.021) (0.037) (0.031) 
Strength of partisanship 0.469" 0.2868 0.135 0.606" 0.648*** 0.466*"* 0.601*** 
(0.016) (0.020) (0.024) (0.024) (0.022) (0.037) (0.031) 
Strength of ideology — 0.610 0.461** 0.336" 0.374** 0.475*** 0.683** 
(0.023) (0.025) (0.024) (0.022) (0.039) (0.035) 
Race (African American) 0.186" —0.458"** —0.568"" —0.125 —0.112* —0.225* —0.309"" 
(0.049) (0.058) (0.069) (0.087) (0.059) (0.110) (0.082) 
Age —0.006 0.052 0.0886 —0.035" —0 016 0.166 0.174 
(0.009) (0.011) (0.013) (0.013) (0.011) (0.021) (0.017) 
Sex (female) —0.305"" —0.310°" —0.074 —0.036 — 0.067 —0.803** —0.862** 
(0.031) (0.039) (0.044) (0.042) (0.039) (0.069) (0.080) 
Divided government —-0.138"" —0.069 —0.200° —0.194" —-0.203" ~0.334"* —0.351"* 
(0.035) (0.071) (0.078) (0.075) (0.069) (0.126) (0.111) 


2.321 = = 
11,304 
and 103d Congress, the mean Euclidean distance be- 
tween party members’ DW-NOMINATE scores in the 
House grew from .530 to .690. When I increased elite 
polarization by this amount, set divided government to 
one, set off-year election to zero, and held all other 
variables constant at their 1984 means, the predicted 
probability that a respondent would correctly place the 
Democratic Party to the left of the Republican Party 
rose from .540 to .620. The increase of 8 percentage 
points caused by elite polarization between 1984 and 
1996 likely accounts for a large portion of the 9.6 
percentage point change that actually occurred. 
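
The arithmetic behind this simulation can be checked with a short sketch. Because only elite polarization changes while everything else is held fixed, the new probability follows from shifting the baseline log-odds by the coefficient times the change in polarization. The function names are hypothetical; the coefficient (2.065) is the elite polarization estimate in the second column of Table 1, and the other inputs are the values given in the paragraph above.

```python
from math import exp, log


def logit(p: float) -> float:
    """Log-odds of a probability."""
    return log(p / (1 - p))


def inv_logit(x: float) -> float:
    """Probability implied by a value on the log-odds scale."""
    return 1 / (1 + exp(-x))


def shifted_probability(baseline_p: float, coefficient: float,
                        delta_x: float) -> float:
    """Predicted probability after changing one regressor by delta_x,
    with all other terms in the logit equation held constant."""
    return inv_logit(logit(baseline_p) + coefficient * delta_x)


new_p = shifted_probability(baseline_p=0.540,
                            coefficient=2.065,
                            delta_x=0.690 - 0.530)
print(round(new_p, 3))  # 0.62
```

The result matches the reported rise from .540 to .620, an increase of about 8 percentage points.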


addition, the results are not time bound. When I estimated a model 
using data back to 1972, the first year the ideology questions were 
asked, and included a dummy variable for pre-1984 cases to account 
for the differing response rates to the ideology questions, the effect of 
elite polarization remained statistically significant (B = 1.329, p < 
.001) and was substantively even larger, taking into account elite 
polarization’s greater range over the longer period. 
Across their ranges, variables such as education, 
strength of ideology, and strength of partisanship all 
have larger effects than elite polarization, but none 
increased by as much as 5% between 1984 and 1996. 
Indeed, only the increases in education and strength of 
ideology were statistically significant. When I per- 
formed parallel simulations for these two variables, I 
found that the predicted probability of arraying the 
parties correctly rose by a paltry .018 and .011 points, 
respectively. 

The same pattern of results emerges in explaining 
perceived ideological distance between the parties, and 
the effect of elite polarization is again substantively 
important.18 These results appear in the third column 
of Table 1. Multiplying the parameter estimate by the 
.16-point increase between 1984 and 1996 produces an 
increase of .323 in perceived ideological distance. The 
dependent variable increased by .4 between 1984 and 
1996, and greater elite polarization accounts for about 
80% of the change, other things being equal. 
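
Because the third equation is linear, the contribution of elite polarization is simply the parameter estimate multiplied by the change in the variable. A minimal sketch, using the estimate from the third column of Table 1 (2.017) and the figures quoted above; the function name is hypothetical.

```python
def ols_contribution(coefficient: float, delta_x: float) -> float:
    """Change in the dependent variable implied by a change in one regressor."""
    return coefficient * delta_x


polarization_effect = ols_contribution(2.017, 0.690 - 0.530)
observed_change = 0.4  # increase in perceived ideological distance, 1984-96

print(round(polarization_effect, 3))                    # 0.323
print(round(polarization_effect / observed_change, 2))  # 0.81
```

The ratio of roughly .8 is the basis for the statement that greater elite polarization accounts for about 80% of the change.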

For the third equation, the attitudinal variables and 
social characteristics performed as expected. It is im- 
portant to note, however, that only education and 
strength of ideology rose significantly between 1984 
and 1996. Multiplying their respective parameter esti- 
mates by their differences in means provides their 
contribution to the increase of .4. These calculations 
yield increases of .094 and .059, respectively. Both 
effects pale in comparison to that of elite polarization. 


EXPLAINING PARTY RESURGENCE 


As Carmines and Stimson (1989) would predict, the 
results thus far suggest that elite polarization has 
clarified public perceptions of the parties’ ideological 
positions. What difference does greater clarity make? 
Some suggest that perceptions of polarized parties may 
cause dissatisfaction (e.g., Dionne 1991; Fiorina 1996; 
Hibbing and Theiss-Morse 1995; King 1997), but I 
contend that greater ideological clarity should invigo- 
rate partisan attitudes. When people believe that par- 
ties provide choices not echoes (Key 1966; Nie, Verba, 
and Petrocik 1979; Page 1978), they ought to be more 
concerned about who dictates public policy. As people 
come to realize that Democrats and Republicans will 
pursue substantially different courses, attachment to 
one side or the other becomes more consequential, and 
party image becomes more salient (see also Carmines 
and Stimson 1989). 

To test the influence of elite polarization on party 
affect and salience, I estimated models for two mea- 
sures of party resurgence: whether someone is a Posi- 
tive-Negative using the party feeling thermometers and 
the Total Number of Party Likes and Dislikes provided 
by a respondent. I used the same right-hand side 
variables as above for the same reasons, but I added 


18 When I estimated a model using data back to 1972 and included a 
dummy variable for pre-1984 cases, the effect of elite polarization 
remained statistically significant (B = 2.035, p < .001) and was 
substantively much larger than that estimated from the 1984 and 
later cases. 


the third measure of party clarity, Perceived Ideological 
Distance, to the right-hand side as well. If people see 
sharper distinctions between the parties, parties should 
be more important and salient to them. ‘This specifica- 
tion allows elite polarization to have both a direct effect 
on party affect and an indirect effect through perceived 
ideological distance. 
In functional form, the models are as follows. 


Pr(Respondent is a Positive-Negative) 
= f (elite polarization, strength of ideology, 
strength of partisanship, education, age, 
black, female, off-year election, divided 
government, perceived ideological distance). (4) 


Total Number of Party Likes and Dislikes 
= f (elite polarization, strength of ideology, 
strength of partisanship, education, age, 
black, female, off-year election, divided 
government, perceived ideological distance). (5) 


Again, the first dependent variable is binary, which 
prompts the use of logistic regression. The second 
dependent variable is an interval scale, which allows 
the use of OLS. 

The results appear in the fourth and sixth columns of 
Table 1. Elite polarization is again positively signed 
and statistically significant for both dependent vari- 
ables. Its effect, moreover, is substantial. When I 
increased elite polarization from its 1984 to its 1996 
level, accounted for divided government and a presi- 
dential election year, and held all other variables 
constant at their 1984 mean values, the predicted 
probability that a respondent will be a positive-negative 
rose from .325 to .442. In addition, elite polarization 
has an indirect effect through perceived ideological 
distance. Recall that elite polarization expands the
distance between the parties by .323. When I increased
perceived ideological distance by this amount above its 
1984 mean, the probability of giving responses catego- 
rized as positive-negative rose by another .010, bringing 
the total effect of elite polarization to .127, ceteris 
paribus. Again, the effect of more education and 
stronger ideology did not approach that of growing 
elite polarization. 
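
The decomposition in this paragraph can be retraced with a few lines of arithmetic. The sketch below is illustrative only and is not part of the original analysis: the variable names are assumptions, and the log-odds shift is a derived quantity that the article does not report. It uses only the probabilities quoted in the text.

```python
import math

# Predicted probabilities quoted in the text for the positive-negative model.
p_1984 = 0.325    # all variables at their 1984 means
p_1996 = 0.442    # elite polarization and election context at 1996 values
indirect = 0.010  # additional rise via perceived ideological distance


def log_odds(p: float) -> float:
    """Logit transform: the scale on which a logistic regression is linear."""
    return math.log(p / (1.0 - p))


direct = p_1996 - p_1984  # direct effect on the probability scale
total = direct + indirect  # direct plus indirect effect
shift_in_log_odds = log_odds(p_1996) - log_odds(p_1984)

print(f"direct effect on probability: {direct:.3f}")
print(f"total effect on probability:  {total:.3f}")
print(f"implied shift in log-odds:    {shift_in_log_odds:.3f}")
```

The direct and total effects reproduce the .117 and .127 implied by the text. The log-odds line only illustrates why probability changes from a logit model depend on the baseline at which they are evaluated.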

The same pattern of results emerges for the total 
number of likes and dislikes mentioned about the 
parties, as shown in the sixth column of Table 1. The 
parameter estimate of 2.144 for elite polarization sug- 
gests that its increase of .16 between 1984 and 1996 
caused an estimated increase of .343 points in the 
number of likes and dislikes mentioned. Accounting 
for the rise of the .323 points in perceived ideological 
distance caused by elite polarization adds another .090 
points to the dependent variable. Thus, the total effect 
of elite polarization is .433, or more than 60% of the 
increase of .72 in the dependent variable. In compari- 
son, the total effects caused by a rise in education and 
strength of ideology between 1984 and 1996 are each 
less than one-quarter of that of elite polarization. 
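
The OLS decomposition above is simple enough to check by hand. The following sketch, with variable names that are assumptions rather than the author's, multiplies the parameter estimate by the change in elite polarization and adds the reported indirect effect.

```python
# Figures quoted in the text for the likes/dislikes model.
b_elite = 2.144       # parameter estimate for elite polarization
delta_elite = 0.16    # change in elite polarization, 1984 to 1996
indirect = 0.090      # reported effect via perceived ideological distance
observed_rise = 0.72  # observed increase in the dependent variable

direct = b_elite * delta_elite  # direct effect of elite polarization
total = direct + indirect       # direct plus indirect effect
share = total / observed_rise   # fraction of the observed increase

print(f"direct effect: {direct:.3f}")
print(f"total effect:  {total:.3f}")
print(f"share of observed increase: {share:.1%}")
```

Under these figures the total effect is .433, about 60% of the observed rise, matching the text.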

A large percentage of respondents do not answer the 
ideological self-placement question and, as a result, are 
not asked to place the parties, so including perceived
ideological distance on the right-hand side creates a
good deal of missing data. This may undermine confi- 
dence in the results, especially since those missing tend 
to be less sophisticated than those who provide re- 
sponses. I therefore estimated reduced forms for equa- 
tions 4 and 5, dropping perceived ideological distance 
from the models. Its effect should be captured by elite
polarization, which allows the recapture of several
thousand lost cases (Markus 1988). The results in the
fifth and seventh columns of Table 1 demonstrate that 
elite polarization remains significant with or without 
the missing data.19 Taken together, these results sug- 
gest that increasingly strong partisan orientations on 
the mass level are a function of growing ideological 
polarization on the elite level. 


RECEPTION OF ELITE POLARIZATION 


Although the results thus far provide strong evidence 
that elite ideological polarization has produced a more 
partisan electorate, an even more rigorous test is to 
account for people’s differing ability to absorb this 
information. A more ideologically polarized House 
should produce a more ideologically polarized issue 
environment, but those with more political expertise 
should reflect it better than those with less. 

Many suggest that a measure of objective political 
knowledge is the best indicator of political expertise 
(e.g., Delli Carpini and Keeter 1996; Price and Zaller 
1993; Zaller 1992). Unfortunately, the NES only began 
asking a detailed battery of factual questions in 1988. 
The use of education as a proxy for knowledge is not 
ideal (see Luskin 1987), but many have done so (see,
e.g., Popkin 1994; Sniderman, Brody, and Tetlock
1991). Indeed, people with more education should, on 
average, have better developed cognitive tools, which 
should allow them to absorb more political informa- 
tion. If a knowledge battery were available over a 
sufficiently long period, I would expect the results to be 
even stronger than those presented below.20

I replicated each of the full models from Table 1 and 
introduced an interaction between elite polarization 
and education. Education is coded 1 for those who 
completed grade school, 2 for those who attended or 
graduated from high school, 3 for those who attended 
college, and 4 for college graduates and those with 
graduate degrees. The interaction should carry a positive
sign, which indicates that those who can best use
the information generated by a more polarized environment
are the most inclined to see differences between
the parties and provide more partisan opinions.

19 There are roughly 2,200 more cases for the positive-negative
equation than for the likes/dislikes equation. This is largely because
only half samples were asked the likes/dislikes questions in 1986,
1990, and 1996. In addition, I estimated an equation using data back
to 1952, dropping both ideological polarization and strength of
ideology. Elite polarization remained significant (B = 2.180, p <
.001), which suggests that the results are not time bound.

20 The NES has asked one factual item in almost every survey since
1960: which party controlled the House of Representatives prior to
the election. When I specified an interaction between whether the
respondent answered this question correctly and elite polarization,
the same basic pattern of results emerged. Because this is a single
[illegible] n.d.), I opted to use education instead.

Since the effects of each variable not included in the 
interactions are almost identical to those presented in 
Table 1, I include only the estimates for the interaction 
and its component parts in Table 2. In all five cases, the 
interaction is properly signed and statistically signifi- 
cant. To interpret the interaction, I calculated the total 
effect of elite polarization for the perceived ideological 
distance and the total number of likes and dislikes. 
Although the pattern is the same for the three logit 
models, the interpretation of the nonadditive OLS 
models is more straightforward. The total effect of elite 
polarization is derived as follows: 


$E_{\text{Elite Polarization}} = \beta_1 + \beta_3(\text{Education}_i)$, (6)

where $E_{\text{Elite Polarization}}$ is the total effect of elite polarization,
and $\text{Education}_i$ is the $i$th respondent's level of
education. In the perceived ideological distance equation,
the estimate for $\beta_1$ is $-1.209$ and for $\beta_3$ is $1.271$,
and in the likes/dislikes equation, $\beta_1$ is $-4.209$ and $\beta_3$
is $2.504$.

Given these estimated effects for perceived ideolog- 
ical distance, elite polarization has no effect (E = .062) 
on those who completed grade school (education = 1), 
and its effect for those with a high school education 
(education = 2) is a relatively small 1.333. For those 
with at least some college (education = 3) and college 
graduates (education = 4), however, the effect is 
substantial: 2.604 and 3.875, respectively. The results 
for the total number of likes and dislikes are similar. 
That is, elite polarization has no effect on people with 
less cognitive training, but among college attendees 
and graduates, the effect is a hefty 3.303 and 5.807,
respectively.
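
The conditional effects quoted above follow directly from equation 6. As a hedged illustration (the function and dictionary names are assumptions, not the author's), the sketch below evaluates the total effect of elite polarization at each education level, using the two pairs of estimates reported in the text.

```python
# (b1, b3) pairs: main effect of elite polarization and its interaction
# with education, as reported in the text for the two OLS models.
MODELS = {
    "perceived ideological distance": (-1.209, 1.271),
    "total likes and dislikes": (-4.209, 2.504),
}

# Education coding described in the text.
EDUCATION_LEVELS = {
    1: "grade school",
    2: "high school",
    3: "some college",
    4: "college graduate",
}


def total_effect(b1: float, b3: float, education: int) -> float:
    """Equation 6: effect of elite polarization at a given education level."""
    return b1 + b3 * education


for model, (b1, b3) in MODELS.items():
    print(model)
    for level, label in EDUCATION_LEVELS.items():
        effect = total_effect(b1, b3, level)
        print(f"  education = {level} ({label}): {effect:.3f}")
```

For perceived ideological distance this reproduces .062, 1.333, 2.604, and 3.875. For likes and dislikes it reproduces 3.303 and 5.807 at the two highest levels. The point estimates it yields for the two lowest levels are not reported in the text and say nothing about statistical significance.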

In sum, these results should increase confidence that 
elite polarization is driving the impressive increase in 
party-centric thinking on the mass level. People with 
the greatest ability to assimilate new information, those 
with more formal education, are most affected by elite 
polarization. These findings are all the more impressive 
in view of the fact that education is not an optimal 
proxy for political knowledge.21


CONCLUSION 


The results presented here suggest that parties in the 
electorate have rebounded significantly since 1980 and 
that the party decline thesis is in need of revision (see 
also Bartels 2000). Although the environment does not 
mirror that of the 1950s, Americans in the 1990s are 
more likely to think about one party positively and one 
negatively, less likely to feel neutral toward either 
party, and better able to list why they like and dislike 
the parties than they were ten to thirty years ago.22
Consistent with most theories of public opinion, these
mass-level changes have resulted from changes in elite
behavior. Greater ideological polarization in Congress
has clarified public perceptions of party ideology,
which has produced a more partisan electorate.

21 The results for the interactive terms were replicated when I
extended the analyses for party placement and perceived ideological
distance back to 1972. The results for the total number of likes and
dislikes also were replicated when I included data back to either
1972 or 1952.

22 Although the results are not presented here due to space
considerations, I found that elite polarization also has increased the
tendency of respondents to vote for presidential and House candidates
of the same party, controlling for strength of partisanship,
strength of ideology, race, age, sex, whether a House seat is open,
whether a House seat is contested, whether a third party presidential
candidate is running, and whether the House incumbent is of the
[illegible] respondent's party identification.

TABLE 2. Replication of Models in Table 1, Adding an Interaction between Elite Polarization and Education

                      Sees          Places Democrats  Perceived     Respondent Is  Total Party
                      Important     to Left of        Ideological   a Positive-    Likes and
Variable              Differences   Republicans       Distance      Negative       Dislikes
Elite polarization     0.068        -0.261            -1.209        -0.094         -4.209*
                      (0.408)       (1.304)           (1.485)       (1.448)        (2.280)
Education              0.061         0.273             0.135        -0.537*        -0.570
                      (0.086)       (0.311)           (0.331)       (0.320)        (0.504)
Elite polarization     0.620***      0.886*            1.271*        1.239          2.504**
  x Education         (0.164)       (0.520)           (0.552)       (0.532)        (0.839)

Note: Entries are parameter estimates with standard errors in parentheses.
*p < .05; **p < .01; ***p < .001; one-tailed tests.

Although I have focused on the strength as opposed
to direction of partisanship, the results of this study
may have implications for the latter as well. In discussing
macropartisanship, some suggest that such short-term
influences as changes in economic conditions and
presidential approval ratings have profound effects on
the distribution of Republicans and Democrats (e.g.,
Erikson, MacKuen, and Stimson 1998; MacKuen, Erikson,
and Stimson 1989), whereas others argue that
their effects are minimal (e.g., Green, Palmquist, and
Schickler 1998). Because strength of partisanship in the
aggregate has fluctuated markedly over the last 50
years, heterogeneity in the time series is likely. When
people hold their partisan ties more intensely, the
probability of party identification change is reduced.
Hence, the effect of short-term forces on macropartisanship
should be smaller when strength of partisanship
is relatively high and larger when it is relatively
low.23

23 Green, Palmquist, and Schickler (2000) note that the proportion of
[illegible] extraordinary increases in consumer confidence and
consistently high presidential approval ratings, which casts further
doubt on the import [illegible]. They argue that this may be the
result of [illegible] estimation decisions made by MacKuen, Erikson,
and Stimson (1989), but it also may have become harder to move
macropartisanship in the 1990s because strength of partisanship has
grown.

The resurgence of party is, of course, good news for
those who trumpet the unique role that parties traditionally
have played in organizing political conflict
(e.g., Schattschneider 1975). Voting theories work best
when people perceive that the parties represent dis- 
tinct ideologies, which allows voters to make rational 
calculations about alternative futures (e.g., Downs 
1957; Hinich and Munger 1994). My study suggests that 
voters are much better able to make such ideological 
distinctions than in the past. On average, partisanship 
allows less sophisticated Americans to connect their 
values and interests with vote choice (Delli Carpini and 
Keeter 1996), so voters should be able to participate 
more effectively as a result. 

The election of 2000 provides further empirical 
evidence of mass party resurgence. The proportion of 
Republican and Democratic voters is nearly equal 
today (Miller 1998), so a national election strongly 
influenced by party should be very close, and American 
elections do not get much closer than the 2000 contest. 
Preliminary data from the 2000 NES suggest that more 
than 90% of both Democratic and Republican identi- 
fiers voted for their party's presidential candidate 
(Burns et al. 2001), and partisans of every stripe were 
significantly more loyal to their party’s standard-bearer 
in 2000 than in either 1992 or 1996 (Pomper 2001, 138), 
which also were two highly partisan elections (Bartels 
2000). In addition, straight ticket voting for president 
and House remained above 80% among major party 
presidential voters (Burns et al. 2001), which produced 
a razor-thin Republican majority in the House that
reflects the closeness of the presidential race. Public 
reaction to the election was also strongly partisan. In 
early December, more than 90% of Republicans criti- 
cized Al Gore’s legal efforts in Florida (Saad 2000), 
despite clear indications that a plurality of Floridians 
intended to vote for him (Brady et al. 2001), and more 
than 80% of Democrats approved of Gore’s legal 
challenge (Saad 2000). 

It is easy to overlook a party resurgence when 
symptoms often associated with weak parties, such as 
third party candidacies and divided government, are 
regular features of the political environment. Such 
phenomena, however, result from other factors in 
addition to weak partisanship. Ross Perot certainly 
benefited from party independence, but his success was
mostly a function of his personal style and fortune 
(Rosenstone, Behr, and Lazarus 1996). To the extent 
that weak parties do advantage third party efforts, 
moreover, it is noteworthy that third parties have 
drawn progressively fewer votes over the last three 
presidential elections, a period when my results suggest 
that partisanship in the electorate has strengthened. 

In addition, strong parties do not automatically 
produce unified government (Fiorina 1992). Although 
the late nineteenth century was America’s most parti- 
san era, divided governments were the norm. Today, 
candidate quality and fundraising play a dominant role 
in understanding which voters split their tickets (Bur- 
den and Kimball 1999). If either party gains a signifi- 
cant identification advantage among regular voters, 
unified government will almost certainly result. Until 
then, we are likely to see close presidential elections 
and small majorities for one or the other party in both 
houses of Congress. 


APPENDIX A. QUESTION WORDING 


Partisanship 


Generally speaking, do you usually think of yourself as a 
Republican, a Democrat, an Independent, or what? (If
Republican or Democrat:) Would you call yourself a strong
(Republican/Democrat) or a not very strong (Republican/ 
Democrat)? (If independent, other, or no preference:) Do 
you think of yourself as closer to the Republican or Demo- 
cratic Party? 


Ideology Questions 


We hear a lot of talk these days about liberals and conservatives.
I'm going to show you (1996: Here is) a seven-point
scale on which the political views that people might hold are 
arranged from extremely liberal to extremely conservative. 
Where would you place yourself on this scale, or haven’t you 
thought much about this? 

Where would you place the Democratic Party? 

Where would you place the Republican Party? 


Sees Important Differences 


Do you think there are any important differences in what the 
Republicans and Democrats stand for? 


Party Feeling Thermometers


I'd like to get your feelings toward some of our political
leaders and other people who are in the news these days
(1990: who have been in the news). I'll read the name of a
person and I'd like you to rate that person using the feeling
thermometer. Ratings between 50 and 100 degrees mean that 
you feel favorably and warm toward the person; ratings 
between 0 and 50 degrees mean that you don’t feel favorably 
toward the person and that you don’t care too much for that 
person. You would rate the person at the 50 degree mark if 
you don’t feel particularly warm or cold toward the person. If 
we come to a person whose name you don’t recognize, you 
don’t need to rate that person. Just tell me and we'll move on 
to the next one. 

The Democratic Party. 

The Republican Party. 
Likes/Dislikes


Is there anything in particular that you like about the
Democratic Party? What is that?
Anything else [you like about the Democratic Party]? Up
to five mentions.
Is there anything in particular that you dislike about the
Democratic Party? What is that?
Anything else [you dislike about the Democratic Party]?
Up to five mentions.
Is there anything in particular that you like about the
Republican Party? What is that?
Anything else [you like about the Republican Party]? Up
to five mentions.
Is there anything in particular that you dislike about the
Republican Party? What is that?
Anything else [you dislike about the Republican Party]?
Up to five mentions.
We attempt to explain when and why democratic states will prevail in international crises. We review
several of the prominent theories about democratic political structures and derive hypotheses from
each framework about crisis outcomes. These hypotheses are tested against the population of 422
international crises between 1918 and 1994. Our findings provide further evidence that the democratic peace
is not a spurious result of common interests. Moreover, we also begin the difficult task of differentiating
among the many theories of the democratic peace. In particular, we find strong evidence that democratic
political structures are important because of their ability to generate domestic audience costs. Our findings
also support the argument that democratic political structures encourage leaders to select international
conflicts that they will win.


A little more than a decade after international
relations embraced the democratic peace as the
“closest thing we have to an empirical law” in
our field (Levy 1989, 88), new research is both challenging
its validity and broadening its ramifications.
The purpose of this work is to expand the canvas of
democratic peace scholarship by testing for evidence of
other behavior unique to democratic states. The bulk
of the literature centers on the absence of war between
democracies. Realist scholars argue that this could be a
result of common international security interests,
whereas democratic peace theorists suggest numerous
different ways in which either democratic political
structures or norms are the cause. We believe that the
lack of military conflict between democracies is a
heavily overdetermined outcome, which creates a problem
for social scientists trying to understand the causal
processes at work. It is difficult to assess the relative
merits of various theoretical models when all of them
seem capable of explaining the democratic peace.

When faced with competing explanations of a single
phenomenon, one should derive additional hypotheses
from each and test those new expectations (Campbell
and Stanley 1966; King, Keohane, and Verba 1994).
That is, to parse the various theoretical perspectives on
the democratic peace, we must extend their logics to
other behaviors. If two theories can explain a common
set of events, but one of them also can explain an
additional set, then it should be judged superior because
of its scope. We contend that the most useful and
powerful theories of the democratic peace should be
able to explain other facets of democratic behavior in
international conflicts. Our analysis shifts the debate to
a new dependent variable: the winners and losers of
international crises.

Moreover, we also begin the difficult task of differentiating
among the many theories of the democratic
peace. We will review several prominent theories about
democratic political structures and derive hypotheses
from each regarding crisis outcomes. These will be
tested against the population of 422 international crises
between 1918 and 1994. Briefly, our findings provide
further evidence that the democratic peace is not a
spurious result of common interests. In particular, we
find strong evidence that democratic political structures
are important because of their ability to generate
domestic audience costs. Our findings also support the
argument that democratic political structures encourage
leaders to select international conflicts that they
will win. We do not find support for the view that the
democratic peace is a result of the military prowess of
democratic states.2

1 Some scholars have begun to address other dependent variables
(Dixon 1994; Gaubatz 1996; Lake 1992; Siverson and Emmons 1991).


THE DEMOCRATIC PEACE PUZZLE: 
FOUR EXPLANATIONS 


The so-called law of the democratic peace is based on 
two empirical findings, the first of which is well estab- 
lished, the second more tentative. The first finding is 
that over the last two centuries democracies have rarely 
engaged in violent or potentially violent conflicts with 
one another (Bremer 1992; Bueno de Mesquita and 
Lalman 1992, 152; Chan 1984; Doyle 1986; Maoz and 
Abdolali 1989; Maoz and Russett 1993; Owen 1994;
Russett 1993; Small and Singer 1976; Weede 1992).
The more tentative claim is that democracies are just as
likely to engage in conflict with nondemocracies as are
other states. Rousseau, Gelpi, Reiter, and Huth (1996)
demonstrate that democracies are only less likely to
initiate force against other democracies once an international
crisis has erupted; otherwise, it remains unclear
whether democracies are more pacific in general
or only toward other democracies. Rousseau et al.
(1996) uncover suggestive evidence that democracies
are generally more pacific, whereas Ray (1995) forcefully
contends that they are generally more peaceful.

2 We were unable to test the prominent normative theories of the
democratic peace because of serious difficulties regarding the
measurement of democratic norms of conflict resolution. This limits the
comprehensiveness of our tests, but that is a problem shared with the
entire literature on democracy and conflict. A number of studies
(e.g., Brecher and Wilkenfeld 1997; Dixon 1994) use democratic
political structures as a proxy measure for democratic norms. We
believe this strategy is inadequate and may be very misleading for at
least two reasons. First, democratic norms and institutions are
distinct concepts, and this distinction is central to some of the
normative explanations (Owen 1994). Second, because most hypotheses
that flow from normative arguments also can be derived from
structural ones, direct and valid measures of democratic norms are
essential for testing these arguments, but the existing quantitative
measures are [illegible] for our purposes. Maoz and Russett
[illegible] than it captures democratic norms. Rather than conduct
tests that are hampered by measurement problems, we restrict our
attention to structural theories of the democratic peace.

Given the prominence of the democratic peace 
notion among international relations scholars, it is not 
surprising that a large number of theories have been 
proposed to explain this result. These theories can be 
categorized into two groups: explanations that focus on 
democratic norms and explanations that focus on dem- 
ocratic political structures. In this work we will com- 
pare three prominent structural explanations and the 
central realist critique of these approaches. 

In collaboration with various coauthors, Bueno de 
Mesquita has developed a prominent structural theory 
of the democratic peace.3 Bueno de Mesquita and
Lalman (1992) contend that democratic political struc- 
tures impose costs on democratic leaders who choose 
to use force in international disputes. More specifically, 
liberal leaders face institutionalized constraints that 
impede their capacity to mobilize the state’s resources 
for war without the consent of a broad spectrum of 
interests. Moreover, these constraints are readily ap- 
parent to other states and cannot be manipulated by 
leaders. Thus, democracies send credible signals to 
other states of an aversion to using force. These signals 
allow democratic states to avoid conflicts with one 
another, but they may attract aggression from nondem- 
ocratic states. Democracies may be pressured to re- 
spond to such aggression—perhaps even preemptive- 
ly—through the use of force. Elaborations of this 
argument (Bueno de Mesquita and Siverson 1995; 
Reiter and Stam 1998) demonstrate that when democ- 
racies choose to initiate conflicts, the domestic costs of 
using force drive them to select conflicts in which they 
are especially likely to prevail. 

A second structural argument, presented by Lake 
(1992), suggests that the democratic peace is a result of 
the power of democratic states. The focus is on the 
rents that political structures extract from a state’s 
wealth. Because democratic political structures con- 
strain elites from extracting excessive rents, democratic 
states have little incentive to engage in territorial 
expansion. In addition, moderate rents increase the 
wealth and economic growth of democratic states, 
which in turn augment their military capacity despite 
weak extractive capacities. Because of this military 
power, democracies prevail in the conflicts into which 
they are drawn. 


3 Other scholars (e.g., Doyle 1986, 1997; Maoz and Russett 1993) 
present similar structural arguments. We focus on the Bueno de 
Mesquita model because it represents the most extensive and most 
formal development. 
A third structural explanation is presented by Fearon
(1994a, 1997), who focuses on audience costs as a tool
in coercive bargaining.4 He argues that international
crises are public contests in which the disputants’ 
performances are evaluated by their domestic audi- 
ences. The institutionalized electoral constraints in 
democracies allow the domestic audience to impose 
large and transparent costs on leaders. Therefore, 
when democratic leaders choose to escalate interna- 
tional crises, their threats are taken as highly credible. 
In disputes between liberal states, the credibility of 
their bargaining signals allows them to negotiate a 
peaceful settlement before mobilization. An undevel- 
oped implication of Fearon’s work is that audience 
costs favor democratic states when facing a nonliberal 
rival. Mobilization for war ties the hands (Schelling 
1960, 1966) of liberal leaders and signals their commit- 
ment to fight because their ability to retain office is in 
jeopardy. 
Realists have challenged the notion of a democratic 
peace, but a number of these critiques have serious 
methodological problems. Spiro (1994), for example, 
relies on a flawed application of probability theory (see 
Russett 1995), and Layne (1994) inappropriately selects
on the dependent variable and draws deterministic
predictions from a probabilistic theory. More
formidable is the critique by Farber and Gowa (1995), 
who assert that the democratic peace is not due to 
regime type but to structural imperatives of the inter- 
national system. In other words, they challenge not the 
empirical phenomenon that democracies rarely fight 
one another but the assertion that democracy is the 
reason. More specifically, they point to the common 
international security interests shared by democratic 
states during the Cold War.5

Democratic peace theorists have responded to Far- 
ber and Gowa (Oneal and Russett 1999; Thompson 
and Tucker 1997a, 1997b), but their work is the most 
significant realist critique of the democratic peace. 
Clearly, democratic states profit from their peaceful 
relations, enjoy economic and technological benefits, 
and reap mutual gains from shared military costs. 
Resolving this spuriousness problem can be compli- 
cated, but perhaps the most straightforward solution is 
to examine the effect of democracy on other conflict 
behaviors when democracies do not share common 
interests. If democracy influences these other behav- 
iors, then the power of Farber and Gowa’s argument is 
substantially undermined. 


EXPANDING THE CANVAS: HYPOTHESES 
ON CRISIS OUTCOMES 


Bueno de Mesquita and colleagues, Lake, Fearon, and 
Farber and Gowa provide plausible explanations of the 
fact that democratic states rarely fight one another. 
Each theory yields hypotheses about a variety of other 
conflict behaviors. We will examine the expectations of 
each perspective regarding the substantive outcomes of
international crises, that is, which disputants will prevail.
We hope to establish the general veracity of these
four approaches and illuminate the effect of democracy
on conflict behavior more generally.

4 For a related model, see Schultz 1999.
5 For a similar critique, see Gartzke 1998, 2000.


Bueno de Mesquita and the Democratic 
Costs of Using Force 


Bueno de Mesquita and various associates argue that 
democratic states are reluctant to use force interna- 
tionally because they face a higher domestic cost than 
authoritarian states (Bueno de Mesquita and Lalman 
1992; Bueno de Mesquita and Siverson 1995; see also 
Reiter and Stam 1998). This implies that a range of 
concessions are less costly for democratic leaders than 
the decision to use force. Therefore, democratic states 
should be less likely to prevail in international crises 
because of their greater willingness to make conces- 
sions in order to avoid armed conflict. 

Although the average democratic state may be less 
likely to prevail, the domestic cost approach (Bueno de 
Mesquita and Siverson 1995) implies that the same 
cannot be said for a democratic state that chooses to 
become a challenger in international crises. If it ini- 
tiates a crisis, then its leaders must either be highly 
motivated to prevail on the issue at stake or highly 
confident of their ability to prevail, since the domestic 
costs are very large if they lose a conflict they initiate 
(Bueno de Mesquita and Siverson 1995). Therefore, 
democratic challengers should be more likely to prevail 
than authoritarian states because democracies are 
more selective about the crises they choose to initiate 
(Reiter and Stam 1998). Because democratic defenders 
do not select themselves into international crises, they 
should reflect the more general tendency of democratic 
states to yield in international conflicts because of their 
unwillingness to use force. 


HYPOTHESIS 1. Democratic defenders will be less likely 
than other defenders to prevail in an international 
crisis. Democratic challengers will be more likely than 
other challengers to prevail. 


Lake and the Powerful Democratic Pacifists 


Lake (1992) bases his theory on an analogy between 
states and monopolistic firms in a market. Both, he 
argues, attempt to extract “rents” from society in order 
to increase their wealth and power. Democratic gov- 
ernments, however, cannot be as ruthless as authori- 
tarian states in extracting rents because their hold on 
power is more closely tied to societal preferences. 
This relative inability to extract rents has two coun- 
tervailing effects. First, because democracies cannot 
extract resources from territory as intensively as autocracies, 
democratic leaders have fewer incentives for 
territorial expansion. This status quo orientation 
means that democracies have little incentive to initiate 
military conflicts, particularly with other democracies 
that share their satisfaction with the status quo. Fur- 
thermore, the moderate rents extracted by democratic 
states allow for a high growth rate and overall level of 
wealth. Although democracies are relatively less extractive 
than authoritarian states, their wealth permits 
them to amass greater military capacity. 

These countervailing influences suggest two distinct 
hypotheses. First, democratic states should be less 
likely than authoritarian states to initiate international 
crises (Lake’s explanation of the democratic peace). 
Second, when democracies are drawn into interna- 
tional conflicts, they should have the military capability 
to win. Lake explicitly links his theory of the demo- 
cratic peace to a theory about the outcomes of inter- 
national crises. 


HYPOTHESIS 2. Democratic states will be more likely to 
prevail in an international crisis, regardless of whether 
they are challengers or defenders. 

The central distinction between the Bueno de Mes- 
quita and Lake theories lies in their expectations 
regarding the success of democratic defenders. Both 
approaches expect democracies to be unlikely to initi- 
ate disputes (for differing reasons), and both expect 
democracies to be highly successful when they do 
choose to initiate. Only Lake expects democratic de- 
fenders to be successful. 

In Lake’s model, military capabilities and the inter- 
ests at stake are intervening variables, so we must 
examine the effect of democracy both with and without 
these variables (King, Keohane, and Verba 1994). If 
Lake is correct, then the addition or removal of these 
variables should substantially affect the apparent influence 
of democracy.6 


Fearon, Audience Costs, and the Credibility 
of Democratic Signals 


As did Schelling’s (1960, 1966) pioneering work, recent 
scholarship on crisis bargaining focuses on audience 
costs as a central mechanism in sending credible bar- 
gaining signals (Fearon 1994a, 1997; Schultz 1999). 
Essentially, audience costs are imposed upon the bar- 
gainer by someone other than the adversary as a result 
of the bargainer’s statements or actions. The notion 
relates to the democratic peace because democratic 
leaders face substantial domestic costs—including re- 
moval from office—if their international behavior con- 
tradicts the preferences of their domestic audience. 
Many authoritarian leaders are relatively immune from 
such pressure, and even those who are not immune do 
not appear to be vulnerable. Regular, competitive 
elections, however, send a credible signal of vulnera- 
bility to domestic audiences. 

In conflicts involving a democratic and an authori- 
tarian state, the strategic conflict literature indicates 
that the audience cost factor favors liberal negotiators, 


6 Because Lake (1992) argues that democracies may not value 
geopolitical influence or control over territory as highly as nondemocracies, 
we test hypothesis 2 on crises in which the defender’s 
domestic regime is at stake, crises in which the defender is threatened 
with grave damage in war, and crises in which the very existence 
of the defending state is threatened. If democratic defenders are ever 
likely to prevail, it will be in these crises, in which a great deal is at 
stake. 
whose domestic audience lends credibility to their 
bargaining signals (Fearon 1994a; Putnam 1988; 
Schelling 1960). Credibility is generated through two 
related but distinct mechanisms: tying hands and burning 
money (Fearon 1997).7 This kind of credibility 
usually is not available to nondemocratic leaders, who 
often need not worry about the domestic response to 
their behavior. Moreover, even authoritarian leaders 
who face significant domestic constraints do not appear 
internally vulnerable to other states. 

States that have domestic audience costs because of 
their democratic political structures should be able to 
coerce an opponent into backing down if they are 
willing to make the escalatory bargaining moves that 
may generate audience costs. Furthermore, domestic 
audiences observe leaders’ decisions not to escalate 
international crises, and the failure to respond to an 
external threat is also likely to entail domestic costs. 
Thus, the failure of democratic states to make escala- 
tory bargaining moves will be taken as a credible signal 
of their unwillingness to use force. The implication is 
that democracies have few incentives to bluff in inter- 
national crises, and their bargaining moves—whether 
escalatory or deescalatory—will be taken as more 
credible indicators of their true resolve than will the 
moves of authoritarian states. In other words, when 
democracies are engaged in crises with nondemocra- 
cies, their asymmetric susceptibility to audience costs 
should increase the effect of relative resolve on the 
outcome of the crisis. If the democracy escalates, then 
it should effectively persuade the nondemocracy to 
yield.8 If the democracy chooses not to escalate, then 
the nondemocracy should be better able to force it to 
yield, confident that the democracy will not fight. 


HYPOTHESIS 3. Relative demonstrations of resolve should 
have a greater effect on the probability that the chal- 
lenger will prevail in crises between democratic and 
nondemocratic states than on the outcomes of disputes 


Farber and Gowa: Common Interests and 
Common Polities Revisited 


Farber and Gowa’s (1995) critique centers on the claim 
that democratic states have rarely fought since World 
War II because they share fundamentally similar secu- 
rity interests. The outcomes of crises between democ- 
racies represent an especially promising area for test- 
ing this critique, because democracies manifestly do 
not have a common interest in who wins the crises that 
do arise among them. Realist theory holds that crisis 


7 Leaders may “tie their hands” by explicitly promising constituents 


signals his or her resolve regarding the issue at stake (Fearon 1997). 
8 It is important to note that the central variable in this argument is 
observable demonstrations of resolve, not any underlying structure of 
interests at stake. Fearon (1994a) draws a distinction between 
the balance of interests at stake and the process of escalatory 
signaling that generates audience costs. 
bargaining takes place either directly through violence 
or in the shadow of potential violence (Huth 1988; 
Mearsheimer 1983; Organski and Kugler 1980; Waltz 
1979). Perhaps the most central implication of realist 
theory is that states will use their military capability to 
achieve and defend their security interests, which sug- 
gests that crisis outcomes should reflect the relative 
military capacities of the disputants. For a crisis re- 
solved on the battlefield, that statement seems obvious. 
Even if the disputants choose to negotiate, however, 
realism contends that such bargaining occurs in the 
shadow of potential violence and thus also reflects 
military power. 

Most important for our purposes, realists such as 
Farber and Gowa should not expect this basic relation- 
ship between relative military capabilities and dispute 
outcomes to be altered by the regime type of the 
disputants. If the democratic peace is a result of 
common interests rather than common polities, then 
once those common interests evaporate—as in an 
international confrontation— democracies should seek 
victory through the threat of military force just as other 
states do. 


prevail in an international crisis will not depend on the 

jointly democratic status of the disputants. 

Contrary to Farber and Gowa (1995), theories of the 
democratic peace suggest that democracies will be 
strongly constrained not to use force against other 
democratic states. If this is so, then crises between 
democratic states do not involve bargaining in the 
shadow of violence to the same extent as do other 
international crises. That is, if common polities are the 
constraint, then democracies should not fear the use of 
force by another democracy, even if the two states are 
embroiled in an international crisis (Rousseau et al. 
1996). Most theories of the democratic peace would 
lead us to expect that relative military capability will 
have a substantially reduced effect on the outcome of 
crises between democracies.9 


HYPOTHESIS 5. The more jointly democratic the two 


ity that the challenger will prevail in the international 

Farber and Gowa’s argument does not lead us to 
expect that powerful democracies will make conces- 
sions to weaker ones in order to avoid war. Given their 
focus on common security interests, however, their 
approach might lead us to expect that powerful allies 
will make concessions to weaker allies in order to 
maintain the allegiance of the weaker party. Imagine, 
for example, that two allies view a third state as a threat 


9 Some of the theories we examine would make this prediction, but 
we do not assert that any theory of the democratic peace must make 
this prediction. Lake’s (1992) argument, for example, does not 
clearly imply this result. Bueno de Mesquita and Lalman (1992) and 
Fearon (1994a) point more directly to this prediction. 
to their security, but the allies have a strong conflict of 
interest over a separate issue. One might argue that the 
more powerful ally will not impose a one-sided resolu- 
tion to the secondary issue because it fears alienating 
the support of the weaker ally against the common foe. 
A possible example is British concessions to Iceland in 
the Cold War. Such cases seem to fit Farber and 
Gowa’s focus on alliance ties among democracies since 
World War II (Farber and Gowa 1994; Siverson and 
Emmons 1991). Consequently, we believe a fair test of 
hypotheses 4 and 5 must allow for any way in which 
alliances might account for the attenuated influence of 
military capabilities on the outcomes of crises between 
democracies. 


HYPOTHESIS 6. The influence of relative military capabilities 
on the probability the challenger will prevail in 
international crises will be attenuated if the crisis 
occurs between states that share international security 
interests. 


We would not construe lack of support for hypothesis 
6 alone as strong evidence against Farber and 
Gowa’s claim. If hypothesis 5 is supported and hypoth- 
eses 4 and 6 are not, then we would view our results as 
sharply inconsistent with Farber and Gowa’s critique of 
the democratic peace. 


Control Variables 


We include several control variables in our analysis of 
crisis outcomes. These are drawn from the realist 
literature on crisis bargaining and war and represent 
something of a baseline model against which the 
hypotheses can be examined. It is important that we 
include these variables because they are correlated 
with democracy. As such, they represent potentially 
confounding variables. 


Relative Military Capabilities. As noted above, realist 
theories of crisis bargaining expect that the substantive 
outcomes of international crises should reflect the 
power differential of the two disputants (Huth 1988; 
Mearsheimer 1983; Organski and Kugler 1980; Waltz 
1979). Hypotheses 4 and 5 concern the effect of relative 
military capabilities in interaction with alliances and 
joint democracy. In addition, we include Relative Mili- 
tary Capability as an independent variable on its own. 


Relative Interests at Stake. The realist focus on self- 
help implies that state behavior is driven by self- 
interest. States with very substantial interests at stake 
in a crisis will be more willing to suffer costs to defend 
their interests. If one disputant has more at stake than 
another, realist theory would expect it to prevail in the 
crisis (Schelling 1966; Stam 1996). Once again, this 
outcome may occur with or without violence; even if 
the dispute is settled peacefully, the potential for 
escalation to violence causes the outcome to reflect the 
Relative Interests at Stake. 


Relative Resolve. Disputants must demonstrate their 
willingness to use their capabilities and to pay the costs 
of war in order to persuade their opponent to yield. 
States attempt to do this by engaging in actions that 
escalate the crisis toward war. A state that convincingly 
shows its willingness to use force and to suffer war costs 
should be more likely to prevail in the crisis. Hypoth- 
esis 3 tests the effect of this variable in interaction with 
domestic audience costs, but we also include Relative 
Resolve as an independent variable. 


Nuclear Weapons. Numerous theoretical and empiri- 
cal studies of crisis bargaining separate the influence of 
Nuclear Weapons from conventional forces (Betts 1987; 
Huth 1990; Huth, Gelpi, and Bennett 1993; Pape 1996; 
Schelling 1960, 1966). On the one hand, the possession 
of nuclear weapons by the defender in a crisis may be 
a powerful safeguard of the status quo because few 
issues—if any—could lead a challenger to prefer nu- 
clear war to a strategic retreat (Huth, Gelpi, and 
Bennett 1993; Schelling 1966). On the other hand, a 
challenger with nuclear weapons may blackmail de- 
fenders into submission (Betts 1987; Huth 1990). 


MEASUREMENTS AND RESEARCH DESIGN 


We tested our hypotheses against the population of 
international crises that involved the threat or use of 
military force between 1918 and 1994. These were 
drawn from the International Crisis Behavior (ICB) 
Project, directed by Michael Brecher and Jonathan 
Wilkenfeld (1997), but our population of events differs 
from the ICB set in several ways. We describe each of 
our alterations to the ICB data set in detail in the 
Appendix.10 Our coding rules identified 422 dyads 
involved in 283 international crises between 1918 and 
1994. In a few instances we could not determine a 
state’s domestic regime type because it was in flux 
during the crisis, or we were unable to gather data on 
a state’s military capabilities or alliance status. Conse- 
quently, our final analyses cover between 409 and 416 
international crisis dyads. 


Crisis Outcome 


The dependent variable is the extent to which the 
challenger is able to prevail on the issues at stake in the 
international crisis (Crisis Outcome). Coding is on a 
three-point scale: win (3), draw (2), and lose (1). Data 
are drawn from the ICB, which codes crisis outcomes 
on a four-point scale: victory, compromise, stalemate, 
and loss.11 We pool cases of stalemate and compromise 
because both outcomes represent a partial success by 
the challenger. An analysis based on the full four-point 
ICB scale yielded nearly identical substantive results, 
but the overall fit of the model was substantially 
improved by pooling the stalemate and compromise 
cases. 
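The pooling rule described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the authors' code: the string labels for the four ICB outcome categories and the name `crisis_outcome` are hypothetical and chosen for readability.

```python
# Hypothetical labels for the four ICB outcome categories, viewed from the
# challenger's side. The actual ICB variable uses its own numeric codes.
ICB_TO_THREE_POINT = {
    "victory": 3,     # win
    "compromise": 2,  # pooled into "draw"
    "stalemate": 2,   # pooled into "draw"
    "loss": 1,        # lose
}


def crisis_outcome(icb_outcome: str) -> int:
    """Collapse a four-category ICB outcome into the three-point scale."""
    return ICB_TO_THREE_POINT[icb_outcome.strip().lower()]
```

Because compromise and stalemate map to the same value, the resulting variable keeps only the ordering lose < draw < win.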


10 Data may be downloaded from http://www.duke.edu/~pgelpi/data.htm. 

11 The ICB attempts to code these outcomes as an objective observer 
would view their content rather than in terms of the subjective 
perceptions of the participants. 
Democracy Scores of Challengers and 
Defenders 


This variable measures the challenger and defender 
state’s level of democracy (Challenger Democracy and 
Defender Democracy). We relied on the updated Polity 
III data set, compiled by Jaggers and Gurr (1997). We 
subtracted the Polity III autocracy index from the 
democracy index to produce a variable that ranges 
from -10 to +10. In order to ease the interpretation of 
the statistical results, this variable was rescaled from 1 
to 21. We use it to test hypotheses 1 and 2. We also 
created a Joint Democracy variable, which we used in 
interaction with military capabilities to test hypotheses 
4 and 5.12 This variable was constructed by multiplying 
the challenger and defender democracy scores. 
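The construction of the democracy scores and the Joint Democracy variable can be illustrated as follows. This is a minimal sketch, assuming the democracy and autocracy indices are supplied as integers; the function and parameter names are hypothetical.

```python
def democracy_score(democracy_index: int, autocracy_index: int) -> int:
    """Democracy minus autocracy (-10..+10), shifted to the 1..21 range."""
    raw = democracy_index - autocracy_index
    if not -10 <= raw <= 10:
        raise ValueError("difference must lie between -10 and +10")
    return raw + 11  # -10 maps to 1, +10 maps to 21


def joint_democracy(challenger_score: int, defender_score: int) -> int:
    """Product of the two rescaled scores, as described in the text."""
    return challenger_score * defender_score
```

With both scores on the 1 to 21 scale, the product runs from 1 (two fully autocratic states) to 441 (two fully democratic states).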


Relative Audience Costs 


We measure Relative Audience Costs by subtracting the 
defender’s democracy score from the challenger’s and 
squaring the resulting difference. The variable ranges 
from 0 to 400. A score of 0 indicates that the two crisis 
participants are equally democratic (and therefore 
equally subject to audience costs). A score of 400 
indicates a crisis between an extremely authoritarian 
and an extremely democratic state. This variable cap- 
tures the situations in which Fearon (1994a, 1997) 
would contend that one disputant has a bargaining 
advantage because of its ability to generate audience 
costs. We use this variable in interaction with relative 
resolve to test hypothesis 3.14 
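As a worked check on the stated range, the squared-difference measure can be written out directly. This is an illustrative sketch that assumes both inputs are democracy scores on the rescaled 1 to 21 scale; the function name is hypothetical.

```python
def relative_audience_costs(challenger_score: int, defender_score: int) -> int:
    """Squared difference between the two rescaled (1..21) democracy scores."""
    for score in (challenger_score, defender_score):
        if not 1 <= score <= 21:
            raise ValueError("democracy scores must lie between 1 and 21")
    return (challenger_score - defender_score) ** 2
```

The largest possible difference is 21 - 1 = 20, so the maximum is 20 squared, or 400, as stated. Squaring also discards the sign, so the measure records how unequal the two regimes are but not which side is the more democratic one.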


Common International Security Interests 


Perhaps the best observable indicator of shared security 
interests is alliance ties (Bueno de Mesquita 1981). 
Thus, we measure Common International Security Interests 
as a dummy variable that is coded 1 if the two 
disputants share a formal alliance tie (Shared Alliance 
Tie), 0 otherwise. The data on alliances are drawn from 
the updated Correlates of War (COW) data set on 


12 We had no theoretical expectation that joint democracy would 
have an independent influence on the probability that the challenger 
would prevail. Thus, we do not include this variable in our final 
analyses independent of its interaction with relative military capabilities, 
but we tested this assumption by including joint democracy in 
all the analyses in Table 1. As expected, the coefficient was insignif- 
icant and did not substantively alter the effect of the other variables. 
13 Joint democracy is a difficult concept to measure. The democratic 
peace literature is not specific about the functional form of the 
relationship between the challenger’s level of democracy, the defender’s 
level of democracy, and the joint “democraticness” of the dyad. 
We selected our specification because we lack a theoretical basis for 
choosing a more complex functional form. An interaction of linear 
effects seems the simplest form to assume. The results remain 
essentially unchanged whether one uses either a dummy variable that 
identifies dyads with two democratic states or the minimum democ- 
racy score in the dyad as a measure of joint democracy. 
14 As in the case of joint democracy, we have no theoretical 
expectation that relative audience costs will have an independent 
effect on crisis outcomes. Thus we do not include it as an indepen- 
dent variable in our analyses. We tested this assumption and found 
that its inclusion did not significantly alter the results. 
interstate alliances. We use this variable in interaction 
with relative military capabilities to test hypothesis 6.15 


Relative Military Capabilities 


Data for this variable were drawn from the COW data 
set on national material capabilities and the values 
were calculated through the EUGene data generation 
program (Bennett and Stam 2000). EUGene calculated 
the proportion of global composite military capabilities 
controlled by each crisis participant during the year 
that the crisis was initiated. We then calculated the 
proportion of the capabilities within each crisis dyad 
that was controlled by the challenging state (i.e., chal- 
lenger capabilities/(challenger + defender capabili- 
ties)). 
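The dyadic share defined above is a simple ratio. The sketch below assumes each state's share of global capabilities has already been obtained (for example from EUGene output); the function and parameter names are hypothetical.

```python
def relative_military_capability(challenger_cap: float, defender_cap: float) -> float:
    """Challenger's proportion of the capabilities held within the dyad."""
    total = challenger_cap + defender_cap
    if challenger_cap < 0 or defender_cap < 0 or total == 0:
        raise ValueError("capabilities must be non-negative and not both zero")
    return challenger_cap / total
```

The result lies between 0 and 1, with 0.5 indicating parity between challenger and defender.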


Relative Interests at Stake 


In the ICB data set, Brecher and Wilkenfeld (1997) use 
a six-value ordinal scale to score the gravity of the value 
threatened for each state in the crisis. We transformed 
this variable in three ways in order to create the 
relative interests at stake. First, consistent with Gelpi 
(1997a), we believe that the fourth category on this 
scale, international influence, should not be considered 
a more serious issue than the second category, the 
political stability of the government in power, or the 
third category, the territorial integrity of the state. 
Thus, we reordered the Brecher and Wilkenfeld scale 
so that international influence is coded lower than 
governmental stability and territory. Second, we col- 
lapsed this six-point scale into three categories that 
reflect differences of similar magnitude. Specifically, 
crises that involve a low level of threat or international 
influence are given a value of 1; those that involve 
territory or the status of the regime are coded 2; and 
those that threaten grave damage or the existence of 
the state are coded 3.16 Third, we calculated the 
relative interests at stake by subtracting the defender’s 
score from the challenger’s. 
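The three transformations can be summarized in a short sketch. The category labels below are hypothetical shorthand for the six ICB gravity categories named in the text, not the ICB's own codes, and the function name is likewise an assumption.

```python
# Hypothetical labels for the six ICB "gravity of value threatened" categories,
# mapped to the collapsed three-point scale described in the text.
INTERESTS_AT_STAKE = {
    "low threat": 1,
    "international influence": 1,  # reordered below regime and territory
    "political regime": 2,
    "territory": 2,
    "grave damage": 3,
    "existence": 3,
}


def relative_interests(challenger_gravity: str, defender_gravity: str) -> int:
    """Challenger's collapsed score minus the defender's (range -2..+2)."""
    return INTERESTS_AT_STAKE[challenger_gravity] - INTERESTS_AT_STAKE[defender_gravity]
```

The difference ranges from -2 (the defender has far more at stake) to +2 (the challenger has far more at stake).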


Relative Resolve 


Relative resolve is coded on the basis of the ICB data 
set. Brecher and Wilkenfeld (1997) code the crisis 
management strategy of the participants on an eight-point 
scale that ascends toward military violence.17 As 


15 As in the case of joint democracy and audience costs, we had no 
theoretical expectation that this variable will have an independent 
effect on crisis outcomes. We tested this expectation and found that 
the coefficient was never substantively or statistically significant, and 
it did not substantially change the coefficients or statistical 
significance of the other variables. Thus, we did not include it in the 
presentation of our final results. 

16 Analyses with the full six-point ICB scale yielded virtually identical 
results. The direction and substantive size of all coefficients as well as 
their levels of statistical significance remained unchanged. The 
models estimated with the three-point scale yielded a slightly better 
overall fit to the data, so we retained the rescaled variable. 

17 Consistent with Fearon (1994a) and Schelling (1960, 1966), we 
distinguish between demonstrations of resolve and the underlying 
preferences that may give rise to such demonstrations. 
in the case of the interests at stake, however, we 
collapsed their scale into four categories that we be- 
lieve reflect comparable steps in the escalation of crisis 
management. Specifically, a strategy of negotiation, 
arbitration, adjudication, or mediation is given a value 
of 1. A strategy that relies on nonmilitary pressure, 
such as economic sanctions or diplomatic pressure, as 
well as nonviolent military acts, such as troop move- 
ments or threats to use force, is given a value of 2. A 
strategy that combines military force and other tactics 
is given a value of 3. A strategy that relies exclusively 
on the use of military force is given a value of 4.18 We 
then calculated the relative resolve demonstrated by 
the disputants by subtracting the defender’s resolve 
score from the challenger’s. 


Possession of Nuclear Weapons by 
Challenger and Defender 


Challenger Nuclear Weapons and Defender Nuclear 
Weapons are both dichotomous variables. A state must 
satisfy two criteria to be identified as nuclear capable 
for the purposes of crisis bargaining. First, it must be 
capable of deploying a nuclear weapon. Second, this 
capability must be known by the opposing state in the 
crisis in order to have any coercive effect. Data for 
coding challenger nuclear weapons and defender nu- 
clear weapons were drawn from Arkin and Fieldhouse 
(1985), Betts (1987), and the High Energy Weapons 
Archive (Federation of American Scientists 2000). 

With the exception of Israel, all dates regarding 
nuclear capability are based upon the dates of publicly 
acknowledged tests. The United States tested its first 
atomic weapon in 1945, Britain in 1952, France in 1960, 
the People’s Republic of China in 1964, and India in 
1974. We coded Israel as a nuclear state beginning in 
1969.19 Altering this date to any year between 1968 and 
1973 has no effect on our results. 


18 Analyses with the full eight-point ICB scale yielded similar results, 
although the coefficients and levels of statistical significance were 
somewhat reduced. We believe that the reduction in coefficient size 
was due to measurement error introduced by using the eight-point 
scale as a linear measure of resolve. Our rescaling addresses this 
problem. 

19 Israel may not have participated in nuclear tests (as opposed to 
zero-yield hydrogen tests) until as late as 1979, but it was known to 
be nuclear capable much earlier. Israel deployed nuclear weapons 
during the 1973 October War, and its Dimona reactor began 
producing weapons grade nuclear material in 1968, so we chose 1969 
as the date. Altering this date to any year between 1968 and 1973 has 
no effect on our results. South Africa became a nuclear state in 1979 


purposes of crisis bargaining. 

capable in 1991, but that capability was not publicly acknowledged 
until 1992. Since Pakistan was not involved in an international crisis 
between 1991 and 1994, we do not code it as a nuclear capable state 
in any crisis in our data set. Classifying Pakistan as nuclear capable 
during the 1990 crisis with India had no substantial effect on our 
results. 
DATA ANALYSIS AND STATISTICAL 
RESULTS 


We began the hypothesis tests with three ordered 
probit analyses, the results of which are presented in 
Table 1. Model I contains only the realist control 
variables. The coefficient on relative military capabili- 
ties is positive and statistically significant (b = 0.53, 
p < .01). This coefficient indicates that challengers 
become more likely to prevail as they control a larger 
proportion of the capabilities in the crisis dyad. The 
coefficient on relative interests at stake also is positive 
and statistically significant (b = 0.22, p < .01). This 
coefficient indicates that challengers become more 
likely to prevail as their stakes in the dispute increase 
relative to those of the defender. 

The substantive effects of changes in the relative 
interests at stake are displayed in Table 2. A shift from 
a crisis in which the challenger and defender have 
equally strong interests to one in which the challenger’s 
interests are one category higher increases the proba- 
bility of a challenger victory by an additional 11.3%. 
Another shift to a crisis in which the challenger’s 
interests rank two categories higher than the defend- 
er’s increases the probability of victory by 16%. When 
the overall shift is from a crisis in which the defender’s 
interests rank two categories higher than the challeng- 
er’s to a crisis in which the challenger enjoys such an 
advantage, the probability that the challenger will 
prevail increases from 11% to 40%. The coefficient for 
relative resolve is positive (b = 0.08) in Table 1, but it 
does not achieve statistical significance (p < .15).20 
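The way an ordered probit coefficient translates into changes in outcome probabilities can be sketched with the standard three-category formulation. The code below is an illustration only, not the authors' procedure: the baseline linear predictor `xb`, the thresholds, and all function names are assumptions, and no attempt is made to reproduce the figures in Table 2.

```python
from math import erf, sqrt


def normal_cdf(z: float) -> float:
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))


def outcome_probabilities(xb: float, tau1: float, tau2: float) -> tuple:
    """P(lose), P(draw), P(win) for linear predictor xb and thresholds tau1 < tau2."""
    p_lose = normal_cdf(tau1 - xb)
    p_draw = normal_cdf(tau2 - xb) - p_lose
    p_win = 1.0 - normal_cdf(tau2 - xb)
    return p_lose, p_draw, p_win


def marginal_effect(xb: float, tau1: float, tau2: float,
                    coefficient: float, step: float = 1.0) -> tuple:
    """Change in each outcome probability when one variable rises by `step`."""
    before = outcome_probabilities(xb, tau1, tau2)
    after = outcome_probabilities(xb + coefficient * step, tau1, tau2)
    return tuple(a - b for a, b in zip(after, before))
```

Because the three probabilities always sum to one, the three changes returned by `marginal_effect` sum to zero, which is the pattern visible in each row of a table of marginal effects.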

With regard to nuclear weapons, in Table 1 the 
coefficient on the challenger’s possession of nuclear 
weapons is positive, as expected, but it does not achieve 
statistical significance (p < .09).21 In contrast, the 
coefficient for the defender’s nuclear capability is negative 
and statistically significant (b = -0.31, p < .05), 
but its substantive effect is not strikingly large. Specif- 
ically, compared to a defender without nuclear weap- 
ons, a defender with nuclear weapons is 11% less likely 
to be defeated and is 8% more likely to prevail. This 
provides some support for the argument that nuclear 
weapons can be effective in defending the status quo, 
but we do not find strong support for the view that 
challengers can use nuclear weapons to revise the 
status quo. 

Overall, the results of Model I provide substantial 


20 Auxiliary r-squared for this variable was only 0.11, which indicates 
that colinearity is not a plausible interpretation of this insignificant 
result. Auxiliary r-squares for all variables in models I and II were 
0.41 or lower. Moreover, the two variables with higher colinearity 
significant. Auxiliary r-squares for the variables in Model 
III were all below 0.60, with the exception of the interaction of joint 
democracy and military capabilities. 
to 0.30. Thus, colinearity is not a plausible explanation for any of the 
statistically insignificant results. 

21 Once again, the auxiliary r-squared for this variable is only 0.10. 

22 A similar dummy variable identifying only the cases in which the 
challenger has a nuclear monopoly (i.e., the challenger has nuclear 
weapons and the defender does not) yielded virtually identical 
results. 
TABLE 1. Probit Analyses Predicting Challenger’s Victory in International Crises 
Model I Model II Model III 

Democracy Variables 
Challenger’s democracy 0.03** 0.04* 
(0.01) (0.01) 
Defender’s democracy -0.00 0.01 
(0.01) (0.01) 
Relative audience costs × relative resolve — 0.001* 
(0.005) 
Relative military capabilities × joint democracy -0.002* 
(0.001) 

Realist Variables 
Relative military capabilities × disputants share alliance tie 0.17 
(0.23) 
Relative military capabilities 0.79** 
(0.26) 
Relative interests at stake 0.23** 
(0.08) 
Relative resolve -0.01 
(0.10) 
Challenger nuclear weapons 0.05 
(0.20) 
Defender nuclear weapons -0.29 
(0.20) 

First threshold -0.06 0.50 
Second threshold 1.02 1.62 

Number of observations 413 409 
183 195 
44% 48% 
25.7 (5)** 7.3 (2)** 38.2 (10)** 
Note: Standard errors for coefficients are in parentheses. Huber-White robust standard errors are allowed for clustering on each crisis. *p < .05, **p < .01. 
Significance tests for challenger and defender democracy are two-tailed. All other significance tests are one-tailed. 





support for the realist model of crisis outcomes. 
Clearly, power and interests have a significant influence 
on the outcome of international crises. As we 
shall see below, however, realist variables are only the 
beginning of the story. 

Also presented in Table 1 are the results for Model 


TABLE 2. Marginal Effects of Relative Interests at Stake and the Challenger’s Democracy on the 
Probability of a Challenger Victory 

Section I: Interests at Stake 

Change in             Change in P          Change in P   Change in P 
Interests at Stake    (Challenger Loses)   (Draw)        (Challenger Wins) 
-2 to -1              -9.3%                +4.3%         +5.0% 
-1 to 0               -9.1%                +2.8%         +6.3% 
0 to 1                -8.4%                +0.9%         +7.5% 
1 to 2                -7.3%                -1.2%         +8.5% 

Section II: Challenger Democracy 

Change in             Change in P          Change in P   Change in P 
                      (Challenger Loses)   (Draw)        (Challenger Wins) 
                                           +2.0%         +3.3% 
                                           +1.4%         +3.7% 
                                           +0.7%         +4.2% 
                                           +0.1%         +4.5% 

values from the probit model while changing the values of selected variables and holding 
that the outcome would fall into each category by summing the 
probabilities 
between the predicted value and each of the category thresholds. 
II, which includes only the political structure variables:
the challenger's and defender's democracy scores. We
estimate this model to distinguish Lake's (1992) view of
democracies as powerful pacifists from Bueno de Mesquita's
(Bueno de Mesquita and Lalman 1992; Bueno
de Mesquita and Siverson 1995; see also Reiter and 
Stam 1998) concept of democracies as weak and con- 
strained polities. We do not include the realist control 
variables in Model II because Lake’s argument implies 
that interests at stake and military capabilities may be 
intervening variables between democracy and conflict 
behavior (King, Keohane, and Verba 1994). 

Consistent with both Bueno de Mesquita and Lake,
the results of Model II indicate that democratic challengers
are more likely to prevail. The coefficient on
challenger's democracy is positive and statistically significant
(b = 0.05, p < .01). The substantive effect of
this variable is displayed in the second section of Table 
2, which shows that it is roughly comparable to the 
effect of the interests at stake. Specifically, a shift in the 
challenger’s domestic regime from an autocracy (de- 
mocracy score = 1) to a mixed regime (democracy 
score = 11) increases the probability that the chal- 
lenger will prevail in the crisis by 7%. A further shift 
from a mixed regime to full democracy (democracy 
score = 21) raises the probability of victory by an 
additional 8.7%. Overall, the probability that an au- 
thoritarian challenger will prevail in a crisis is 15%, 
compared to 30% for a democratic challenger. 
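
The probabilities behind these marginal effects come from an ordered probit with two thresholds separating three outcomes (challenger loses, draw, challenger wins). As a minimal sketch of that calculation, and not the authors' own code, the snippet below converts a linear predictor into the three outcome probabilities. The thresholds are one of the pairs printed in Table 1; the linear-predictor values and the function names are hypothetical and chosen only for illustration.

```python
import math


def normal_cdf(z: float) -> float:
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def outcome_probabilities(xb: float, cut1: float, cut2: float) -> dict:
    """Ordered probit probabilities for a three-category outcome.

    xb is the linear predictor; cut1 < cut2 are the two thresholds.
    """
    p_lose = normal_cdf(cut1 - xb)
    p_draw = normal_cdf(cut2 - xb) - normal_cdf(cut1 - xb)
    p_win = 1.0 - normal_cdf(cut2 - xb)
    return {"lose": p_lose, "draw": p_draw, "win": p_win}


# Thresholds as printed in Table 1 (0.50 and 1.62); which model they
# belong to is an assumption made for this example.
CUT1, CUT2 = 0.50, 1.62

# Hypothetical linear predictors for a lower and a higher challenger
# democracy score, with every other variable held fixed.
low = outcome_probabilities(0.2, CUT1, CUT2)
high = outcome_probabilities(0.8, CUT1, CUT2)

for label, probs in (("low", low), ("high", high)):
    print(label, {name: round(p, 3) for name, p in probs.items()})
```

Because the three probabilities always sum to one, any change in the linear predictor produces changes across the categories that sum to zero, which is the pattern visible in each row of Table 2.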

The defender's democracy score, however, has virtually
no influence on crisis outcome (b = -0.00,
n.s.). Moreover, the results of Model II remain unchanged
even when the analysis is restricted to cases in
which the defender's regime, its existence, or grave
damage in war is at stake (b = 0.00, n.s.). Contrary to
Lake’s expectations, democratic defenders are not 
more likely to prevail even when defeat entails dire 
consequences. 

At first glance, these results do not appear consistent 
with either Bueno de Mesquita or Lake. Both predict 
the positive coefficient on challenger democracy, and 
neither predicts an insignificant effect for defender 
democracy. Yet, we believe that our results are more 
supportive of the Bueno de Mesquita framework for at 
least three reasons. First, that approach expects the 
effect of democracy on crisis outcomes to differ accord- 
ing to conflict initiator. Specifically, only democracies 
that initiate international crises will be more likely to 
prevail. Model II does indicate that the effect of 
democracy depends upon the challenger or defender 
status. 


Second, as indicated by the results in Model III, the
coefficient for the defender's democracy becomes substantially
more strongly positive (b = 0.15, but does
not achieve statistical significance, p < .15) once we
control for relative military capabilities and the issues 
at stake. 

Third, the selection process that appears to be 
dampening the coefficient for the defender’s democ- 
racy score reflects the expectations of Bueno de Mes- 
quita at the expense of Lake. The ability to generalize 
from results is limited by selection effects whenever
sample observations are not randomly drawn from the
population (Achen 1986). International crises are not a 
random set of state interactions, and any analysis of 
crisis behavior is subject to those effects (Fearon 
1994b). In general, nonrandom selection reduces the 
observed substantive size and statistical significance of 
relationships. Of course, the crisis selection process 
may affect the coefficient for any variable that is known 
and observable before the outbreak of the crisis. 

The examination of selection effects on the defend- 
er’s democracy score is particularly important because 
Lake and Bueno de Mesquita make differing claims 
about what might be dampening the influence of 
defender democracy. Specifically, Lake (1992) expects 
that defender democracy will have a negative effect on 
the probability of a challenger victory because democ- 
racies are powerful and efficient combatants. Accord- 
ing to this logic, Lake’s expected negative coefficient 
may be dampened if challengers, recognizing the mili- 
tary capacity of democratic states, tend to initiate crises 
only against especially weak democracies. If this were 
true, then the democratic defenders we observe would 
tend to be at a disadvantage in terms of relative 
military capabilities. Bueno de Mesquita (Bueno de 
Mesquita and Lalman 1992; Bueno de Mesquita and 
Siverson 1995), for his part, expects a positive relation- 
ship between defender democracy and the probability 
of a challenger victory because democracies will gen- 
erally be reluctant to use force. This expected positive 
coefficient may be dampened if challengers are willing 
to initiate crises even against very powerful democra- 
cies because of their belief that the democracies will 
not fight. If this were true, then the democratic defend- 
ers we observe would tend to have an advantage in 
terms of relative military capabilities. 

We tested these expectations by regressing the relative military
capabilities in the dyad (recall that this is expressed as
the proportion of total military capability controlled by
the challenger) on the challenger's and defender's democracy
scores. We also included the relative interests
at stake as a control variable. The key variable in this
analysis is the defender democracy score. Lake predicts 
a positive coefficient. That is, Lake expects that when 
challengers select a democratic defender, they will tend 
to do so if they have an advantage, in terms of relative 
military capabilities. Bueno de Mesquita predicts a 
negative coefficient. That is, Bueno de Mesquita ex- 
pects that when challengers select a democratic de- 
fender they may target states that are more powerful 


In extreme cases, selection effects can cause coefficients to reverse
sign. This logic led Bueno de Mesquita and others to hypothesis 8,
which predicts that the weakness of democratic states will cause
democratic challengers to choose crises that they are more likely to
win.


were dropped from the analysis. 
TABLE 3. Regression Analysis Predicting
Relative Military Capabilities in International
Crises by Disputants' Power Status

Realist Variables
Challenger's democracy
Defender's democracy
Relative interests at stake
Constant

Number of observations
F-test (degrees of freedom)

Note: Standard errors for coefficients are in parentheses. Huber-White
robust standard errors are allowed for clustering on each crisis. **p <
.01. All tests for statistical significance are two-tailed.





than themselves because of their belief that democra- 
cies prefer not to fight. 

The results of this analysis are presented in Table 3. 
These findings clearly support Bueno de Mesquita’s 
logic. Most important, the coefficient for the defender’s 
democracy is clearly negative and statistically signifi- 
cant (b = —0.009, p < .01). Since this analysis was 
performed by ordinary least squares, the marginal 
effect of the variables can be easily calculated directly 
from the coefficients. For each increase of one unit in 
the defender’s democracy score, the balance of relative 
military capability shifts toward the defender by .009. 
When other variables are held at their mean value, the 
predicted relative capability score for a crisis in which 
the defender is authoritarian (democracy score = 1) is 
.58, which reflects an advantage of nearly 3:2 for the
challenger. If the defender is democratic (democracy 
score = 21), the comparable score is .41, or a 3:2 
defender advantage. In contrast, the influence of the 
challenger’s democracy score is not significant. Finally, 
as one would expect, the coefficient on relative inter- 
ests at stake is negative, which indicates that challeng- 
ers are willing to accept a less favorable balance of 
military capability when they have more at stake than 
the defender. 
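
Because the model is linear, the arithmetic behind these predicted scores can be checked directly. The sketch below uses only the rounded slope reported in the text (-0.009) and the authoritarian baseline of .58; the function and variable names are illustrative. The small gap between the extrapolated value and the reported .41 presumably reflects rounding of the published coefficient.

```python
# Values reported in the text (rounded as published).
SLOPE_DEFENDER_DEMOCRACY = -0.009  # change in capability score per unit of democracy
BASELINE_SCORE = 0.58              # predicted score when defender democracy = 1


def predicted_capabilities(defender_democracy: int) -> float:
    """Challenger's predicted share of dyadic military capabilities.

    Linear extrapolation from the authoritarian baseline (score = 1),
    holding the other regressors fixed.
    """
    return BASELINE_SCORE + SLOPE_DEFENDER_DEMOCRACY * (defender_democracy - 1)


def challenger_to_defender_ratio(share: float) -> float:
    """Convert the challenger's share into a challenger:defender ratio."""
    return share / (1.0 - share)


autocratic = predicted_capabilities(1)    # baseline case
democratic = predicted_capabilities(21)   # twenty units higher on the democracy scale

print(round(autocratic, 2), round(challenger_to_defender_ratio(autocratic), 2))
print(round(democratic, 2), round(challenger_to_defender_ratio(democratic), 2))
```

A ratio above 1 favors the challenger and a ratio below 1 favors the defender, so the move from an authoritarian to a democratic defender flips the predicted balance.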

It appears that democratic defenders tend to be 
militarily more powerful than their opponents but are 
not more likely to prevail. This result holds even when 
they are defending their domestic regime or the exis- 
tence of their state. Democratic challengers, in con- 
trast, are not more powerful than their opponents but 
are significantly more likely to prevail. These findings 


It is well documented that major powers are disproportionately
involved in military conflict (Bremer 1992), but many of the major
powers between 1918 and 1994 were democratic. To ensure that the
results in Table 3 were not an artifact of these two patterns, we
performed an analysis on the 236 crises in our data set that were
exclusively between minor powers and the 35 crises that were
exclusively between major powers. The results were identical in all
cases. The coefficient for the defender's democracy score within the
minor power set was -0.007 (p < .01). In the major power set the
coefficient was -0.009, but the coefficient was not significant because
there were only 35 cases.
seem more consistent with Bueno de Mesquita than
with Lake. Lake is right that the democracies we 
observe as defenders tend to be powerful states. But 
these powerful democracies appear in our data because 
states are willing to challenge them despite their capa- 
bilities, due to the belief they are reluctant to use force. 
As challengers, however, democracies select the crises 
they can win, even when they do not have an advantage 
in military capabilities. 

Model III in Table 1 contains the complete specifi- 
cation of our analysis of crisis outcomes. It includes the 
realist control variables, the political structure factors, 
and the interaction effects between the two. The results 
for the variables specified in Model I and Model II
change very little in the more complete analysis. Rel- 
ative military capabilities and relative interests at stake 
continue to have a substantial and statistically signifi- 
cant influence on crisis outcome. In fact, the effect of 
both becomes substantively larger in the complete 
model as we control for political structure. The substantive
influence of the defender's nuclear capacity
remains virtually unchanged in Model III (b = -0.29)
but does not achieve statistical significance (p <
.07).27 The effect of the challenger's and defender's
democracy also remains relatively unchanged. As 
noted above, the coefficient for the defender’s democ- 
racy shifts somewhat in the direction expected by 
Bueno de Mesquita in our more complete model, but it 
remains quite small relative to challenger’s democracy 
and does not achieve statistical significance. 

Model III also includes several interactive variables 
that require careful interpretation. The coefficients for 
interaction terms represent the effect that change in 
one variable has on the coefficient for the other 
variable. For example, the coefficient for audience 
costs X relative resolve is 0.01. This coefficient indi- 
cates that each one unit increase in the relative audi- 
ence costs score increases the coefficient for relative 
resolve by 0.01. As we discuss below, this result is 
consistent with hypothesis 3. Hypothesis 4 predicts that 
the interaction of relative military capabilities and joint 
democracy will be insignificant, while hypothesis 5 
predicts that this coefficient will be negative. Finally, 
hypothesis 6 predicts a negative coefficient on the 
interaction between relative military capabilities and 
shared alliance ties. 

The significant influence of the interactive terms in
Model III indicates that the effects of realist variables,
such as capabilities and resolve, are contingent
on variation in political structure. 

27 This result could be due to slightly higher collinearity levels for this
variable in Model III. The democracy variables remain statistically

FIGURE 1. Effect of Relative Resolve on the Probability of Challenger Victory, by Audience Cost
(Indicated by Challenger and Defender Democracy)
[Figure not reproduced. Horizontal axis: Relative Resolve (Escalation of Challenger Bargaining Strategy Relative to Defender). Series: Both Autocratic (No Aud. Advantage); Both Democratic (No Aud. Advantage); Ch. Democ, Df. Autoc (Ch. Aud. Advantage); Ch. Autoc, Df. Democ (Df. Aud. Advantage).]

The positive and statistically significant (b = 0.01,
p < .05) coefficient on the interaction between relative
audience costs and relative resolve provides strong
support for Fearon's (1994a, 1997) argument about
domestic audience costs and the credibility of crisis
escalation.28 This coefficient indicates that demonstrations
of resolve have a greater influence on crisis
outcomes when one of the disputants is able to make its 
escalatory threats more credible because of its demo- 
cratic political structures. Although democracies may 
be reluctant to threaten force, their threats may be 
highly credible when they do so. As a result, democra- 
cies that demonstrate resolve are more likely to prevail 
against nondemocracies. Conversely, democracies that 
do not demonstrate resolve are put under extreme 
pressure by authoritarian states and forced to concede 
defeat. 

The dramatic interactive effects of audience costs 
and relative resolve are displayed in Figure 1. Recall 
that we measure relative audience costs as a function of 
challenger and defender democracy. Thus, we depict 
the effect of relative resolve on the probability of 
challenger victory in four different types of crises: (1)
between two autocracies; (2) between two democracies;
(3) between a democratic challenger and an
autocratic defender; and (4) between an autocratic
challenger and a democratic defender.29


that do not escalate to the large-scale use of force, but war is
frequently viewed as a contest in endurance and willingness to suffer
(Schelling 1966; Stam 1996). Escalation could be viewed as bargaining
moves that generate audience costs in an attempt to persuade the
[...] we believe that it is appropriate to test this
argument on all international crises. Nonetheless, to ensure that the
inclusion of cases that escalate to war does not influence our results,
we reestimated the equation in Table 1 on crises that did not escalate
to war. The results remained virtually identical (b = 0.01, p < .05).


cases in our data set. Of our 422 crisis dyads, 288 involve only states 
with democracy scores below 6 or above 16. Overall, the two most 





When the challenger and defender have similar 
regime types—whether autocratic or democratic—nei- 
ther disputant has an advantage in demonstrating 
domestic audience costs (relative audience costs = 0). 
In this context neither state is able to make its threats 
of escalation effective, and relative resolve has little 
effect on the outcome of the crisis. Each one category 
increase in the challenger’s relative resolve actually 
reduces the probability it will prevail by between 0.2% 
and 0.3%. 

When the disputants have differing regime types, 
however, relative resolve becomes a significant deter- 
minant of crisis outcomes. For example, when the 
challenger is democratic and the defender is authori- 
tarian, the challenger’s domestic audience costs (rela- 
tive audience costs = 289) make its threats highly 
credible to the defender. If the democratic challenger 
escalates three categories higher than the authoritarian 
defender (relative resolve = 3), then the probability 
that the democracy will prevail is 70%. If, however, the 
same democratic challenger allows the defender to 
escalate 3 categories higher (relative resolve = —3), 
then the probability that the democratic challenger will 
prevail is only 8%. Relative resolve has a similar effect 
on crises between an authoritarian challenger and a 
democratic defender (relative audience costs = 289). 
In this case a relative resolve score of 3 becomes a 
credible signal of weakness on the part of the demo- 
cratic defender, leading to nearly a 65% probability of 


common democracy scores in our data set are 4 and 21. In generating 


Figure 1 we set the challenger and defender democracy scores to 
each of the four possible combinations of those scores. 
FIGURE 2. Effect of Relative Military Capabilities on the Probability of Challenger Victory, by

victory for the authoritarian challenger. Conversely, a
relative resolve score of —3 indicates resolve by the 
democratic defender and leads to approximately a 1% 
probability of victory for the authoritarian challenger. 

These findings provide striking and dramatic support 
for arguments by Fearon and others about the interac- 
tion of political structures and bargaining behavior 
(Fearon 1994b, 1997; Putnam 1988; Schultz 1999), but 
they also represent something of a double-edged 
sword. On the one hand, democratic states—whether 
challengers or defenders—can send credible signals of 
resolve that greatly enhance their chances of prevailing 
in an international crisis when they engage in provocative
escalatory behavior. On the other hand, when
democratic states choose not to make escalatory 
moves, that may be taken as a credible signal of 
weakness. The transparency of domestic constraints 
both enables democracies to communicate their will- 
ingness to defend their interests and prevents them 
from bluffing resolve when they are not truly willing to 
pay the price of using force. 

Consistent with hypothesis 5 and contrary to hypoth- 
esis 4, the coefficient on the interaction between joint 
democracy and relative military capabilities is negative 
and statistically significant (b = —0.002, p < .05), 
which indicates that military power has less effect on 
the outcomes of crises between democracies as com- 
pared to other dyads. The substantive influence of the 
interaction between relative military capabilities and 
joint democracy is displayed in Figure 2. 

As the figure indicates, for crises between autocratic 
or mixed dyads, an increase in the challenger’s relative 
military capabilities has a modest positive effect on the 
probability that the challenger will prevail in the crisis. 
In either of these contexts, each increase of 0.2 in the 
relative military capabilities score increases the probability
the challenger will prevail by about 3-4%. Specifically,
in crises between two autocratic states (joint
democracy = 1), the improvement in the challenger’s 
capabilities from 10% of the capabilities in the dyad 
(relative military capabilities = .1) to 90% of the 
capabilities in the dyad (relative military capabilities = 
.9) increases by about 13% the probability that it will 
prevail in the crisis. Similarly, in a crisis involving a 
mixture of regime types (joint democracy = 121), the 
overall change in relative military capabilities from 0.1 
to 0.9 increases by 12.5% the probability of a chal- 
lenger victory. 

In crises between democratic states (joint democ- 
racy = 441), however, the effect of relative capabilities 
evaporates. In this context, increases in the challenger’s 
relative military capabilities slightly reduce the proba- 
bility that it will prevail in the dispute. Figure 2 
indicates that an increase in the challenger’s capabili- 
ties from 10% to 90% of the capabilities in the dyad 
actually reduces by 3.1% the probability that the 
challenger will prevail in the crisis.30
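
The pattern described here follows from how an interaction term works: the effect of relative military capabilities on the probit index equals its own coefficient plus the interaction coefficient multiplied by joint democracy. The sketch below is illustrative only. It assumes that the 0.79 capabilities coefficient that survives in Table 1 as printed here belongs to the same model as the -0.002 interaction coefficient, which the table as printed does not make certain, and the function name is hypothetical.

```python
B_CAPABILITIES = 0.79    # coefficient on relative military capabilities (assumed same model)
B_INTERACTION = -0.002   # relative military capabilities x joint democracy


def capabilities_effect(joint_democracy: float) -> float:
    """Effect of capabilities on the probit index (not on the probability)."""
    return B_CAPABILITIES + B_INTERACTION * joint_democracy


cases = (("both autocratic", 1), ("mixed", 121), ("both democratic", 441))
for label, joint in cases:
    effect = capabilities_effect(joint)
    print(f"{label:16s} joint democracy = {joint:3d}  index effect = {effect:+.3f}")
```

On these numbers the index effect stays positive for autocratic and mixed dyads and turns slightly negative only at the top of the joint democracy scale, which matches the direction of the changes reported in the text. Converting index effects into probability changes additionally requires the thresholds and the values of the other variables.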

Farber and Gowa’s (1995) expectation that the dem- 
ocratic peace can be explained as a spurious result of 
common international security interests among democ- 
racies during the Cold War era is not supported by our 
analysis. As predicted by hypothesis 5, relative military 
capabilities has a substantially reduced effect on the 
outcomes of crises between democratic states. But 


30 We vary joint democracy from 1 to 441 to illustrate the full range
of effects predicted by our model. It is important to note, however,
that the median value for joint democracy is only 42, and 80% of the
cases have joint scores of 126 or lower. Thus, relative
military capabilities do have a significant effect on the outcomes of
[...] is not com[...]
joint democracy reaches a score of 400, the
99th percentile. Nonetheless, as joint democracy increases, we
observe a substantial decrease in the effect of relative military
capabilities.
TABLE 4. Summary of Results

Theoretical Approach                                   Hypotheses          Result
Structural Theories of the Democratic Peace
  Democratic constraints (Bueno de Mesquita et al.)
    Democratic challengers win                         Hypothesis 1        Supported (Tables 1, 2, 3)
  Powerful pacifists (Lake)
    Democratic challenger and defenders win            Hypothesis 2        Not supported (Tables 1, 3)
  Democratic audience costs (Fearon)
    Democracy and credibility of resolve               Hypothesis 3        Supported (Table 1, Figure 1)
Critiquing the Democratic Peace
  Common polities (democratic peace)
    Democracy reduces effect of capabilities           Hypothesis 5        Supported (Table 1, Figure 2)
  Common interests (Farber and Gowa)
    Alliance reduces effect of capabilities and        Hypothesis 6        Not supported (Table 1)
    does not                                           Hypothesis 4        Not supported (Table 1, Figure 2)
Realist Bargaining Theory
  Relative capabilities                                Control variables   Supported (Table 1, Figure 1)
  Relative interests                                                       Supported (Tables 1, 2)
  Relative resolve                                                         Not supported (Table 1)
  Nuclear weapons                                                          Partially supported (Table 1)

contrary to hypothesis 6, the coefficient for the interaction
of alliance ties and relative capabilities does not
approach statistical significance.31 This insignificant
coefficient indicates that the effect of relative military
capabilities on crises between allies is not different
from its effect on crises between nonallies. That is,
while allies appear to bargain with one another in the
shadow of violence, democracies do not. This combination
of results seems especially inconsistent with
Farber and Gowa's analysis.

31 Further evidence against Farber and Gowa is demonstrated by the
fact that this interaction becomes even more substantial when the
analysis is on the 111 pre-Cold War crises that occurred
between 1918 and 1945. For this time period, the coefficient is
-0.004 (p < .10).


CONCLUSION


By extending theories of the democratic peace to
encompass a new dependent variable, the outcomes
of international crises, we can take a fresh look at the
debate. Table 4 summarizes the central conclusions of
our research.

First, our analysis undermines the realist claim (Farber
and Gowa 1995) that the democratic peace can be
attributed to common strategic interests. Although
democracies may share an interest in not using force
against one another, they do not have a common
interest in determining which state will prevail in an
international crisis. The realist claim of a spurious
correlation implies that the effect of relative military
capabilities on crisis outcomes should not depend on
the disputants' domestic political structures. Instead,
realists would expect that the effect of relative military
capabilities will depend on the shared security interests
of the disputants. Neither expectation is supported by
our results.

Second, our study does not support Lake's (1992)
contention that democracies do not fight one another
because they are powerful states that are satisfied with 
the status quo. This “powerful pacifist” argument sug- 
gests that democratic states, whether challengers or 
defenders, should be more likely to prevail when they 
do become involved in international crises. We find 
that not to be the case. 

Third, we do find support for the argument (Bueno 
de Mesquita and Lalman 1992; Bueno de Mesquita and 
Siverson 1995; see also Reiter and Stam 1998) that 
democratic states face domestic costs in using force 
and select crises in which they are more likely to 
prevail. Moreover, we find that the perceived weakness 
of democracies leads states to challenge them despite 
their military power. This explanation is consistent with 
other empirical analyses of the democratic peace 
(Grieco 2001; Rousseau et al. 1996). 

Fourth, our results provide the strongest support for 
Fearon’s (1994a, 1997) argument concerning the inter- 
action of audience costs and crisis escalation. We found 
that an asymmetry in domestic audience costs has a 
dramatic influence on the credibility of escalatory 
signals sent during a crisis. In particular, democratic 
states, whether challengers or defenders, can more 
credibly signal their intentions than nondemocratic 
states. As a result, democratic states can dramatically 
increase their chances of prevailing in a crisis by 
engaging in escalatory behavior. Yet, their failure to 
escalate is viewed as a credible sign of weakness that 
drives nondemocratic states to coerce them into capit- 
ulation. 

An interesting synthesis of the Bueno de Mesquita 
and Fearon explanations emerges from our study. The 
democratic peace is caused by the constraints that 
democratic political structures place on state leaders. 
These structures impose costs for using force that have 
at least four implications for democratic foreign policy. 
First, as discussed by Bueno de Mesquita and Lalman 
(1992), the costs prevent democracies from fighting 
one another. Second, due to their perceived reluctance 
to use force, democracies may attract a greater number
of challenges than one would expect in view of their
military capabilities (Grieco 2001; Rousseau et al.
1996). Third, democratic states choose conflicts in
which they are especially likely to prevail. Fourth, and
somewhat paradoxically, the openness of democratic 
societies makes their bargaining tactics credible to 
opponents. This credibility helps them prevail when 
they are willing to use force but prevents them from 
successfully bluffing resolve. This synthesis is attractive 
inasmuch as it predicts a variety of behaviors based 
upon a single simple assumption that lies at the core of 
numerous theories of the democratic peace. 

In sum, our analysis indicates that democratic polit- 
ical structures do affect international state behavior. 
Specifically, those structures impose costs on leaders 
who choose to initiate force, which in turn make their
actions credible to their opponents. 


APPENDIX: PROCEDURES FOR REVISING 
THE INTERNATIONAL CRISIS BEHAVIOR 
PROJECT (ICB) DATA SET 


We made three types of revisions to the set of cases identified 
by the ICB. 


(1) We merged a number of the ICB cases in which military 
conflict was ongoing between crises. We did so because our 
central dependent variable is the outcome of the crises, and 
in ongoing conflict no final resolution has been reached. 

These merged cases fall into two categories. The first 
encompasses what the ICB calls “intra-war” crises, such as 
the battle of Stalingrad. We treat all German-Soviet engagements
during World War II as a single encounter, beginning
with Operation Barbarossa in June 1941 and ending in May 
1945. The second category involves what the ICB calls 
“protracted conflicts,” many of which are essentially long- 
term guerilla campaigns or internationalized civil wars. We 
merge all ICB crises between the same two participants if the 
prior crisis ends without a negotiated solution and less than 
one year elapses before combat resumes. For example, the 
nearly continuous guerilla fighting between Mozambique and 
Rhodesia from 1976 to 1980 is represented in our data set as 
a single conflict in which Rhodesia is the loser, rather than
several stalemated engagements followed by a Rhodesian 
loss. 


(2) We disaggregated multilateral ICB crises into a series of 
dyads. A number of the central variables in our analysis are 
most appropriately measured on the state or dyadic level. It
is unclear, for example, how one should evaluate the relative 
interests at stake in the Korean War. For the United States, 
China, and the Soviet Union, the conflict was about geopolitical
influence; for North Korea it was about territory; for
South Korea it was about survival. We create seven separate
crisis dyads: North Korea versus South Korea, North Korea 
versus the United States, and so on. Of course, this procedure 
means that our observation of the dependent variable (crisis 
outcome) is not independent across all the dyads that emerge 
from a single crisis. We account for this problem by estimat- 
ing Huber-White robust standard errors that allow for non- 
independence of dyads within each crisis. For example, our 
analyses allow for a correlation among the outcomes of the 
seven crisis dyads of the Korean War. 


(3) A few ICB international crisis events do not appear in
our data set because no militarized threat or activity occurred
between any dyad of state actors; all militarized threats were
either targeted at or initiated by nonstate actors. 


(4) For each crisis dyad we identify one state as the chal- 
lenger and the other as the defender. The first member of the 
dyad that attempts to overturn the status quo, either by 
making a militarized threat or by taking military action, is the 
challenger. The state against which such threats or activities 
are targeted is the defender. In addition, any state that 
attempts to prevent a challenger from militarily altering the 
status quo with regard to a third party is also considered a 
defender. In the Korean War, for example, North Korea’s 
invasion of the South makes it the challenger in that dyad. 
The American response to that invasion necessitates a sepa- 
rate dyad, in which North Korea is the challenger against the 
United States. China became involved in response to the 
American decision to cross the 38th parallel, so we code the
United States as the challenger against China in a third crisis
dyad, and so on.
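
The merging rule in step (1) can be sketched in code. The following is only an illustration of the rule as stated (same two participants, prior crisis ended without a negotiated solution, less than one year before combat resumes). The `Crisis` record, its field names, the 365-day cutoff, and the example dates are all assumptions made for this example, not the ICB's or the authors' actual coding scheme.

```python
from dataclasses import dataclass
from datetime import date
from typing import List


@dataclass
class Crisis:
    participants: frozenset   # the two states involved (hypothetical field)
    start: date
    end: date
    negotiated_end: bool      # True if the crisis ended in a negotiated solution


def merge_crises(crises: List[Crisis], max_gap_days: int = 365) -> List[Crisis]:
    """Merge consecutive crises between the same pair of states.

    Two crises are merged when the earlier one ended without a negotiated
    solution and fewer than max_gap_days elapsed before the next one began.
    """
    merged: List[Crisis] = []
    ordered = sorted(crises, key=lambda c: (sorted(c.participants), c.start))
    for crisis in ordered:
        prev = merged[-1] if merged else None
        if (
            prev is not None
            and prev.participants == crisis.participants
            and not prev.negotiated_end
            and (crisis.start - prev.end).days < max_gap_days
        ):
            # Extend the earlier record; the later crisis decides how it ended.
            merged[-1] = Crisis(prev.participants, prev.start,
                                max(prev.end, crisis.end), crisis.negotiated_end)
        else:
            merged.append(crisis)
    return merged


# Hypothetical records, loosely modeled on the Mozambique-Rhodesia example.
pair = frozenset({"Mozambique", "Rhodesia"})
records = [
    Crisis(pair, date(1976, 6, 1), date(1977, 3, 1), False),
    Crisis(pair, date(1977, 9, 1), date(1978, 5, 1), False),
    Crisis(pair, date(1978, 11, 1), date(1980, 3, 1), True),
]
result = merge_crises(records)
print(len(result), result[0].start, result[0].end)
```

Under these assumptions the three hypothetical records collapse into a single conflict with one final outcome, which is the behavior the rule is meant to produce for the dependent variable.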


Applying these coding rules to the ICB data identified 422 
dyads involved in 283 international crises between 1918 and 
1994. In a few instances we could not determine a state’s 
domestic regime type because it was in flux during the crisis, 
or we were unable to gather data on a state’s military 
capabilities or alliance status. Consequently, our final analy- 
ses include between 409 and 416 international crisis dyads. 
[...] governments redistribute income, but very few
can explain why this often is done in a socially inefficient form. In the theory we develop, compared
to efficient methods, inefficient redistribution makes it more attractive to stay in or enter a group that
receives subsidies. When political institutions cannot credibly commit to future policy, and when the political
influence of a group depends on its size, inefficient redistribution is a tool to sustain political power. Our
model may account for the choice of inefficient redistributive policies in agriculture, [...] inefficient
[...] may be more prevalent.


There are many normative and positive theories
that explain why governments redistribute in- 
come. For example, most positive theories of 
politics typically involve a group that redistributes 
resources and income away from other groups to itself. 
Redistribution also may be undertaken for normative 
reasons; for example, the distribution of income and 
welfare generated by market outcomes may be judged 
unfair or undesirable by some ethical criterion. We 
lack a satisfactory understanding, however, of why 
redistribution often takes an inefficient form.1 
A common example of income redistribution that 
takes an inefficient form is farmers’ receiving price 
supports or input subsidies. Such policies distort rela- 
tive prices and discourage the reallocation of produc- 
tive resources away from agriculture and into other 
sectors, such as manufacturing, where they could be 
better used. Similarly, despite economists’ conviction 
that free trade is typically efficient, domestic industries 
are often protected by tariffs and quotas. A particularly 
interesting and relatively neglected example in which 
the form of redistribution appears to be inefficient is 
labor market regulation. Although firing costs and such 
restrictive labor practices as closed shops are wide- 
spread in most countries, they are thought to be highly 
inefficient because they disrupt labor reallocation and 
cause unemployment. 
In all these cases, it is difficult to argue that the 
particular form of the policy is correcting a market 
failure. Rather, it seems aimed simply at redistributing
and notes that the prevalence of inefficient redistribution is a major


saying Sir Edmund Hillary had to climb Mt. Everest because he
wanted to get some fresh air. There was surely an easier way of
accomplishing that objective!”


income. For instance, no scholars appear to argue that 
price supports for farmers, which have the effect of 
increasing farm output, promote efficiency because 
without them there would be too few resources in 
agriculture. This might be the case if farm output 
generated positive externalities, but that seems implau- 
sible. Instead, it is widely agreed that price supports are 
simply a way to raise farmers’ incomes. If this is correct, 
then they are Pareto inefficient in the sense that farm 
incomes could be maintained, and everyone else made 
better off, by a form of redistribution that did not 
involve resource misallocation. A simple transfer to 
raise the income of the farmers by as much as the 
inefficient policy yields would constitute an actual 
Pareto improvement. 

We present a theory of inefficient redistribution that 
builds on two basic assumptions. First, the political 
system cannot commit today to future policies, since 
they will be determined by whoever has political
power in the future. Second, at least over some range, 
political power increases with group size. Under these 
conditions, inefficient redistribution may arise as a way 
to expand or maintain the size of a group in order to
guarantee its future political power. 

Consider the example of price support for agricul- 
ture. Imagine that farmers have sufficient political 
influence to induce the government to redistribute 
income to them, and this can take the form of a simple 
money transfer to current farmers or a price subsidy. 
The latter is relatively inefficient as it potentially avoids 
the reallocation of resources to sectors in which they 
can be used more productively.2 Our key observation is
that the political equilibrium may nonetheless entail 
price subsidies because that form of redistribution 
affects the decision to remain in farming and encour- 
ages new agents to enter, in a way that lump-sum 
transfers would not. Everything else equal, farmers 
would not want to encourage newcomers, who increase 
competition both for transfers and in the marketplace, 
but if future political power and ability to extract 


2 Notice, however, that in a dynamic world the expectation of future

further redistribution depend on the number of farmers,
price subsidies may be preferred.

In some sense, our analysis extends Becker's (1985, 
338) insight that “a satisfactory analysis of the choice of 
method must consider whether the influence function 
itself depends on the methods used.” In our example, 
to ensure future transfers it is necessary for farmers to 
retain their political power, and they achieve this by 
choosing a relatively inefficient method of redistribu- 
tion; it discourages farmers from changing sectors and 
encourages new agents to enter agriculture. The same 
argument may apply to other instances of inefficient 
redistribution, such as trade policy and labor market 
regulation, and suggests that inefficient methods in 
those groups also may be chosen to preserve the 
constituencies who favor the redistributive policies. 

It is useful to distinguish between two categories of 
inefficient redistributive policies. The first type, “inefficient
targeting,” is our main focus. An example is
subsidies to farmers who have been in farming for a 
specified period rather than subsidies to current farm- 
ers. This policy is inefficient because it encourages 
people to enter a sector where their productivity is 
likely to be low. Inefficient targeting affects the extensive
margin. The second type, “inefficient conditioning,”
distorts the intensive margin: the marginal production
decision of agents. An example is price
subsidies rather than a constant transfer to every 
farmer who remains in farming; price subsidies encour- 
age production beyond an efficient amount. 

We are primarily concerned with inefficient target- 
ing, but our theory also provides a rationale for ineffi- 
cient conditioning. Although farmers do not want their 
numbers to shrink, they also do not want many more 
people to enter the sector and reduce per-capita trans- 
fers. If the redistributive policy gives a constant trans- 
fer to every farmer, many additional people may claim 
to be farmers or may enter farming. Policies that 
condition redistribution on production, such as acreage 
controls, may be a way to prevent excess entry. 

In addition to explaining the choice of inefficient 
methods of redistribution, our analysis leads to a 
number of interesting comparative static results. First, 
inefficient redistribution is more likely to arise when 
the political power of influential groups is contested, 
for example, when an industry and its voting power are 
declining. This result is consistent with evidence that 
declining industries receive the most distortionary 
transfers (Baldwin 1985; Rodrik 1996). Second, and 
most important, when factors of production are less 
specific to a sector, there may be more inefficient 
redistribution. This result is intuitive: The rationale for 
inefficient redistribution is to prevent marginal agents 
from leaving the sector (as well as encourage new 
entrants). When there is less specificity, redistribution 
needs to be more inefficient to convince marginal
agents to stay. This contrasts with theories that suggest 
specificity of factors should increase lobbying and 
rent-seeking behavior (e.g., Alt et al. 1996; Brainard 
and Verdier 1994; Coate and Morris 1999). Although 
these theories do not explain why redistribution is 
made inefficiently, they suggest that redistribution, and
hence inefficient redistribution, should be more prevalent
when factors are more specific. Yet, in many of the
common examples of inefficient policies, there appears 
to be, if anything, less specificity than in other sectors. 
A case in point is consumer goods industries, such as 
textiles, which often receive more trade protection than 
other industries (e.g., Ray 1991; Rodrik 1996). This
pattern is consistent with our comparative static result. 

The two basic assumptions of our analysis are plau- 
sible and receive empirical support. First, the fact that 
the political system today cannot commit to future 
redistribution policy seems to be an intrinsic feature of 
democracy, although it can be ameliorated to some
extent. Constitutions place restrictions on some types 
of policies, but they seldom constrain taxes and subsi- 
dies. A large literature in political science has noted, 
however, that aspects of the U.S. Congress and of 
democratic institutions more generally may foster com- 
mitment to certain types of policies. This is implicit in 
ideas stemming from work on “structure-induced equi- 
librium” initiated by Shepsle (1979). For example, 
Marshall and Weingast (1988) discuss ways in which 
many congressional institutions help mitigate future 
commitment problems. (See Weingast 1998 for a fas- 
cinating study of how political institutions aided com- 
mitment in the nineteenth century.) Other important 
instances are independent central banks (Cukierman 
1992), long-lived political parties (Alesina and Spear 
1988; Jones, True, and Baumgartner 1997), executive 
and legislative interaction (Martin 2000), and regula- 
tory commissions (Lowi 1969). This literature suggests 
that certain forms of commitment problems can be 
overcome, but it also attests to the importance of those 
problems in democratic politics. 

Second, the notion that group size, at least over 
some range, increases political power is consistent with 
the empirical evidence. Olson (1965) emphasizes that 
free-rider problems affect the political organization of 
large groups, and it is possible to build models in which 
small groups are more powerful (Lohmann 1998). 
Nevertheless, Olson’s analysis does not imply that 
small groups have more power; rather, they find it 
easier to solve the collective action problem. But if 
large groups can solve the collective action problem, by
creating private goods or other specific incentives to
induce potential members to join,3 then they may well
be more powerful. 

For our theory to apply, size must be an asset in the 
sense that it can be used to increase the per-capita 
welfare of a group, at least over some range, for groups 
that solve the collective action problem. Evidence from 


3 Moe (1980) discusses various ways in which large groups can
circumvent free-riding by providing different sorts of incentives. An
interesting idea developed by Arnold (1990) and Wittman (1995) is
that political entrepreneurs have an incentive to solve the collective
action problem of large groups, and there are many examples of this.
To mention just one, Bates (1997) shows that the national association
of coffee growers in Colombia was created in the 1920s as a result of
political entrepreneurship; it overcame the considerable collective
action problems faced by coffee growers, who were mostly smallholders,
and gave them significant political power.


American Political Science Review 


Vol. 95, No. 3 




democratic societies supports this assumption.4 Although
some studies find that smaller groups receive
larger transfers, the majority of empirical work finds 
size to be an asset (e.g., Becker 1986; Kristov, Lindert, 
and McClelland 1995; Sloof 1998). The history of some 
notable interest groups supports this conclusion. Both 
the National Rifle Association (NRA) and the Chris- 
tian Coalition became powerful national forces after 
increasing their membership significantly. In the NRA 
case, Davidson (1993, 49) shows that this occurred 
under Harlon Carter, when membership rose rapidly
from one million in 1977 to 2.6 million in 1983, 
precisely when it obtained national influence. “The 
advertising campaign was just one part of an all out 
NRA effort to boost membership.” Davidson (p. 49) 
quotes John Aquilino, NRA director of public infor- 
mation for ten years: “Harlon saw that power is in 
numbers.” 

The consensus among political scientists seems to 
support the view that size is an asset in political 
conflict. For example, Cameron (1988, 572) writes: 
“Size represents an important resource in the struggle 
and conflict amongst groups... . Individuals may have 
more incentive to form groups if the potential mem- 
bership is large and thus allows them to anticipate 
greater power and hence greater collective rewards.” 
In the farming context scholars continually stress this 
point. Hansen (1991, 7) argues that “the farm lobby [in 
the United States] as a whole... suffered a marked 
setback in the sixties, seventies and eighties. As people 
migrated away from farms, the agricultural organiza- 
tions represented fewer and fewer constituents..., 
and the responsiveness of the Agriculture Committee 
and the Congress declined.” Kindleberger (1951) and 
Tracy (1989) suggest that the greater numbers of voters 
in farming groups in France and Germany as compared 
to Britain explain why farmers in those two countries
obtained tariff protection in the 1880s but British 
farmers did not. The success of the large Scandinavian 
unions and the relative failure of the smaller U.S. and 
British unions is also consistent with the hypothesis 
that size matters for political power.5 

Three arguments that may account for inefficient 
redistribution have been suggested in the literature. 
The first, which to our knowledge has not been formal- 
ized, is that inefficient methods may be harder to 
reverse; thus, when the political system cannot directly 
commit to future decisions, inefficient methods serve in 
effect as commitment. In the United States, for exam- 
ple, congressional appropriations expire after two 
years, but tariffs and regulations persist until the statute 
that created them is repealed.6 Our theory develops





4 In contrast, in undemocratic societies, size may be a liability
because large groups provide potential tax revenues for the rulers.
The large number of farmers in Soviet Russia and some African
countries may explain why they have been heavily taxed (see Bates
1981).

5 This evidence is consistent with our hypothesis but, of course, is not
definitive since the direction of causality is uncertain. We thank a
referee for pointing out this important caveat.
6 We are grateful to a referee for providing this example. 


this line of reasoning by endogenously linking the 
persistence of a policy to its form.

The second argument, implicit in the work of 
Buchanan and Tullock (1962) and formalized by Ro- 
drik (1986), Wilson (1990), and Becker and Mulligan 
(1998), is that if the amount of redistribution is endog- 
enous, then politicians may want to use inefficient 
methods in order to reduce total redistribution (see 
also Dixit, Grossman, and Helpman 1997; Grossman 
and Helpman 1994; and Staiger and Tabellini 1987 for 
models with related results). This theory rests, how- 
ever, on the arbitrary assumption that politicians can 
commit to the form of redistribution but not to the 
level. The frequent changes in the composition of taxes 
in the United States go against this assumption. 

Coate and Morris (1995), partially building on an 
argument by Tullock (1983), offer a third explanation: 
Politicians who care about a certain group exploit 
voters’ uncertainty regarding which policies are efficient.
Whereas a lump-sum redistribution to farmers
would reveal that a politician cares about that group at 
the expense of others, a price subsidy can be disguised 
as a Pigouvian subsidy aimed at correcting some market
failure.7 There are two potential problems with this
interesting explanation. First, only those inefficient 
policies that might in some circumstances be consid- 
ered efficient can be used; otherwise, voters would see 
through them (see Austen-Smith 1991). Second, it 
must be the case that neither the party in power nor a 
rival can tax farmers after giving the price subsidy, 
which would reveal that they are redistributing truly for 
efficiency reasons, not because they care about farmers. 

In other related work, Dixit and Londregan (1995) 
construct a model in which the inability of politicians to 
commit to future transfers prevents efficient realloca- 
tion of agents: Farmers who currently receive trans- 
fers—perhaps because they are the swing voters— 
realize that if they switch to manufacturing they will 
lose these transfers. The model explains why redistri- 
bution may lead to inefficiencies but not why the form 
of redistribution is inefficient. In the same spirit, Alt 
and colleagues (1996) argue that when policy is endog- 
enous, agents can take actions (e.g., specific invest- 
ments) that induce future redistribution, thus prevent- 
ing exit from a declining industry. Saint-Paul (1992) 
notes that workers who currently have secure jobs 
(so-called insiders) may oppose two-tier wage systems 
that would remove firing costs for newcomers, antici- 
pating that this will reduce future political support for 
firing costs, but he does not pursue this idea to develop 
an explanation for inefficient redistribution. A number 
of researchers analyze the trade-off between lobbying 
and campaign contributions as methods of influencing 
policy (e.g., Austen-Smith 1995; Lohmann 1995) but 
focus neither on the economic efficiency of the meth- 
ods nor the influence of policy on future political 
power. 

Our article is organized as follows. We first outline a 





7 Such a subsidy, as first argued by A.C. Pigou, would be the standard
government policy prescription in the case of a positive externality
(see Green, Mas-Colell, and Whinston 1995, 355).

simple two-period model in which inefficient redistribution
is a political equilibrium. The next section
reveals that the extent of inefficient redistribution may 
increase when a sector requires less specific skills and 
investments. We then discuss a range of real-world 
redistributive policies and argue that inefficient meth- 
ods arise, at least in part, because of the reasons 
emphasized in our model. 


THE BASIC MODEL 


We chose a simple reduced-form model of interest 
group behavior (along the lines of Becker 1983) to 
highlight the major trade-offs because the same type of 
interactions arise in a variety of settings. What is 
important is that the two key assumptions—size of 
political group matters over a certain range and no 
commitment to future political decisions—apply. 

Consider a two-period economy (periods 0 and 1) 
with a single consumption good produced by one of 
two sectors, farming and manufacturing. They can be 
thought of as producing different goods that are perfect 
substitutes.8 In the first period there are $1 - \delta$ agents,
with a fraction $n_0$ in farming and $1 - n_0$ in manufacturing.
These agents cannot change sector. All are risk
neutral and discount the second period by a factor $\beta \in (0,1)$.
In each period, a farmer produces output $B$, and
a manufacturer produces output $A$, with $A > B$. We
assume that farmers cannot be taxed (i.e., they can hide
their output costlessly), but manufacturers can be taxed
a maximum of $T$ (i.e., they can hide their output at a
cost of $T$), where $T < A$.9 At the beginning of period
0, $\delta$ new agents arrive and choose which sector to enter.
This decision is irreversible. There are no new agents in
period 1.

Let $\tau_0$ and $\tau_1$ denote the tax on manufacturers in
periods 0 and 1, respectively, where $\tau_t \in [0,T]$ for $t = 0,1$.
The tax revenue, if any, can be redistributed to
farmers in two distinct forms. The first is a transfer to
agents who are farming at the beginning of the period,
denoted by $\theta_t \geq 0$ for $t = 0,1$. The second is a general
price subsidy for all farmers, denoted by $\mu_t \geq 0$.10 The
difference between $\mu_0$ and $\theta_0$ is that only those farming
at $t = 0$ receive $\theta_0$, whereas $\mu_0$ is also paid out to agents
who enter farming at time $t = 0$. $\theta_0$ therefore approximates
an efficient transfer, as it is conditioned on
characteristics outside the agent’s control. In contrast,
because $\mu_0$ subsidizes farm output and encourages new
agents to enter farming, it is an inefficient method of


8 The case of imperfect substitutes does not alter our results but
complicates the expressions.

9 The assumption that farmers cannot be taxed is made for simplicity
only and is not required for any of our results. If manufacturers could
tax farmers, that would simply increase the desire of farmers to
maintain their power and, thus, their incentives to use inefficient
methods of redistribution. All we require is that farmers receive
per-capita net benefits from the political system that, at least over
some range, increase with the number of farmers.

10 In the model, output per farmer is exogenous, so a per-capita 
subsidy to all farmers, new and old, is the same as a price subsidy. 
More generally, in a model with variable production, there would 
also be a difference between price subsidies and per-capita subsidies 
to all farmers. 




redistribution. ($\mu_1$ will be redundant, since only in
period 0 is there a distinction between current farmers
and potential entrants, so we ignore it in the rest of our
analysis.)

It is clear that, ignoring political economy considerations,
existing farmers prefer $\theta$ transfers to $\mu$ transfers,
because they do not have to share the former with
new arrivals. Our key endeavor in this section is to
demonstrate that political economy considerations may
nonetheless encourage existing farmers to choose $\mu$
transfers.

To discuss these issues in the simplest possible way, 
we assume a reduced-form political process that deter- 
mines the current tax rate on manufacturers as a 
function of the number of farmers. More explicitly, the 
tax rates in the two periods are: 


$$\tau_0 = \tau(n_0) \in [0,T], \qquad \tau_1 = \tau(n_1) \in [0,T]. \tag{1}$$


The assumption $\tau \geq 0$ incorporates the fact that
farmers cannot be taxed. Notice that the tax rate in
period $t$ is a function of the fraction of the population
in farming at the time. To simplify the discussion, we
assume that the function $\tau$ satisfies the following two
conditions.

1. If $n \leq n^-$ then $\tau(n) = 0$.

2. If $n \geq n^*$ then $\tau(n) = T$.

The function that maps from the fraction of farmers
into taxes on manufacturers, $\tau(\cdot)$, can be locally increasing
or decreasing, although we assume that it
takes a higher value at $n^*$ than at $n^-$. This is reasonable
because when there are very few farmers, they will
not have the political power to impose taxes on manufacturers.
Finally, we assume that the division of the
tax revenue between $\theta$ and $\mu$ subsidies is decided only
by farmers.
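
The two conditions pin down the tax function only outside the interval between the two thresholds. As a minimal sketch, assuming a piecewise-linear shape in between, one admissible choice could look like the following. The linear interpolation, the name `tau`, and the default threshold values are illustrative assumptions and are not part of the model.

```python
def tau(n, n_low=0.2, n_high=0.5, T=1.0):
    """Hypothetical tax on manufacturers as a function of the farming share n.

    Satisfies the two stated conditions: zero at or below the lower
    threshold, and the maximum T at or above the upper threshold.
    The linear segment in between is an assumption made for illustration.
    """
    if n <= n_low:
        return 0.0
    if n >= n_high:
        return T
    return T * (n - n_low) / (n_high - n_low)
```

Any other shape between the thresholds, including one that is locally decreasing, would satisfy the two conditions equally well.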

The timing of political and economic events is as 
follows. In period 0, the political economy process 
determines $\tau_0$, and the farmers decide $\theta_0$ and $\mu_0$.
Young agents are born, observe the policy vector, and
decide which sector to enter. Production takes place,
and the policy is implemented. At the beginning of
period 1, the political process determines $\tau_1$ and $\theta_1$.
The model ends following production and implementation
of the chosen policy. If we define $x$ as the
fraction of new agents who enter farming at time $t = 0$, then
the government budget constraints in the two periods
can be written as:


$$(1 - \delta)(1 - n_0)\tau_0 = (1 - \delta)n_0(\theta_0 + \mu_0) + \delta x \mu_0, \tag{2}$$

$$(1 - n_1)\tau_1 = n_1\theta_1. \tag{3}$$


In equation 2, $(1 - \delta)(1 - n_0)\tau_0$ is total tax revenue,
$(\theta_0 + \mu_0)$ is the total per-capita transfer to the
$(1 - \delta)n_0$ existing farmers, and $\mu_0$ is the inefficient transfer
received by the $\delta x$ newcomers. In equation 3,
$(1 - n_1)\tau_1$ is total tax revenue, which is distributed among $n_1$
farmers. Note that young agents who go into manufacturing
are not taxed in period 0, and they also may not
receive any transfers when they go into farming (i.e., if
$\mu_0 = 0$). Although the political process can discriminate
between young and old farmers in period 0, this is
not possible in period 1.11
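
The two budget constraints can be solved for the transfers that exactly exhaust tax revenue. The sketch below does this for given values of the tax, the subsidy, and the entry fraction. The function and argument names are hypothetical, and the requirement that transfers be non-negative is left to the caller.

```python
def theta0_from_budget(n0, delta, tau0, mu0, x):
    """Targeted period-0 transfer per existing farmer implied by equation 2.

    Revenue (1 - delta) * (1 - n0) * tau0 pays theta0 + mu0 to each of the
    (1 - delta) * n0 existing farmers and mu0 to each of the delta * x entrants.
    """
    revenue = (1 - delta) * (1 - n0) * tau0
    existing_farmers = (1 - delta) * n0
    return (revenue - delta * x * mu0) / existing_farmers - mu0


def theta1_from_budget(n1, tau1):
    """Period-1 transfer per farmer implied by equation 3."""
    return (1 - n1) * tau1 / n1
```

For a fixed amount of revenue, a larger subsidy or a larger entry fraction leaves less for the targeted transfer, which is why existing farmers would prefer the targeted transfer if political power were not at stake.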

Let $V^f$ and $V^m$ be the expected utilities (at time 0) of
old farmers and manufacturers, respectively. Let $W^f$
and $W^m$ be the expected utilities (at time 0) of new
agents who choose farming and manufacturing, respectively.
Then,


$$V^f(\theta_0, \mu_0, \theta_1) = B + \theta_0 + \mu_0 + \beta[B + \theta_1], \tag{4}$$

$$V^m(\tau_0, \tau_1) = A - \tau_0 + \beta[A - \tau_1], \tag{5}$$

$$W^m(\tau_1) = (1 + \beta)A - \beta\tau_1, \tag{6}$$

$$W^f(\mu_0, \theta_1) = (1 + \beta)B + \mu_0 + \beta\theta_1. \tag{7}$$


Newcomers make their occupational choice after observing
$\mu_0$, a variable relevant for their payoffs. Their
strategy is therefore conditioned on $\mu_0$, and we write
the fraction of new agents who go into farming when
the subsidy is $\mu$ as $x(\mu)$. Then, the optimal sectoral
choice of new agents in period 0 is:


$$x(\mu) = 0 \ \text{ if } W^m(\tau_1) > W^f(\mu, \theta_1),$$
$$x(\mu) = 1 \ \text{ if } W^m(\tau_1) < W^f(\mu, \theta_1), \text{ and}$$
$$x(\mu) \in [0,1] \ \text{ if } W^m(\tau_1) = W^f(\mu, \theta_1). \tag{8}$$


$x(\mu)$ defines the best response function (correspondence)
of newcomers for all possible levels of subsidy.
Observe in particular that this function determines
their best response not only for the level of subsidy
along the equilibrium path, $\mu_0$, but also for all $\mu$, and it
helps us determine optimal behavior off the equilibrium
path.
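
The best-response correspondence in equation 8 can be written as a small function. This is a sketch under stated assumptions: the period-1 tax and transfer are taken as given (in equilibrium they depend on entry through the farming share), the correspondence is returned as an interval of optimal entry fractions, and the tolerance `tol` for detecting indifference is a numerical convenience. All names are hypothetical.

```python
def W_m(A, beta, tau1):
    """Lifetime utility of a newcomer who enters manufacturing (equation 6)."""
    return (1 + beta) * A - beta * tau1


def W_f(B, beta, mu, theta1):
    """Lifetime utility of a newcomer who enters farming (equation 7)."""
    return (1 + beta) * B + mu + beta * theta1


def best_response(mu, theta1, tau1, A, B, beta, tol=1e-12):
    """Return (lowest, highest) optimal entry fraction for a subsidy mu."""
    gap = W_f(B, beta, mu, theta1) - W_m(A, beta, tau1)
    if gap < -tol:
        return (0.0, 0.0)   # manufacturing strictly better: nobody enters farming
    if gap > tol:
        return (1.0, 1.0)   # farming strictly better: everybody enters farming
    return (0.0, 1.0)       # indifference: any fraction is a best response
```

Returning an interval rather than a single number keeps the indifference case explicit, which matters because interior entry fractions can only arise when newcomers are exactly indifferent.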

The fraction of farmers in the population at time $t = 1$
is then


$$n_1 = (1 - \delta)n_0 + \delta x. \tag{9}$$


A pure strategy subgame perfect Nash equilibrium is a
tuple, $\{x(\mu), n_1, \tau_0, \theta_0, \mu_0, \tau_1, \theta_1\}$, such that equations 2,
3, and 9 hold; $\tau_0 = \tau(n_0)$ and $\tau_1 = \tau(n_1)$; the function $x(\mu)$
is defined by equation 8; and $\{\theta_0, \mu_0\}$ maximizes $V^f$.

The fact that $\tau_1 = \tau(n_1)$ at time 1 builds in the
assumption that the political system cannot commit to 
future redistribution. This is a crucial ingredient in our 
explanation, because it provides a reason for farmers to 
increase their numbers in period 1 to achieve greater 
political power. 

To simplify the discussion, we make the following 
assumption.


ASSUMPTION 1. $(1 + \beta)(A - B) > 2\beta T$.


This implies that the maximum tax rate is small relative 
to the productivity differential between the two sectors 
and ensures that it is not worthwhile to go into farming 
only to receive future transfers. 

Let us start with the case in which $n_0 \leq n^-$, so that
$\tau_0 = 0$, hence $\theta_0 = \mu_0 = 0$. In this case, there are too
few farmers at $t = 0$ for them to have any power, so


11 In period 0, the system discriminates between incumbents and
newcomers. In period 1, there are no newcomers, and all farmers are
incumbents.

there is no redistribution. As a result, there exists a
unique equilibrium in which all young agents go into
manufacturing. Specifically, with $\tau_0 = 0$, $\mu_0 = 0$,
assumption 1 ensures that $W^f(\mu_0 = 0) \leq (1 + \beta)B + \beta T < (1 + \beta)A - \beta T \leq W^m(\tau_1)$.
Therefore, we have:


Proposition 1. Suppose assumption 1 holds and $n_0 < n^-$;
there then exists a unique equilibrium with $n_1 = (1 - \delta)n_0$,
$\tau_0 = \tau_1 = \theta_0 = \theta_1 = \mu_0 = 0$, and $x(\mu_0 = 0) = 0$.


Next, consider the case in which $n_0 > n^*/(1 - \delta)$.
Farmers are numerous enough so that even when $x = 0$,
they retain maximal power. Therefore, they choose
$\tau_0$, $\tau_1$, $\theta_0$, $\theta_1$, and $\mu_0$ to maximize $V^f$, which gives $\tau_0 = \tau_1 = T$,
$\mu_0 = 0$, and $\theta_t = (1 - n_t)T/n_t$ for $t = 0,1$. To
characterize an equilibrium completely, we only have
to determine $x$ and $n_1$. Notice that in this case


$$W^f = (1 + \beta)B + \beta\theta_1, \quad \text{and}$$

$$W^m = (1 + \beta)A - \beta T.$$


Assumption 1 implies that $W^m > W^f$, and $x(\mu = 0) = 0$
(although in this case, $x(\mu)$ would be positive for $\mu$
sufficiently large).

Proposition 2. Suppose assumption 1 holds and
$n_0 > n^*/(1 - \delta)$; there then exists a unique equilibrium
such that $\tau_0 = \tau_1 = T$, $\mu_0 = 0$, $\theta_0 = (1 - n_0)T/n_0$,
$x(\mu_0 = 0) = 0$, $n_1 = (1 - \delta)n_0$, and
$\theta_1 = (1 - (1 - \delta)n_0)T/((1 - \delta)n_0)$.
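
The equilibrium values in proposition 2 follow mechanically from the two budget constraints once the tax is at its maximum and no newcomer enters farming. A minimal sketch, assuming the caller has already checked that the initial farming share is large enough for this case to apply; the function name and dictionary keys are hypothetical.

```python
def proposition2_equilibrium(n0, delta, T):
    """Equilibrium values when farmers keep maximal power without any entry."""
    n1 = (1 - delta) * n0              # no newcomer enters farming (x = 0)
    return {
        "tau0": T,                     # maximum tax in both periods
        "tau1": T,
        "mu0": 0.0,                    # no price subsidy is used
        "x": 0.0,
        "n1": n1,
        "theta0": (1 - n0) * T / n0,   # period-0 revenue per existing farmer
        "theta1": (1 - n1) * T / n1,   # period-1 revenue per farmer
    }
```

Because the farming share shrinks between periods while the tax stays at its maximum, the per-farmer transfer is larger in period 1 than in period 0.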


In both propositions 1 and 2, the equilibrium maxi- 
mizes output, and the form of redistribution is efficient. 
Although there is redistribution, no production or 
occupational decisions are distorted. The reason for 
this efficiency is that political power is not contested. 
When $n_0 < n^-$, manufacturers have total political
power, and this can never be transferred to farmers.
Similarly, when $n_0 > n^*/(1 - \delta)$, farmers have maximal
political power and always retain it, even if all newcom- 
ers go into manufacturing. This highlights the main 
conclusion of our analysis that inefficient redistribution 
will arise in order to control political power. 

Now consider the most important case for our 
analysis: $n^- < n_0 < n^*/(1 - \delta)$. Farmers have some
political power in period 0, and the extent of their
political power at period 1 depends on the actions of
newcomers. It is straightforward from the analysis in
proposition 2 that if $\mu_0 = 0$, newcomers will prefer to
enter manufacturing. Therefore, farmers may want to
use $\mu_0 > 0$, that is, inefficient redistribution, in order to
attract newcomers into farming and increase their 
political power. 

Substituting from equations 2 and 3 into equation 4,
the utility of old farmers can be written as


$$V^f = (1 + \beta)B + \theta_0 + \mu_0 + \beta\phi(n_1), \tag{10}$$

where

$$\phi(n_1) \equiv \frac{\tau(n_1)(1 - n_1)}{n_1} = \frac{\tau((1 - \delta)n_0 + \delta x)\,(1 - (1 - \delta)n_0 - \delta x)}{(1 - \delta)n_0 + \delta x}$$

is per-capita redistribution at $t = 1$.
To attract newcomers, farmers need to provide them
with at least as much utility in farming as in manufacturing,
hence


$$W^f \geq W^m, \tag{11}$$


where $W^m$ and $W^f$ are given by equations 6 and 7. Let
us now define


$$U^f(x) = (1 + \beta)B + \beta\phi(n_1) = W^f - \mu_0$$


as the utility of an agent who enters farming when a
fraction $x$ of newcomers enter farming and there is no
inefficient redistribution (i.e., $\mu_0 = 0$). Also, define
$U^m(x) = W^m$ as the utility of an agent who enters
manufacturing when a fraction $x$ of newcomers enter
farming. Now $x > 0$ requires that $\mu_0 \geq U^m(x) - U^f(x)$
so as to convince newcomers to enter farming. Moreover,
existing farmers would never want to pay more
than necessary to newcomers, so we first start with the
following case.


$$\mu_0 = U^m(x) - U^f(x) = (1 + \beta)(A - B) - \beta(\phi(n_1) + \tau(n_1)).$$


Solving equation 2 for $\theta_0 + \mu_0$, we can write the return
to old farmers when they ensure that a fraction $x$ of
newcomers enter farming, $V^f(x)$, as


$$V^f(x) = (1 + \beta)B + \beta\phi(n_1) + \frac{(1 - \delta)(1 - n_0)\tau_0 - \delta x\,[U^m(x) - U^f(x)]}{(1 - \delta)n_0}. \tag{12}$$


Let $V^f$ be their utility when $\mu_0 = 0$. Notice that $V^f(x = 0) = V^f$
because when $\mu_0 = 0$ and no new agents are
entering farming, $x = 0$, so the fact that $\mu_0 = U^m(x) - U^f(x)$
does not matter. Whether farmers prefer to use
inefficient methods of redistribution and so attract
newcomers depends on


$$\frac{dV^f(x = 0)}{dx} = \delta\left(\beta\phi'((1 - \delta)n_0) - \frac{U^m(x = 0) - U^f(x = 0)}{(1 - \delta)n_0}\right). \tag{13}$$


The first term in parentheses is the benefit of attracting
some of the newcomers, and the second term is the cost
of doing so per existing farmer. If expression 13 is
positive, then the utility of old farmers can be increased
by attracting some newcomers. In this case farmers will
design the redistribution system to be inefficient specifically
to increase their numbers.
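
The sign of expression 13 can be checked numerically for a concrete parameterization. The sketch below is illustrative only: the piecewise-linear tax function, the threshold values, and every number in `Params` are assumptions chosen so that assumption 1 holds and the initial farming share lies in the contested range. None of them come from the model itself, and all names are hypothetical. The code compares a finite-difference slope of the old farmers' payoff at zero entry with the closed-form benefit-minus-cost expression.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Params:
    A: float = 1.2         # manufacturing output (assumed value)
    B: float = 1.0         # farming output (assumed value)
    beta: float = 0.9      # discount factor (assumed value)
    T: float = 0.2         # maximum tax (assumed value)
    delta: float = 0.1     # mass of newcomers (assumed value)
    n0: float = 0.55       # initial farming share (assumed value)
    n_low: float = 0.45    # lower power threshold (assumed value)
    n_high: float = 0.50   # upper power threshold (assumed value)


def tau(n, p):
    """Assumed piecewise-linear tax function between the two thresholds."""
    if n <= p.n_low:
        return 0.0
    if n >= p.n_high:
        return p.T
    return p.T * (n - p.n_low) / (p.n_high - p.n_low)


def phi(n, p):
    """Per-capita redistribution to farmers in period 1."""
    return tau(n, p) * (1 - n) / n


def n1(x, p):
    """Farming share in period 1 when a fraction x of newcomers enters."""
    return (1 - p.delta) * p.n0 + p.delta * x


def mu_needed(x, p):
    """Subsidy that leaves newcomers indifferent between the two sectors."""
    n = n1(x, p)
    return (1 + p.beta) * (p.A - p.B) - p.beta * (phi(n, p) + tau(n, p))


def V_f(x, p):
    """Payoff of old farmers when they induce entry fraction x (equation 12)."""
    old = (1 - p.delta) * p.n0
    revenue = (1 - p.delta) * (1 - p.n0) * tau(p.n0, p)
    return ((1 + p.beta) * p.B + p.beta * phi(n1(x, p), p)
            + (revenue - p.delta * x * mu_needed(x, p)) / old)


def slope_at_zero(p):
    """Closed form of expression 13; valid where tau is linear at (1-delta)*n0."""
    n = (1 - p.delta) * p.n0
    tau_prime = p.T / (p.n_high - p.n_low)
    phi_prime = tau_prime * (1 - n) / n - tau(n, p) / n ** 2
    return p.delta * (p.beta * phi_prime - mu_needed(0.0, p) / n)
```

With these illustrative numbers the slope is positive, so old farmers would gain from attracting a few newcomers. A flatter assumed tax function would shrink the benefit term and could reverse the sign.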


12 If $\mu_0$ is greater than $U^m(x) - U^f(x)$, all newcomers will enter
farming. We discuss this case below.

This expression also makes it clear that farmers will
only want to use inefficient redistribution when an
increase in their numbers leads to larger per-capita
transfers, $\phi(n_1)$. This implies that taxes imposed on
manufacturers should increase sufficiently in $n_1$ to
ensure larger transfers to farmers.

We can now state a key result. 


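Proposition 3. If


$$\beta\phi'((1 - \delta)n_0) > \frac{1}{(1 - \delta)n_0}\left[U^m(x = 0) - U^f(x = 0)\right], \tag{14}$$

then there will be inefficient redistribution, that is, $\mu_0 > 0$.
In equilibrium,

$$\mu_0 = U^m(x^*) - U^f(x^*),$$

and a fraction $x^*$ of newcomers will enter farming such
that


$$\beta((1 - \delta)n_0 + \delta x^*)\,\phi'((1 - \delta)n_0 + \delta x^*) - \mu_0 + \beta\delta x^*\,\tau'((1 - \delta)n_0 + \delta x^*) = 0, \tag{15}$$

or $x^* = 1$ if the left-hand side of equation 15 is positive when evaluated at $x^* = 1$.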


The first part of this proposition is proved in the text.
The second part follows by noting that $(\mu_0, x^*)$ are
chosen to maximize equation 12. Substituting for $\mu_0 =
U^m(x^*) - U^f(x^*)$, $dU^m(x)/dx = -\beta\delta\tau'(n_1)$, and $dU^f(x)/dx =
\beta\delta\phi'(n_1)$, and simplifying, we obtain equation 15.
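
The simplification behind equation 15 can be spelled out as follows. This is a sketch, not the authors' own derivation: it assumes $n_1 = (1 - \delta)n_0 + \delta x$ (the argument of $\phi'$ and $\tau'$ in equation 15) and writes $\mu_0(x) = U^m(x) - U^f(x)$ for the subsidy that keeps newcomers indifferent.

```latex
\begin{align*}
\frac{dV^f(x)}{dx}
  &= \beta\delta\,\phi'(n_1)
     - \frac{\delta}{(1-\delta)n_0}
       \left[\mu_0(x) + x\,\frac{d\mu_0(x)}{dx}\right],
  \qquad
  \frac{d\mu_0(x)}{dx} = -\beta\delta\,[\tau'(n_1) + \phi'(n_1)]. \\
\intertext{Setting the derivative to zero and multiplying by $(1-\delta)n_0/\delta$ gives}
0 &= \beta(1-\delta)n_0\,\phi'(n_1) - \mu_0
     + \beta\delta x\,[\tau'(n_1) + \phi'(n_1)] \\
  &= \beta\,[(1-\delta)n_0 + \delta x]\,\phi'(n_1) - \mu_0
     + \beta\delta x\,\tau'(n_1).
\end{align*}
```

Evaluated at $x = x^*$, the last line has the same form as equation 15.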

This proposition implies that, for a range of param- 
eter values, redistribution takes an inefficient form (in 
the sense of inefficient targeting). The underlying rea- 
son is that farmers are attempting to maintain political 
power and realize this can be achieved only by attract- 
ing new farmers in order to remain a large group. 
Inefficient redistribution achieves this because it rewards
potential entrants, not just those already engaged
in farming. Expressed differently, because $\theta_0$ in
our model is a lump-sum transfer, it does not distort
the decision of marginal agents. Precisely for this
reason, however, the political process may choose to
redistribute via $\mu_0$, not $\theta_0$.

It is interesting that newcomers who enter farming 
are exerting a negative externality on manufacturers. 
To see this in a simple way, notice that as more 
newcomers enter farming, aggregate output falls, since 
these agents would have been more productive in 
manufacturing. Newcomers are indifferent between the 
two sectors. Moreover, farmers benefit from entry by 
construction, since they are encouraging newcomers, so
the whole cost falls on manufacturers, who pay sufficiently
high taxes to subsidize farmers.

It is important that commitment to future redistri- 
bution is impossible. An intuition based on the Coase 


13 Notice that the redistributive policy adopted actually leads to a
Pareto-dominated outcome. If all new agents go into manufacturing
and farmers are still paid $\phi(n_1^*)$ in period t = 1, all agents will be
made better off. The problem is that the promise of a subsidy of
$\phi(n_1^*)$ is not credible if all newcomers enter manufacturing.
theorem suggests that this type of inefficient redistribution
should not arise as there are gains to (political)
trade (see, e.g., Wittman 1989). That would be true in
this economy if all existing agents could jointly commit
to $\tau_1$ and $\theta_1$ at time $t = 0$. Such an arrangement is not
possible, however, because of the constraints imposed 
by political economy considerations. Since the political 
system cannot commit to future redistribution, the only 
way farmers can ensure future transfers is to maintain 
their political power. They achieve this by remaining a 
large group, and inefficient redistribution is the instru- 
ment they use for this purpose. Later we will discuss a 
number of examples in which the concern of various 
groups to maintain political power seems to be a factor 
in the choice of inefficient methods of redistribution, 
which suggests that the forces highlighted by our 
analysis may be important in a variety of circumstances. 

When condition 14 is satisfied, there does not exist 
an equilibrium without inefficient redistribution. To 
see this, notice that condition 14 ensures $dV^f(x = 0)/dx > 0$,
so farmers can always choose a level of price
subsidy, $\mu_0$, to attract some newcomers. Therefore, the
situation with x = 0 cannot be an equilibrium. 

In contrast, when condition 14 does not hold, there 
can be multiple equilibria. Such multiplicity arises 
when there exists a level of x, say x’, such that when a 
fraction x’ of newcomers enter farming, farmers are 
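better off, even though $dV^f(x = 0)/dx < 0$, that is,

$$\begin{aligned}
V^f(x') &= (1 + \beta)B + \beta\phi((1 - \delta)n_0 + \delta x') + \frac{(1 - \delta)(1 - n_0)\tau_0 - \delta x'\,[U^m(x') - U^f(x')]}{(1 - \delta)n_0} \\
&> V^f(x = 0) = (1 + \beta)B + \beta\phi((1 - \delta)n_0) + \frac{(1 - \delta)(1 - n_0)\tau_0}{(1 - \delta)n_0}.
\end{aligned} \tag{16}$$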


The reason for the multiplicity is the nonmonotonicity 
of per-capita transfers to farmers. For example, when 
transfers are determined by voting, farmers will have 
enough power if they have a certain fraction, say n’. 
When newcomers enter but the number of farmers 
does not reach n’, per-capita transfers decrease (i.e., $\tau_1$
remains constant, but per-capita transfers, $\phi(n_1)$, decrease).
In contrast, when enough newcomers enter to
raise the number of farmers above n’, per-capita 
transfers increase. In this case a natural multiplicity of 
equilibria arises. When newcomers expect others to 
enter farming so that the group will be of sufficient size, 
returns to farming are high because of the resulting 
transfers, and newcomers are willing to enter farming. 
Because condition 14 is not satisfied, however, there 
also always exists an equilibrium in which all newcom- 
ers expect others not to enter farming, and they 
themselves do not do so for any level of the transfer $\mu_0$.
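
The coordination logic can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the step tax schedule with a size threshold `n_prime`, the budget-balance definition of `phi`, the parameter values, and the restriction to the two symmetric outcomes in which either no newcomer or every newcomer enters farming. It is not the voting model itself.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Params:
    # Hypothetical values chosen only to illustrate the mechanism.
    A: float = 2.0         # manufacturing output per period
    B: float = 1.0         # farming output per period
    beta: float = 0.9      # discount factor
    delta: float = 0.4     # share of newcomers
    n0: float = 0.5        # initial fraction of old agents in farming
    n_prime: float = 0.5   # group size needed for political power
    T_low: float = 0.1     # period-1 tax when farmers lack power
    T_high: float = 0.8    # period-1 tax when farmers hold power


def n1(p: Params, x: float) -> float:
    """Number of farmers in period 1 when a fraction x of newcomers enter."""
    return (1.0 - p.delta) * p.n0 + p.delta * x


def tau1(p: Params, n: float) -> float:
    """Assumed step schedule: taxes jump once farmers reach n_prime."""
    return p.T_high if n >= p.n_prime else p.T_low


def phi(p: Params, n: float) -> float:
    """Per-capita transfer under assumed period-1 budget balance."""
    return (1.0 - n) * tau1(p, n) / n


def gap(p: Params, x: float) -> float:
    """U^m(x) - U^f(x): what a newcomer gives up by farming when mu0 = 0."""
    n = n1(p, x)
    return (1.0 + p.beta) * (p.A - p.B) - p.beta * (tau1(p, n) + phi(p, n))


def symmetric_equilibria(p: Params, mu0: float) -> List[float]:
    """Entry fractions in {0, 1} that are self-fulfilling at subsidy mu0."""
    outcomes = []
    if mu0 < gap(p, 0.0):    # if nobody enters, entering does not pay
        outcomes.append(0.0)
    if mu0 >= gap(p, 1.0):   # if everybody enters, entering pays
        outcomes.append(1.0)
    return outcomes


if __name__ == "__main__":
    p = Params()
    for mu0 in (0.5, 1.0):
        print(mu0, symmetric_equilibria(p, mu0))
```

With these assumed numbers, an intermediate subsidy supports both outcomes: newcomers enter if they expect others to enter, and stay out otherwise, which is the kind of multiplicity described in the text.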

Interestingly, there may also exist equilibria with 
different levels of inefficient redistribution. When we 
have 


$$V^f(x = 1) > V^f(x = 0),$$

farmers will be better off if all newcomers enter
farming than if no newcomers do so. In this case, the
following strategy for newcomers supports an equilibrium:
$x(\mu) = 0$ for all $\mu < \mu'$, and $x(\mu') = 1$. That is,
newcomers enter only when the price subsidy is high
enough, say, at some level $\mu' = U^m(x = 1) - U^f(x = 1)$.
Their actions are best responses because when others 
do not enter, each newcomer prefers not to enter. 

The fact that future political power depends on the 
coordinated actions of newcomers causes the multiplic- 
ity of equilibria. This multiplicity is of some interest, as 
it highlights that the amount of inefficient redistribu- 
tion can be quite large, in particular, larger than the 
amount farmers would prefer in order to maximize 
their per-capita transfers. Among the multiple equilib- 
ria with inefficient redistribution, farmers prefer those 
with lower x, that is, those that attract fewer newcom- 
ers, so they may have an incentive to limit entry, for 
example, by methods such as acreage controls. This 
again highlights that existing farmers have nonmono- 
tonic preferences over entry; they want a sufficient 
number of newcomers to increase or maintain their 
political power, but not so many that their revenues are 
diluted. 

Our model so far explains inefficient targeting—why 
subsidies are targeted to newborn agents, thus distort- 
ing their productive choices—but our analysis also 
suggests why inefficient conditioning may be useful to 
farmers. First, inefficient conditioning or other meth- 
ods of entry restriction, such as acreage controls, may 
help farmers select the most favorable option among 
multiple equilibria. Second, even in the absence of 
multiple equilibria, there may be a role for inefficient 
conditioning. To see this, suppose that agents working 
in manufacturing can claim to be farmers by buying a 
very small plot of land. In this case, there would be no 
net redistribution to farmers, since all agents would 
receive the subsidy. Farmers might want to use ineffi- 
cient conditioning, by making the subsidy conditional 
on farming output via price subsidies, as a way to 
prevent nonfarmers from claiming the subsidy. 

Next, notice that when $U^m(x) - U^f(x)$ is smaller,
condition 14 is more likely to be satisfied, so inefficient
redistribution, $\mu_0 > 0$, is more likely to arise. Yet,
conditional on there being inefficient redistribution, a
greater $U^m(x) - U^f(x)$ will imply a larger amount of
inefficient redistribution, since $\mu_0 = U^m(x) - U^f(x)$. A
number of comparative static results follow from this
observation. A range of variables that increase
$U^m(x) - U^f(x)$ make inefficient redistribution less
likely, but they increase the amount of inefficient
redistribution when there is any. For example, an
increase in the amount of redistribution in period t =
1, caused by a shift in $\tau(n_1)$, will reduce the gap
between $U^m(x)$ and $U^f(x)$, which makes inefficient
redistribution more likely. Similarly, a decrease in
$A - B$ will make inefficient redistribution more
likely.

A natural and more important comparative static is
that inefficient redistribution is more likely when $\phi'(n_1)$
is larger. A larger $\phi'(n_1)$ means that a given increase in the
number of farmers will translate into larger per-capita
transfers. This is intuitive since the point of inefficient
redistribution is to attract newcomers in order to
protect political power and per-capita transfers. Empirically,
$\phi'(n_1)$ could be related to such issues as the
geographical concentration of farmers and whether 
marginal changes in their number affect electoral out- 
comes, as well as the seniority of farming constituency 
representatives and whether they have powerful posi- 
tions, such as the chairmanship of key committees. This 
comparative static also implies that in practice we 
should see inefficient redistribution when political 
power depends crucially on group size. We discuss 
some examples later. 

It is most interesting that redistribution is likely to be 
inefficient when the political power of an influential 
group is contested, that is, when $n^* < n_0 < n^*/(1 - \delta)$.
This is because the purpose of the inefficiency is to 
prevent the loss of influence. Many examples of inef- 
ficient redistribution are from declining industries (e.g., 
Baldwin 1985; Rodrik 1994), which is consistent with 
this implication. 

It is worth discussing the robustness of our results to 
changes in the structure of the model. We have used a 
reduced form of a more fully specified political model, 
but we could instead determine the policy by majority 
voting and assume there are more agents in farming 
than in manufacturing. In a setting of two-party Down- 
sian political competition, the equilibrium policy would 
be that preferred by the median voter, a farmer. This 
policy would involve inefficient redistribution since the 
median voter would take into account its anticipated 
effect on the identity of the future median voter, and 
thus the equilibrium policy, in the second period. 
Although the policy space is three dimensional, the 
median voter theorem applies because there are only 
two types of agent, so heterogeneity is very limited. 

We also could cast the analysis in terms of a proba- 
bilistic voting model (e.g., Dixit and Londregan 1996; 
Lindbeck and Weibull 1987). The equilibrium policy 
offered by political parties would depend not simply on 
the bliss point of the median voter but on other 
characteristics that determine the political power of 
different groups. One such characteristic is size. Again, 
our results would hold. A large group would influence 
the equilibrium policy because parties would tailor 
their platforms to gain its support. To the extent that 
the group anticipates a decline in its size and future 
political power, it would prefer inefficient methods of 
redistribution. To attract these voters, political parties 
would offer such policies. The model could be extended 
by introducing interest groups (e.g., Persson and Ta- 
bellini 2000, sec. 3.5), and as long as size is a political 
asset, our results could be replicated. 

Our two-period model may seem restrictive, but our 
results are robust in an infinite horizon repeated game, 
with new agents entering sequentially in each period. 
In this case, there exist equilibria in which farmers 
initially have power but all newcomers enter manufacturing.
Over time, therefore, the relative power of farmers
declines until at some point they choose inefficient
redistribution. Such dynamic paths would have a very
similar interpretation to the results here. The complication,
however, is that in addition to such equilibria
there are many more that depend on expectations 
about the behavior of future agents. Many different 
sets of expectations could be self-fulfilling and thus 
qualify as equilibria. While this kind of behavior may 
generate empirically interesting phenomena, the two- 
period model allows us to focus on the issues that 
appear most likely to be of first-order importance. 


SPECIFIC FACTORS 


In this section, we show how our framework may 
account for a potentially puzzling pattern in the polit- 
ical economy of redistribution. The literature suggests 
that when skills and investments are specific to a given 
sector, agents have more to lose from relocating, and 
their incentives to lobby for protection are greater 
(e.g., Alt et al. 1996; Becker 1985; see also the formal- 
izations of Brainard and Verdier 1994 and Coate and 
Morris 1999). It is difficult to see the importance of 
specific factors, however, in many of the most pro- 
nounced cases of trade protection, such as textiles or 
farming, which are commonly viewed as sectors with 
limited specific investments by capital and labor.15
Similarly, labor market policy often involves protection 
for workers with limited specific skills. Contrary to 
conventional wisdom, our model predicts that sectors 
with less specific factors may be more prone to ineffi- 
cient redistribution. 

Consider a modified version of the economy de- 
scribed in the previous section. There are no young 
agents ($\delta = 0$), but in period 0 a fraction $\gamma$ of the
farmers can switch to manufacturing at some cost C. A 
high level of C means that switchers fail to employ their 
skills effectively in manufacturing, which implies that 
farming uses highly specific factors. We continue to 
assume that those who switch produce A in the other 
sector. To focus on the case in which it is still socially 
efficient to reallocate agents into manufacturing, we 
assume that $(1 + \beta)(A - B) > C$.

The timing of events is as follows. Taxes are deter- 
mined as in equation 1, and at this point farmers do not 
know whether they will have the opportunity to switch. 
Next, farmers find out whether they have this oppor- 
tunity and make their decision (if they switch, they do 
not pay taxes until period 1). In period 1, taxes are 


14 This idea is commonplace in the literature on the political
economy of trade policy. Alt et al. (1996, 700) argue: “A crucial
determinant of the incentives of an economic agent to seek trade
protection (or, more broadly subsidies) for his or her economic
activity is the degree to which the agent’s assets are specific to this
activity.” Similarly, Baldwin (1989, 124) claims: “One also expects
vigorous efforts to secure protection in the face of significantly
increased import competition by those industries [with] substantial
... industry specific physical and human capital.”

15 The empirical literature on trade finds that labor-intensive and
low-skill industries tend to receive more protection (see Rodrik 1994
for a succinct overview; also see Baldwin 1985; Ray 1981, 1991).
Moreover, consumer goods industries receive more protection than
intermediate goods industries (Ray 1991), but consumer goods (e.g.,
textiles, apparel, furniture and fixtures, toys, and sporting goods) are
thought to have relatively unspecific factors of production.


American Political Science Review 


Vol. 95, No. 3 




determined as in equation 1 (but no switching occurs),
and the world ends. The difference between $\theta_0$ and $\mu_0$
now is that farmers who decide to switch in period 0
still obtain $\theta_0$, but since they no longer farm, they do
not receive the price subsidy $\mu_0$. Therefore, once again,
$\theta_0$ is a nondistortionary transfer, whereas $\mu_0$ is an
inefficient form of redistribution because it encourages
agents to stay in the less productive sector.

Equation 3 still determines $\tau_1$, but equation 2 is now
modified to

$$(1 - n_0)\tau_0 = n_0\theta_0 + ((1 - \gamma) + \gamma x)\,n_0\mu_0. \tag{17}$$

That is, at $t = 0$, there are $n_0$ farmers and $1 - n_0$ agents
in manufacturing; a fraction $(1 - \gamma + \gamma x)$ of farmers
receive both types of transfer, but the fraction $\gamma(1 - x)$
who quit only receive the efficient transfer, $\theta_0$.

The number of farmers in period t = 1 is then given
by

$$n_1 = ((1 - \gamma) + \gamma x)\,n_0.$$


Let $W^f$ and $W^m$ denote the expected utilities of
potential switchers, and $V^f$ and $V^m$ denote the utilities
of immobile agents. As before, we have

$$W^f = V^f = (1 + \beta)B + \theta_0 + \mu_0 + \beta\phi(n_1), \tag{18}$$

$$W^m = (1 + \beta)A - C + \theta_0 - \beta\tau(n_1). \tag{19}$$

Also, define $U^f(x) = W^f - \mu_0$ as the utility of an agent
who can switch but chooses not to, when a fraction $x$ of
potential switchers stay in farming and when $\mu_0 = 0$.
Define $U^m(x) = W^m$ as the utility of an agent who
does switch when a fraction $x$ of potential switchers stay
in farming.
We denote the fraction of farmers who switch sectors
by $1 - x$. We concentrate on the part of the
parameter space of most interest, namely, where
$n_0 \in (n^*, n^*/(1 - \gamma))$. We also make the analogous assumption
to assumption 1, which ensures that it is not
worthwhile for a potential switcher to stay in farming
just to get future redistribution.


ASSUMPTION 2. $2\beta T < (1 + \beta)(A - B) - C$.


This condition ensures that if $\mu_0 = 0$, potential switchers
would all go to manufacturing. As in the previous
section, to increase their political power, or prevent it
from declining, farmers therefore need to set $\mu_0 > 0$.

Consider the utility of an old farmer before he knows
whether he will have the opportunity to switch. His ex
ante expected utility is

$$V^A = (1 - \gamma)V^f + \gamma\max\{W^f, W^m\}.$$

Since farmers decide the form of redistribution at this
stage, $\mu_0$ and $\tau_0$ simply maximize ex ante expected
utility. Now notice that if $W^f > W^m$, then $x = 1$, and if
$W^f < W^m$, then $x = 0$. Our interest is to see under
what circumstances $x = 0$ will not be an equilibrium,
that is, under what circumstances farmers will use $\mu_0 >
0$, resulting in inefficient redistribution. Suppose that
$W^f = W^m$, in which case we can write

$$\begin{aligned}
V^A &= (1 - \gamma)V^f + \gamma W^m \\
&= (1 - \gamma)((1 + \beta)B + \beta\phi(n_1)) + \gamma((1 + \beta)A - C - \beta\tau(n_1)) + \theta_0 + (1 - \gamma)\mu_0,
\end{aligned}$$

where the second line is derived by substituting from
equations 18 and 19.

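Now solving equation 17 for $\theta_0 + (1 - \gamma)\mu_0$ and
substituting, we have

$$V^A(x) = (1 - \gamma)((1 + \beta)B + \beta\phi(n_1)) + \gamma((1 + \beta)A - C - \beta\tau(n_1)) + \frac{(1 - n_0)\tau_0}{n_0} - \gamma x\mu_0. \tag{20}$$

As in the previous section, suppose that $\mu_0 = U^m(x) -
U^f(x)$, and substitute from equations 18 and 19 to
obtain

$$\mu_0 = (1 + \beta)(A - B) - C - \beta(\phi(n_1) + \tau(n_1)). \tag{21}$$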


Differentiating equation 20, evaluating it at $x = 0$, and
substituting for equation 21, we obtain that there will
be inefficient redistribution, that is, $\mu_0 > 0$, if

$$\frac{dV^A(x = 0)}{dx} = -\gamma\mu_0 + \beta\gamma n_0\left[(1 - \gamma)\phi'((1 - \gamma)n_0) - \gamma\tau'((1 - \gamma)n_0)\right] > 0. \tag{22}$$

Intuitively, if $dV^A(x = 0)/dx > 0$, a small increase in $x$
will raise the ex ante expected utility of farmers. This
expression highlights once again that for inefficient
redistribution to arise $\phi'$ and $\tau'$ need to be positive. If
$\tau'((1 - \gamma)n_0) = 0$, then keeping some of the potential
switchers will reduce taxes, so $\phi' < 0$, and $dV^A(x =
0)/dx < 0$. Therefore, $\tau' > 0$ is necessary, but not
sufficient, for inefficient redistribution. Also, notice
that potential switchers always prefer ex post not to
have implemented a policy of inefficient redistribution
to keep farmers in political power (they would prefer to
move to manufacturing and not be taxed).

The novel comparative static result here is with
respect to C. Recall that when C is high, farming skills
are more specific. When C is high, equation 22 is more
likely to be positive, so inefficient redistribution is more
likely to arise. Yet, conditional on there being inefficient
redistribution, a smaller C implies more inefficiency.
Intuitively, when C is lower, the skills of potential
switchers are less specific to farming, so they are
more willing to move into manufacturing. This implies
that farmers need to choose a more inefficient mix of
redistributive policies to convince potential switchers
to stay. Contrary to conventional wisdom, therefore,
our model, which derives inefficient redistribution from
micro foundations, implies that a lower degree of
specificity may increase the extent of inefficient redistribution.
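
The two-sided role of C can be checked numerically. The sketch below is illustrative only: the schedules `tau(n) = t * n**2` and `phi(n) = t * n * (1 - n)`, the parameter values, and the function names are assumptions, and the formulas follow equation 21 and expression 22 as stated above with $n_1 = ((1 - \gamma) + \gamma x)n_0$.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Params:
    # Hypothetical values for illustration only.
    A: float = 1.5      # manufacturing output per period
    B: float = 1.0      # farming output per period
    beta: float = 0.9   # discount factor
    gamma: float = 0.3  # fraction of farmers able to switch
    n0: float = 0.5     # initial number of farmers
    t: float = 1.0      # scale of the ASSUMED schedules below


def tau(p: Params, n: float) -> float:
    return p.t * n ** 2


def tau_prime(p: Params, n: float) -> float:
    return 2.0 * p.t * n


def phi(p: Params, n: float) -> float:
    return p.t * n * (1.0 - n)


def phi_prime(p: Params, n: float) -> float:
    return p.t * (1.0 - 2.0 * n)


def n1(p: Params, x: float) -> float:
    """Farmers in period 1 when a fraction x of potential switchers stay."""
    return ((1.0 - p.gamma) + p.gamma * x) * p.n0


def mu0(p: Params, C: float, x: float) -> float:
    """Equation 21: subsidy that leaves potential switchers indifferent."""
    n = n1(p, x)
    return (1.0 + p.beta) * (p.A - p.B) - C - p.beta * (phi(p, n) + tau(p, n))


def expression_22(p: Params, C: float) -> float:
    """Expression 22: marginal ex ante gain from retaining switchers at x = 0."""
    m = n1(p, 0.0)
    bracket = (1.0 - p.gamma) * phi_prime(p, m) - p.gamma * tau_prime(p, m)
    return -p.gamma * mu0(p, C, 0.0) + p.beta * p.gamma * p.n0 * bracket


if __name__ == "__main__":
    p = Params()
    for C in (0.1, 0.2, 0.3):
        print(C, mu0(p, C, 0.0), expression_22(p, C))
```

In this sketch a higher C raises expression 22 one-for-one with $\gamma$ (inefficient redistribution becomes more likely) while lowering the required subsidy $\mu_0$ (less inefficiency when it occurs), which mirrors the comparative static discussed in the text.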
APPLICATIONS OF THE MODEL


Agricultural Policy

The first application we discuss is farming subsidies. 
Gisser (1993, 584) argues that “most economists have 
by now abandoned the belief that the main purpose of 
regulation is to correct for failures in private markets.
The U.S. farm commodities program is no exception 
since it is designed to transfer income from taxpayers, 
and sometimes from consumers to farmers.” There is a 
consensus that farm policy cannot be explained as 
correcting market failures. Although a number of 
authors maintain that the form of redistribution to 
farmers is relatively efficient (see Gardner 1987, Gisser 
1993), it is difficult to believe that more efficient 
methods than price supports and quantity controls do 
not exist. For example, most economists consider the 
Common Agricultural Policy in Europe highly ineffi- 
cient, and it is argued that direct subsidies could save 
considerable resources (e.g., Moyer and Josling 1990). 

Most studies of agricultural subsidies take it for 
granted that lump-sum redistribution cannot be used. 
Our theory suggests that this is due to the desire to 
keep a critical mass of farmers in the industry. Wright 
(1995, 14) echoes this view: “Making farming permanently
more attractive to the young by means of price
supports... is a goal that appears embodied explicitly 
or implicitly in the farm policies of most developed 
economies.” 

In the early 1960s the French government attempted 
to reduce farm prices and promote the consolidation 
and modernization of small farms, but there was 
substantial opposition from the larger and more pow- 
erful farmers who controlled the Fédération Nationale 
des Syndicats d’Exploitants Agricoles (FNSEA). 
Franklin (1969, 103) explains why: “On the one hand, 
by supporting such price [subsidy] policies they [capi- 
talist farmers] achieved an apparent common purpose 
with the large mass of the peasantry, on the other, any 
success such policies might register, by helping to 
maintain the peasantry rather than diminish them, 
would, at the same time, help to sustain the peasants’ 
electoral importance, and by extension increase the 
pressure which the capitalist-led federations might 
bring to bear upon various governments.” It appears 
that farmers in France were aware that the form of 
transfers would influence their numbers and their 
future political power, so they may have preferred 
inefficient methods of redistribution. 

The same considerations appear to be important 
today. Following the MacSharry reforms to the Common
Agricultural Policy in 1992, pressure by French
farmers induced the government to pass the Loi de
Modernisation de l’Agriculture in January 1995. Part
of this law was to introduce the goal of establishing 
15,000 young farmers per year and, in general, lower 
the costs of doing business as a farmer to encourage 
entry (see Coleman, Atkinson, and Monpetit 1997). 
The reaction to agricultural reforms was quite similar 
in Germany and the United States. The Mansholt plan 
was defeated by German farming interests in 1968 on 
identical grounds (Averyt 1977, 16-7), and the Brannan
Plan in 1958 was defeated by the American Farm
Bureau for similar reasons (Christenson 1959; Hansen 
1991). 

Overall, on a number of occasions farmers have 
campaigned for inefficient redistribution policies. Our 
model suggests that this is because they want to 
encourage newcomers in order to maintain future 
political power. 


Labor Market Policy 


Most European labor markets are heavily regulated
and characterized by such institutions as firing costs, 
which make it prohibitively expensive to lay off workers 
(e.g., Lazear 1990). Although severance pay may be 
useful, as it provides insurance to workers who other- 
wise would remain uninsured, the majority of the costs 
incurred by firms are administrative and do not benefit 
workers. Therefore, these policies appear highly inefficient.
It is often argued that their main role is to
increase insiders’ bargaining power and wages (e.g.,
Lindbeck and Snower 1988; Saint-Paul 1996). Within
this category are many pieces of legislation that en- 
hance the ability of workers to unionize and engage in 
collective action to raise wages (e.g., closed shop
agreements). Many economists believe that these pol- 
icies are designed to give workers market power and 
also are responsible for high unemployment. It is 
argued that it would be much cheaper and more
efficient to make direct transfers to insiders and allow 
the necessary worker and job reallocation. The preva- 
lence of firing costs and legislation that increases the 
ability of workers to combine and engage in collective 
action in Europe is therefore quite puzzling from a 
theoretical perspective.16

Our model provides a simple answer. Suppose $n_0$ of
workers are in a high-wage sector, such as manufacturing,
and wages are determined by union-firm bargaining.
There is a critical mass of workers $\bar{n}$, such that for
all $n < \bar{n}$, the union loses its ability to push for higher
wages. Suppose also that a fraction $\gamma$ of the workers in
the sector are in loss-making firms. In the absence of
firing costs, these firms will lay off workers ($\gamma n_0$ of
them), and many of these workers will find jobs in
other sectors, which will reduce union membership to
$n_1 < \bar{n}$. The union and manufacturing workers will
therefore campaign for firing costs in order to prevent
their numbers from shrinking. Even though other
methods of redistribution are more efficient, only firing
costs and similar restrictive work practices ensure that
unions maintain their power in the future.17

There is a body of evidence suggesting that our 





16 It is argued that these labor market interventions increase the
incentives of workers to invest in human capital (see, e.g., Acemoglu
and Pischke 1999; Robinson n.d.). Yet, other labor market interventions
can do this much more efficiently than administrative firing
costs and closed shop arrangements.

17 Moreover, firing costs reduce turnover and stabilize the composition
of the workforce, which makes it easier for unions to mobilize
workers. A similar argument can be developed to account for 
prounion legislation: Unions will support the policies that sustain 
approach is along the right lines, that is, the form of
welfare state intervention is often inefficient, precisely 
because it is motivated by a desire to maintain future 
political power and sustain the ability of workers to 
engage in collective action. Many authors point out 
that labor market institutions are designed to ensure 
their political sustainability. Esping-Andersen (1990, 
16) writes: “The social rights, income security, equal- 
ization and eradication of poverty that a universalistic 
welfare state pursues are necessary preconditions for 
the strength and unity that collective power mobilization
demands” (see also Esping-Andersen 1985). He
further argues that universalistic welfare states dynam- 
ically sustain the political coalitions that create them in 
a way that means-tested systems, which create divisions 
within workers, do not. 

Rothstein (1985, 1992) shows that a central factor in 
explaining the cross-country strength of trade union 
movements is whether they manage the national un- 
employment insurance scheme. When they do, as in 
Belgium and Scandinavia, except Norway, they can 
reinforce and sustain their bargaining power by deter- 
mining the criteria under which unemployed people 
must accept jobs. This allows them, for example, to 
prevent the unemployed from undercutting their bargaining
power. Pontusson (1992, 28) points out that
there are “instances in which welfare reforms directly 
strengthened union organization. Most notably, the 
public unemployment insurance system introduced by 
the Swedish Social Democratic Party in 1934 subsi- 
dized union administered unemployment funds and 
thereby provided a direct incentive for wage earners to 
join unions.” 

The same issues manifest themselves in unions’ 
regulating job losses. Golden (1997, 4-5) argues: 


Even unions that appear radically to resist market forces 
accept that there are circumstances in which the enterprise 
must reduce the size of its labor force. But what no union 
can accept... is that the firm take advantage of such a 
situation to break the union itself. If too many shop floor 
union representatives are included amongst those to be let 
go, or if so much of the union’s membership is slotted for 
expulsion as to jeopardize the very future of the union as 
an organization...the union responds with industrial 
action. The aim of such action is to restore the union 
organization, not to prevent job loss. Strikes over work- 
force reductions .. . are rational, self-interested responses 
on the part of labor organizations to threats to trade 
unionism. 


In the same vein, Slichter (1941, 17) notes: “If the 
union has no closed shop, restrictions on the employer’s
freedom to lay off may be a matter of self-preservation,
because if union members are always the
first to be dropped, the men will not remain in the 
organization.” 

As with agricultural and trade policies, it appears 
that a number of redistributive labor market policies 
are chosen to be inefficient, at least in part, to preserve 


their constituency. In this way, they ensure continuity 
in the political power of the policies’ beneficiaries. 


International Trade Policy 


Most countries use tariffs and quotas to protect domes- 
tic industries. This is sometimes justified by infant 
industry protection arguments or similar externalities. 
Most economists, however, view tariffs and quotas as 
inefficient methods of transferring resources to special 
interest groups, in this case firms and workers in sectors 
that are subject to foreign competition. 

To apply our analysis to the case of international
trade policy, it is useful to consider two sectors, manufacturing
and farming, as producing imperfect substitutes,
and all consumers as having the utility function
$y_m^{\alpha} y_f^{1-\alpha}$, with $\alpha \in (0,1)$. The world relative price of
farming output in terms of manufactures is $p$, so $pB$
replaces $B$. $\theta$ is still equivalent to a lump-sum transfer
to current farmers, and $\mu$ can now be interpreted as a
tariff at the rate of $\hat{\mu} = \mu/(pB)$. So a $\mu$ transfer increases
the return to farming to $pB(1 + \hat{\mu})$,18 but it also distorts
relative prices. This inefficient method may be preferred
to a $\theta$ transfer, however, precisely because it
attracts newcomers, who would be more productive in
manufacturing, to this sector.

We do not have direct evidence that the mechanism 
we propose is an important factor in the choice of 
inefficient trade policies, but various authors suggest 
that numbers are important in securing trade protection.
Caves (1976) claims that the number of votes an
industry can mobilize increases trade protection. Tosini 
and Tower (1987) found that the proportion of textile 
and apparel workers in the workforce of a congres- 
sional district or state was the most significant deter- 
minant of the pattern of voting on the 1985 textile bill. 
Baldwin (1985) presents other evidence of the impor- 
tance of voting in the determination of U.S. trade 
policy, and Harper and Aldrich (1991) provide similar 
evidence on legislation affecting the sugar industry. 


CONCLUSION 


We have developed the idea that the dynamics of 
group power is crucial in political systems that lack the 
ability to make commitments to future policy. Groups 
wish to take actions not just to raise their welfare 
today; they want to sustain their power so that they will 
be able to influence policy in the future. In order to do 
this, they may take current actions that would not be 
optimal if there were no concern for the future. 
Inefficient methods of redistribution may be precisely 
such an action when the political influence of a group 
depends on its size, a natural assumption in democratic
systems. This is because inefficient redistribution
makes staying in, or joining a group, relatively more 
attractive to marginal agents than efficient methods of 
redistribution do. We argue that this explanation is 
consistent with a variety of evidence on the political 








their future influence, which rests on their ability to organize 
collective action. 





18 Or the actual return is $pB(1 + \mu)/(p(1 + \mu))^{\alpha} = Bp^{1-\alpha}(1 + \mu)^{1-\alpha}$,
since the prices of farming goods increase for farmers, too.

economy of redistribution in agricultural, trade, and
labor market policy. 

Our model focuses mostly on inefficient targeting, 
that is, on why subsidies are aimed at groups whose 
behavior will be distorted. It also can explain efficient 
conditioning, that is, why redistribution will be condi- 
tioned on the intensive margin of actions. For example, 
in the farming case, inefficient redistribution (target- 
ing) maintains the political power of farmers, but it 
may attract more newcomers to the sector than are 
required to maintain group power. This dilutes the 
wealth of group members. Therefore, subsidies condi- 
tional on production or acreage controls may be useful 
to limit excessive entry. We leave a detailed analysis of 
this issue to future research. 
According to the Olson paradox, larger groups may be less successful than smaller groups in furthering
their interests. We address the issue in a model with three distinctive features: explicit intergroup
interaction, collective prizes with a varying mix of public and private characteristics, and nonlinear
lobbying costs. The interplay of these features leads to new results. When the cost of lobbying has the elasticity 
of a quadratic function, or higher, larger groups are more effective no matter how private the prize. With 
smaller elasticities, a threshold degree of publicness is enough to overturn the Olson argument, and this 
threshold tends to zero as the elasticity approaches the value for a quadratic function. We also demonstrate 
that these results are true, irrespective of whether we examine group sizes over the cross-section in some given 
equilibrium or changes in the size of a given group over different equilibria. 


Most political, social, or economic activities are
carried out by groups or organizations, not
individuals. Presumably, this is because the
outcome of pooled efforts usually is larger than the sum 
of individual efforts. A critical factor, however, is 
whether the benefits from cooperation are distributed 
in ways that pay all potential partners to cooperate. 
Individual rewards depend on the contributions of 
other group members as well as one’s own. Because of 
the free-rider problem, individuals bear only partially 
the adverse consequences of reducing their effort. 
Consequently, collective effort typically falls below the 
optimal level. 

The free-rider or collective action problem is ex- 
tremely pervasive and includes a wide variety of situa- 
tions in which cooperation is necessary. Trade unions, 
lobbies, and the provision of public goods are standard 
examples. The problem also appears in the case of 
collusive behavior between organizations, and it can 
occur even within organizations when the outcome 
depends on combined efforts by different individuals at 
the management or production level. 

Olson’s (1965) celebrated thesis and the earlier 
insightful work of Pareto (1927) argue that the free- 
rider problem makes smaller groups more effective. 
For instance, Olson writes: 


The most important single point about small groups . . . is 
that they may very well be able to provide themselves with 
a collective good simply because of the attraction of the 
collective good to the individual members. In this, smaller 
groups differ from larger ones. The larger a group is, the 
farther it will fall short of obtaining an optimal supply of 
any collective good, and the less likely that it will act to 
obtain even a minimal amount of such a good. In short, the 
larger the group, the less it will further its common
interests (1965, 36). 


Pareto makes a very similar argument in the context of 
protectionist measures. 


In order to explain how those who champion protection
make themselves heard so easily, it is necessary to add a
consideration that applies to social movements generally. . . .
If a certain measure A is the cause of the loss of one
franc to each of a thousand persons, and of a thousand
franc gain to one individual, the latter will expend a great
deal of energy, whereas the former will resist weakly; and
it is likely that, in the end, the person who is attempting to
secure the thousand francs via A will be successful. A
protectionist measure provides large benefits to a small
number of people, and causes a very great number of
consumers a slight loss. This circumstance makes it easier
to put a protection measure into practice (1927, 379).

Individuals always have an incentive to shirk, but the 
effect is more pervasive when group size is large. There 
are two reasons for this. First, the larger the group, the 
smaller is the perceived effect of an individual defec- 
tion. Second, if the prize has any element of private- 
ness, then the larger the group, the smaller is the 
individual prize. Hence, larger groups will be less 
effective in pursuing their targets, which in essence is 
the group-size paradox.1

There is general agreement that, because of the 
free-rider problem, individuals will tend to contribute 
lower levels of “action” (money, effort, time, and so on) 
the larger the group to which they belong. The key 
question, however, is the aggregate potency of the 
group, which is what determines effectiveness in the
sense of success probabilities. Decreasing personal
contributions are not necessarily incompatible with
increasing aggregate effectiveness. Among others,
Chamberlin (1974), Marwell and Oliver (1993),
McGuire (1974), Oliver and Marwell (1988), Sandler
(1992), and Taylor (1987) point out that Olson’s prop-
osition of an inverse relationship between effective
collective action and group size hinges on the assump- 
tion that the collective good is purely private, so that it 
must be divided among group members. They argue 


1 For an extended discussion of the different factors relevant to the 
group size paradox, see Olson 1965, chap. 1. 
(but do not formally demonstrate) that when the
collective good is public—so that rewards to individual 
group members are fully nonexcludable—Olson’s re- 
sult is reversed: The larger the group, the higher is the 
level of the collective good it will be able to produce. 
The view that the Olson thesis holds when the prize is 
private but may be reversed when the prize is purely 
public will be termed the “common wisdom” for the 
rest of this article. 

Two points are to be noted and will be reiterated 
below. First, the words public and private are relative to 
the group in question. A public prize means that a 
group member's payoff is unaffected by the number of 
members, but nonmembers may be partially or com- 
pletely excluded. Likewise, a private prize is fully 
divided up, but only among group members. Second, 
the common wisdom applies to the extreme cases of 
“purely private” and “purely public” goods and is silent 
on the intermediate case. 

Olson’s thesis—or even the common wisdom version 
of it—is not necessarily consistent with some informal 
observations. For instance, there is a sense in which the
received theory runs counter to the old maxim: Divide 
and conquer. Political entities have applied this rule 
with surprising universality, but if smaller groups are 
more potent, the division of one’s opponent into a 
number of smaller units would entail more effective 
opposition. This universality can be reconciled with the 
traditional argument only if all potential gains are fully 
public to group members. But that is hardly the case. 
For example, groups seeking political autonomy may 
be just as motivated by private economic consider- 
ations (e.g., access to natural resources) as by public 
gains (e.g., feelings of independence). The universality 
of “divide-and-conquer” coupled with the mixed na- 
ture of group rewards demands an explanation. 

Consider how individuals organize themselves in 
order to influence government decisions. We can think 
(simplistically but usefully, we hope) of the activity of 
the government as obtaining revenue and then allocat- 
ing it to a number of activities, following the pressures 
perceived from the public. To be sure, a number of 
government activities have the character of public 
goods. But many are very close to private goods. Think 
of transfers either in money or in kind made on the 
basis of various individual characteristics. These in- 
clude pensions, subsidies, education, and health bene- 
fits, for which there may be a fixed budget. Here, 
Olson’s thesis would suggest that we should observe 
citizens organized in extremely narrow interest groups, 
to cut down on the scope of such transfers. Thus, 
lobbies would be organized around a very exhaustive 
(and possibly artificial) list of “deserving characteris- 
tics,” in an attempt to ward off the free-rider problem. 
For instance, we should see the very poorest organizing 
their own lobby to become the sole beneficiaries of 
public support, arguing that those immediately above 
them are not really deserving. In general, there should 
be extreme splintering of lobby groups, not to mention 
an artificial narrowing of the issues. We do not seem to
observe anything close to this. Whenever an interest 
group tries to argue in its favor, this is usually (although 
not always) done by appealing to some general and
inclusive characteristic. By and large, the tendency is to 
see individuals organized in quite broad platforms. 
This sort of evidence does not fully conform to the 
common wisdom that when goods are largely private, 
smaller groups will tend to be more effective. 

Or view firms as organizations, to which the free- 
rider problem should be readily applicable, as Olson 
(1965, 54-5) himself argues. If the common wisdom is 
to be accepted, we should observe that larger firms 
switch from incentive schemes (such as stock owner- 
ship for high-level employees) to direct monitoring of 
employee actions. This is because—applying Olson—in 
large organizations the link between stock returns and 
individual performance is tenuous, so that stock-based 
incentive schemes should have no effect. But we do not 
observe a progressive transition to direct monitoring 
among large corporations. If anything (and especially if 
we neglect the recent dot com experience), we observe 
the opposite. 

The discussion so far is meant to challenge the 
common wisdom, at least in some cases where it may 
apply. But one can go farther. In many cases, the 
common wisdom—with its reliance on the polar dis- 
tinction of private versus public—fails to apply in the 
first place. This is because most situations contain 
public as well as private components. Indeed, one 
purpose of this article is to throw light on the mixed 
case. 

In this article we develop a formal model of collec- 
tive action that has three features. (1) Collective action 
is undertaken in order to counter similar action by 
competing groups. (2) Marginal individual effort is 
increasingly costly (as total individual effort rises). (3) 
The collective prize is permitted to have mixed public- 
private characteristics, within a range between pure 
publicness and pure privateness of the good. 

The first feature is common to a large and growing 
literature,2 but the second gives rise to new results of
substantial empirical relevance. Our main point is that, 
if the marginal cost of effort rises sufficiently quickly 
with respect to resources contributed, then larger 
groups will have a higher win probability, even if the 
prize is purely private. We shall argue that this over- 
turns Olson’s hypothesis in circumstances in which 
marginal costs increase with effort supply. 

The phrase “larger groups have a higher win proba-
bility” is ambiguous.3 One interpretation is that it
applies over the cross-section of groups, that is, larger 
groups have a higher win probability evaluated at a 
single equilibrium. A second interpretation is more in 
the nature of comparative statics: At some equilibrium, 


2 Chamberlin (1974) emphasizes that most collective goods theory
deals with actions of a single group, and the next step should be to
consider the behavior of several groups in competition. As Hardin
(1995) points out, successful collective action often entails suppres-
sion of another group’s interest. A sizable literature has emerged on
this subject (e.g., Katz, Nitzan, and Rosenberg 1990; Nitzan 1991;
Riaz, Shogren, and Johnson 1995; and Tullock 1980) and formalizes
the problem of collective action with multiple groups as a contest
game.

3 We are indebted to an anonymous referee for bringing this point to
our attention.

if the membership of a certain group exogenously rises,
then the winning probability of that group rises. Under 
the second interpretation it is necessary, at least in the 
general equilibrium formulation that we adopt, to track 
the possible changes in equilibrium magnitudes before 
we deliver an answer. That is, other groups must be 
permitted to react to this change in group size. Yet, 
remarkably enough, we find that both interpretations 
yield the same results. Larger groups have higher win 
probabilities under precisely the same (sufficient) re- 
strictions on marginal cost. Thus, without taking sides 
as to which interpretation of the Olson argument
should be adopted, we find that the same critique 
uniformly applies. 

In the formal analysis to follow, we make precise the 
crucial condition that “marginal cost of effort rises 
sufficiently quickly.” Here we limit ourselves to some 
remarks of interpretation. Observe that group effort 
may be supplied in one (or both) of two ways: funds 
and time. Consider the former. The marginal cost of 
“effort” is then simply the opportunity cost of devoting
additional funds to a collective enterprise. If credit 
markets were perfect, this opportunity cost would have 
a simple proxy, which is the rate of return on alterna- 
tive economic activities or just the rate of interest. In 
other words, if funds can be borrowed (without limit) 
for lobbying purposes, there would be no reason to expect
that the marginal cost of those funds will rise with the 
amount borrowed. This picture is inaccurate, however, 
because credit markets are rarely, if ever, perfect; even 
less if it is known that borrowed funds may be diverted 
to lobbying. In this case, the marginal cost of funds is 
no longer the rate of interest, but the rate of return on 
economic alternatives (including consumption) to 
which an individual must apply his own funds. It is 
natural to suppose that the larger the funds devoted to 
collective action, the higher will be the marginal op- 
portunity cost of those funds. Our restriction becomes 
focal. 

Observe, moreover, that this interpretation clarifies 
an important conceptual issue: Our restriction on 
marginal costs is not just an ad hoc restriction on
individual tastes and preferences. It may be a state- 
ment about the institutional context, in this example, 
about the nature of capital markets. 

Next, consider time as an input into the lobbying 
process. Then the marginal cost of effort is just the 
marginal cost of time. It is obvious that as more and 
more time is devoted to collective action, the marginal 
value of what remains must certainly rise, leading once 
again to the centrality of the increasing marginal cost
restriction. Notice, moreover, that under this view
credit markets play no role. Therefore our conditions 
admit more than one contextual interpretation. 

A simple application of these remarks further illus- 
trates the situation. First, the less rich the groups in 
question, the more significant should be the marginal
cost effect. This is so for two reasons: (1) If credit
markets are imperfect, they are likely to be more so for 
the poor, and therefore for such individuals the mar- 
ginal cost of funds expended will rise more steeply with 
expenditure. (2) The poor are more likely to use time
rather than funds in the lobbying process. As already
noted, the use of time has a built-in propensity to 
exhibit increasing marginal cost—there are only 24 
hours in the day. It follows that lobby battles involving 
relatively low-income groups (e.g., public job training, 
Medicare, or welfare assistance) will entail the forma- 
tion of large groups. In contrast, battles fought by 
high-income groups (e.g., tax reduction or tariff pro- 
tection) often substitute financial contributions for 
personal effort, and the marginal cost restriction is less 
likely to apply. Such lobby groups tend to be narrow 
and small. 

The third feature of our model is new, that is, we 
allow for intermediate outcomes that are partly public 
and partly private. The result is new insights about the 
interplay between publicness of the prize and the 
increasing marginal cost effect. Given some cost func- 
tion, a certain critical degree of publicness is enough to 
overcome the group-size paradox, which only confirms 
earlier work in more general form. More significantly, 
we prove that the threshold degree of publicness of the 
outcome is decreasing in the elasticity of marginal cost 
of effort supply. Indeed, if the elasticity is at least as 
large as 1, then the probability of success increases with 
group size, irrespective of the degree of privateness of 
the prize. 

It should be noted that our—and Olson’s—notion of 
group effectiveness may not be the only variable that 
influences group formation. We push this qualification 
a step farther by noting that per-capita payoffs to group 
members are just as important. This motivates a second 
notion of effectiveness, which we briefly explore at the 
end of the article. 

We also note that our framework is a highly abstract 
model of pluralism, not a model of lobbying in the way 
most formal political theorists understand it. Neverthe- 
less, we believe it is the simplest structure within which 
to analyze group effectiveness, for it captures the idea 
that government policy is sensitive to group contribu- 
tions (whether in units of money or effort). Indeed, as 
in Olson’s original argument, the focus is on the 
free-rider problem. We achieve this focus by abstract- 
ing from other (possibly more realistic) aspects, but 
there is no reason to believe that a consideration of 
those aspects would change the results significantly. 


A MODEL OF COLLECTIVE ACTION WITH 
FREE-RIDING 


The Effort Cost of Collective Action 


Suppose that several options are available to society. 
Think of these as different locations of a public facility, 
competing public projects (hospital, library, or muse- 
um), or different political parties in office. Only one of 
the available alternatives can come about. Individuals 
differ in their valuations of these options. All those who 
rank a certain one first form an “interest group.” We 
assume that all individuals with the same favorite are 
identical.4


4 We thus exclude the case of group members who might be prepared

Let G denote the number of options as well as of
groups. Let N be the total population and $N_1, \ldots, N_G$
be the membership of the G groups.

In every collective action problem at least two types
of goods are involved: the various prizes from collec- 
tive action and the efforts contributed by individuals to 
realize their favored ends. The structure of preferences 
over the various prizes may have complex implications. 
For instance, individuals may free-ride not only on the 
effort made by fellow group members but also on the 
effort contributed by other groups, as long as the 
options have some degree of publicness. In order to 
isolate and examine the within-group component of 
free-riding, we remove these aspects of the problem by 
assumption.6 In other words, we assume that individu-
als in any group care most about their favorite option
and are indifferent to all others. The utility differential 
between the favorite and the remainder, denoted by w, 
will drive the arguments below. Note well that this 
description is compatible with a scenario in which each 
option has a universally public component that is
enjoyed symmetrically by all groups.

We shall denote by a the level of effort contributed 
by each individual. These units, which can be added 
across group members to yield group effort, may 
represent dollars or hours contributed to the collective 
cause. This particular interpretation of a will be main- 
tained in the discussion that follows. 

Assume that individual preferences are represented 
by the (additively separable) utility function 


u(w,a) = w — v(a), (1) 


where v is an increasing, smooth, convex function with 
v'(0) = 0, and w is the per-capita benefit from the 
favored option. This is equivalent to measuring utility 
in units of the collective good: From the benefit w we 
subtract the cost of the effort contributed, translated 
into the equivalent units of the collective good.7 


to supply by themselves the necessary contributions. That issue was 
raised by Olson in classifying different types of groups and has been 
extensively analyzed by Marwell and Oliver (1993). We do not deny 
the potential empirical relevance of this issue but want to focus on
group numbers alone. 

5 Consider the case of three options, A, B, and C.
Suppose that group A (whose first preference is A) strongly prefers
option B over C if choice is restricted to this pair. Suppose,
moreover, that group B is pushing very strongly for its most preferred
outcome. When deciding how much effort to contribute, individuals
in group A would take into account not only what their fellow
members are contributing but also the fact that option B is not that
bad. Therefore, the free-riding problem would not be confined to
one’s own group, but would also be affected by the amount of effort
made by “nearby” groups and the distance of their preferences from
that of one’s own group.

6 More complex preference structures among options are explored in
detail in Esteban and Ray 1999. There, we examine the case in which
free-riding occurs not within groups but across groups. We show that,
in general, the structure of preferences over the entire set of options 
has implications for the magnitude and pattern of conflict. Here we 
avoid the general case in order to focus narrowly on the Olson 
argument. 

7 Recall that a is the variable directly added over all members to 
arrive at win probabilities for a group, which means that the shape of 
v—in particular, the fact that we allow v to be nonlinear—becomes 
crucial in what follows. Note that a could as well be contributions in

In order to examine the economic meaning of alter-
native shapes of the v function, consider the individual
preferences over (w, a) pairs implied by equation 1. If 
v is linear, we have the case in which effort is directly 
subtracted from benefits, as in Olson. There are in- 
stances in which this might be a plausible assumption. 
Think, for instance, of the cost to a lobbying firm of 
borrowing an extra dollar from a frictionless credit 
market, so that the rate of interest i is insensitive to the 
amount borrowed. In this case v(a) is just (1 + i)a. Yet,
in many interesting instances this assumption does not 
appear appropriate. That is clearly the case when the 
collective action is contributed by individuals and con- 
sists of personal effort, time, or income. The class of 
nonlinearities broadens even more if capital markets 
are imperfect. In these situations it may be more 
appropriate to assume that additional units of effort 
are increasingly costly. 

Put another way, the marginal rate of substitution 
between reward and effort—the amount of extra ben- 
efit that will just compensate an individual for contrib- 
uting an extra unit of effort—increases as total effort 
increases. As it turns out, the rate of increase, that is, 
the elasticity of the marginal rate of substitution with 
respect to effort, is the key variable that determines the 
effect of group size in collective action problems. 

To formalize this concept quickly, we note that the 
marginal rate of substitution, r, can be simply written as 


r = v'(a), (2) 


and its elasticity $\alpha(a)$ at any effort level as

$$\alpha(a) \equiv \frac{a}{r}\,\frac{dr}{da} = \frac{a\,v''(a)}{v'(a)}; \tag{3}$$

$\alpha(a)$ will play a central role in our main result.
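
As a rough numerical illustration of the elasticity in equation 3 (a sketch, not part of the original analysis), the snippet below evaluates $a\,v''(a)/v'(a)$ for a hypothetical power cost function $v(a) = a^k/k$ with $k > 1$. The function names and the power family are assumptions chosen for illustration. For this family the elasticity is the constant $k - 1$, so a quadratic cost gives exactly 1.

```python
def cost_elasticity(v_prime, v_double_prime, a):
    """Elasticity of the marginal rate of substitution: a * v''(a) / v'(a)."""
    return a * v_double_prime(a) / v_prime(a)


def power_cost_elasticity(k, a):
    """Elasticity for the assumed cost v(a) = a**k / k with k > 1.

    Here v'(a) = a**(k - 1) and v''(a) = (k - 1) * a**(k - 2),
    so the result equals k - 1 at every effort level a > 0.
    """
    return cost_elasticity(
        lambda x: x ** (k - 1),
        lambda x: (k - 1) * x ** (k - 2),
        a,
    )


if __name__ == "__main__":
    for k in (1.5, 2.0, 3.0):
        print(k, power_cost_elasticity(k, a=2.0))
```

Under this assumed family, the exponent k alone fixes how quickly marginal cost rises, which makes it a convenient one-parameter way to experiment with the conditions discussed in the text.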


The Benefits of Collective Action 


We turn now to an exploration of the benefits of group 
action and to the way in which efforts influence these 
benefits. We allow for options that have both public 
and private components because we wish to examine a 
view that appears to be common wisdom (at least since
Chamberlin [1974]): Individual effort always decreases 
as group size increases, but aggregate collective action 
increases when the good is purely public and decreases 
when the good is purely private. To this end, each 
option will have a public component, to be denoted by 
P, and a private component, denoted by M. 


money. In that case, v(a) is the utility cost of the contributed amount
of money.

Taylor (1987, chap. 2) points out that Olson’s argument critically
depends on the assumption that costs can simply be subtracted from
the benefits and mentions that Olson (1965, 29, 46) was aware of this
analytical weakness. We shall show that this important observation
has implications far stronger than the ones derived by Taylor through
his diagrammatic analysis.
8 See the summary in Taylor 1987, chap. 2, or Sandler 1992, chap. 2.

By a public component P, we refer to a component
that all group members enjoy equally, irrespective of 
group size. By assumption, outsiders have no access to 
the public benefits of a group and may even be hurt by 
such a policy. Various causes come under this label: 
“better trade terms for developing countries,” “abor- 
tion rights,” “establishment of a dominant Hindu 
state,” “a shorter working day,” “saving the dolphins,” 
and so on. Observe that a public component, while 
often nonmonetary, may well be monetary: A 10% rate 
of tariff protection to a group of domestic producers 
will offer the same benefit to each producer regardless 
of the number of producers. In contrast, a private 
component M is typically, although not always, mone- 
tary, the important characteristic being that it is dissi- 
pated with group size. As examples, consider “an 
additional $1 billion for Medicare,” “an additional 
increase in the U.S. immigration quota,” or “allocation 
of water rights.” Note that the last two examples are 
not directly monetary. The point is that a public 
component is unaffected by group size, while a private 
component is affected. 

We assume that any private, divisible part of the
collective good is shared by the $N_i$ members of group i
on an equal division basis. Let $\lambda \in [0, 1]$ be the share
of publicness in any alternative. Then, the per-capita
benefit to each member of group i, provided that
option i is chosen by society, is

$$w_i \equiv w(\lambda, N_i) = \lambda P + (1 - \lambda)\frac{M}{N_i}, \tag{4}$$

and the benefit is normalized to be zero if any other
option is chosen.

Observe that this formulation posits perfect symme- 
try among all options, except for the possible dilution 
of per-capita rewards because of varying group size. 
Once again, this assumption is a deliberate attempt to 
focus on the effects of group size alone. 

Notice that $\lambda = 0$ corresponds to the case of a purely
private, distributable good, whereas $\lambda = 1$ corresponds
to a pure public good with perfect nonexcludability
within the group. Thus, $\lambda$ serves as a useful parame-
terization of the degree of publicness in a collective
good. 

We emphasize that the perceived share of publicness
in a given collective good depends on group size. For
given $\lambda$, P, and M, the larger the group, the larger is the
perceived share of the public component. We shall
denote by $\theta_i$ the share of publicness as perceived by an
individual member of group i. That is,

$$\theta_i \equiv \theta(\lambda, N_i) = \frac{\lambda P}{\lambda P + (1 - \lambda)(M/N_i)}. \tag{5}$$
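
The two definitions above can be sketched in a few lines of code. This is an illustrative sketch only: the function names are hypothetical, the per-capita benefit is taken to be λP + (1 − λ)M/n, and the perceived share of publicness is taken to be λP divided by that benefit, with P and M assumed positive.

```python
def per_capita_benefit(lam, P, M, n):
    """Per-capita benefit lam*P + (1 - lam)*M/n for a group of size n."""
    return lam * P + (1.0 - lam) * M / n


def perceived_publicness(lam, P, M, n):
    """Share of the per-capita benefit that comes from the public part."""
    return lam * P / per_capita_benefit(lam, P, M, n)


if __name__ == "__main__":
    # Hypothetical values: the same prize looks more "public" to a larger group.
    for n in (5, 50, 500):
        print(n, per_capita_benefit(0.5, 1.0, 10.0, n),
              perceived_publicness(0.5, 1.0, 10.0, n))
```

With λ, P, and M held fixed, the private part M/n shrinks as n grows, so the perceived share rises with group size, as the text states.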

We now discuss how the choice of options depends
on effort. Denote by $A_i$ the total effort contributed by
group i. (Recall that this is just the sum of individual
efforts within the group.) We assume that the proba-
bility of success for group i, which is just the probability
$\pi_i$ that option i will be chosen, equals the effort level of
group i relative to the aggregate amount of effort A
exerted by all groups. That is,

$$\pi_i = \frac{A_i}{A}. \tag{6}$$

Therefore, $\pi_i w_i$ is the expected value of the collective
good to each member of group i. Under this specifica-
tion, the marginal return to an additional unit of
individual effort (as well as to an additional unit of
group effort) is positive but decreasing in effort.10


Equilibrium

Expected utility per capita is given by

$$\frac{A_i}{A}\,w(\lambda, N_i) - v(a_i).$$

Each individual in each group takes as given the efforts
contributed by everyone else in society (including
fellow group members) and chooses $a_i$ to maximize
expected utility. Our end-point and curvature condi-
tions guarantee that this choice is interior and is
completely described by the first-order condition

$$\frac{1}{A}\left(1 - \frac{A_i}{A}\right) w(\lambda, N_i) - v'(a_i) = \frac{1}{A}(1 - \pi_i)\,w(\lambda, N_i) - v'(a_i) = 0. \tag{7}$$

An equilibrium is a vector of individual contributions
such that equation 7 is satisfied for every individual in
every group. From equation 7, it is clear that in any
equilibrium the choice made by each member of any
given group will be identical. That is, $A_i = N_i a_i$, so that

$$a_i = \frac{A_i}{N_i} = \frac{\pi_i A}{N_i}. \tag{8}$$

Using equation 8, we can rewrite equation 7 as

$$\phi(\pi_i, A, N_i) \equiv \frac{1}{A}(1 - \pi_i)\left[\lambda P + (1 - \lambda)\frac{M}{N_i}\right] - v'\!\left(\frac{\pi_i A}{N_i}\right) = 0. \tag{9}$$

An equilibrium can now be reinterpreted as a vector of 
success probabilities—therefore adding up to unity— 
and a positive number A, such that equation 9 is 
satisfied for all groups. A is an indicator of scale: It tells
us the aggregate amount of collective action that is 
created in equilibrium. 

It is very easy to check that an equilibrium always 
exists and is unique. Provisionally view $\pi_i$ as a param-
eter in equation 9, and observe that


9 The model can easily be rewritten with success probabilities
depending on individual effort (see Skaperdas 1996), but
in that case the interpretation of the nonlinearity of v becomes more
complex.

10 Marwell and Oliver (1993) use the term “decelerating” to describe 
this phenomenon. 
$$\phi \to \frac{1}{A}\left[\lambda P + (1 - \lambda)\frac{M}{N_i}\right] > 0 \quad \text{as } \pi_i \downarrow 0,$$

whereas $\phi \to -v'(A/N_i) < 0$ as $\pi_i \uparrow 1$. Furthermore,
it is obvious that $\phi$ is strictly decreasing in $\pi_i$. There-
fore, for each given A and $N_i$, there is a unique value of
$\pi_i$ that satisfies equation 9. In other words, equation 9
implicitly defines $\pi_i$ as a function of A and $N_i$:
$\pi_i = \pi(A, N_i)$. The equilibrium value of A is then deter-
mined by the condition that

$$\sum_{i=1}^{G} \pi(A, N_i) = 1. \tag{10}$$

It is easily seen that $\pi(A, N_i)$ is strictly decreasing in A
and varies between 0 and 1. This completes the dem-
onstration of existence and uniqueness of equilibrium.
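
The existence argument suggests a simple numerical procedure, sketched below under stated assumptions. For a trial scale A, each group's win probability is found by bisection on the group condition (1/A)(1 − π)[λP + (1 − λ)M/n] − v'(πA/n) = 0, which is strictly decreasing in π. A is then adjusted by a second bisection until the probabilities sum to one. The power cost v(a) = a^k/k with k > 1, the function name `solve_equilibrium`, and all parameter values are hypothetical choices for illustration, not the authors' procedure.

```python
def solve_equilibrium(sizes, lam, P, M, k, iters=100):
    """Return (win_probabilities, A) for the assumed cost v(a) = a**k / k, k > 1.

    sizes: list of group sizes (at least two groups); lam in [0, 1]; P, M > 0.
    """
    if len(sizes) < 2:
        raise ValueError("need at least two competing groups")

    def w(n):
        # Per-capita benefit lam*P + (1 - lam)*M/n.
        return lam * P + (1.0 - lam) * M / n

    def phi(pi, A, n):
        # Group condition with v'(a) = a**(k - 1) and a = pi * A / n.
        return (1.0 - pi) * w(n) / A - (pi * A / n) ** (k - 1)

    def pi_of(A, n):
        # phi is positive near pi = 0 and negative at pi = 1: bisect on pi.
        lo, hi = 0.0, 1.0
        for _ in range(iters):
            mid = 0.5 * (lo + hi)
            if phi(mid, A, n) > 0.0:
                lo = mid
            else:
                hi = mid
        return 0.5 * (lo + hi)

    def total(A):
        return sum(pi_of(A, n) for n in sizes)

    # Bracket A: the sum of probabilities falls as A grows.
    A_lo, A_hi = 1.0, 1.0
    while total(A_lo) < 1.0:
        A_lo *= 0.5
    while total(A_hi) > 1.0:
        A_hi *= 2.0
    for _ in range(iters):
        A_mid = 0.5 * (A_lo + A_hi)
        if total(A_mid) > 1.0:
            A_lo = A_mid
        else:
            A_hi = A_mid
    A = 0.5 * (A_lo + A_hi)
    return [pi_of(A, n) for n in sizes], A


if __name__ == "__main__":
    # Purely private prize (lam = 0), groups of 10 and 40 members.
    for k in (1.5, 2.0, 3.0):
        print(k, solve_equilibrium([10, 40], 0.0, 1.0, 100.0, k))
```

In this two-group example with a purely private prize, the quadratic case (k = 2) leaves win probabilities independent of size, a more convex cost (k = 3) favors the larger group, and a less convex cost (k = 1.5) favors the smaller one.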


GROUP SIZE AND COLLECTIVE ACTION 


Winning Probabilities


We now analyze the effect of group size on the 
provision of collective goods and present a pair of 
propositions. Both are in sharp contrast to the common 
wisdom that Olson’s result hinges on whether the 
collective good is private or public. We show that 
whenever preferences are nonlinear, that is, $\alpha(a) > 0$, Olson’s tenet that the level of collective action diminishes with group size does not generally hold, irrespective of the degree of privateness of the collective good. If preferences are “sufficiently” nonlinear (in a sense to be made precise below), Olson’s result is not true and, indeed, is exactly reversed.

As noted earlier, the phrase “larger groups have 
higher win probabilities” is ambiguous. It might refer 
to an examination of large versus small groups at some 
given equilibrium or to the implications of increasing 
the size of some group, in which case the comparison is 
across equilibria. 

Proposition 1 adopts the first of these interpreta- 
tions. 

PROPOSITION 1. Consider the equilibrium of the game described above.

a. Whenever $\inf_a \alpha(a) > 1$, the level of collective action (and therefore winning probabilities) is strictly increasing in group size for all $\lambda \in [0, 1]$, that is, irrespective of the degree of public/privateness of the collective good.
b. More generally, winning probabilities are increasing over a pair of group sizes $n$ and $n'$, where $n < n'$, if

$$\theta(\lambda, n) \geq 1 - \inf_a \alpha(a). \qquad (11)$$

Provided $\inf_a \alpha(a) > 0$ and the good is not fully private, this condition is satisfied for sufficiently large group sizes, and for any pair of group sizes, it is satisfied for $\lambda$ close enough to unity.


Proof. We prove the proposition by examining the behavior of $\pi_i$ over the cross-section of groups, keeping $A$ unchanged at its equilibrium value. The easiest way to do this is to pretend that $N_i$ is a continuous variable in equation 7 and to differentiate $\pi_i$ with respect to $N_i$. Some tedious calculations reveal that this derivative is given by

$$\frac{d\pi_i}{dN_i} = \frac{\pi_i}{N_i}\cdot\frac{\alpha(a_i) - \bigl(1 - \theta(\lambda, N_i)\bigr)}{\alpha(a_i) + \dfrac{\pi_i}{1-\pi_i}}. \qquad (12)$$


Notice that this derivative is guaranteed to be positive when $\inf_a \alpha(a) > 1$, so that part a is immediately established.

Condition 11 of part b is also a nearly immediate consequence of equation 12. We only need to observe that $\theta(\lambda, n)$ is a nondecreasing function of $n$, so that the nonnegativity of the derivative evaluated at $N_i = n$ is sufficient. The fact that condition 11 is satisfied for large group sizes or for $\lambda$ close to unity (provided that $\inf_a \alpha(a) > 0$) follows simply from the fact that $\theta(\lambda, n) \to 1$ either as $\lambda \to 1$ or as $n \to \infty$. Q.E.D.

In the next proposition, we investigate what happens when the size of a particular group increases. This involves a comparison across equilibria. Remarkably enough, the results tightly parallel those of proposition 1.


PROPOSITION 2. Suppose that the size of one group (of size $n$) is increased, but all other group sizes are kept unchanged. Then, under the new equilibrium, the winning probability of this group increases if condition 11 holds:

$$\theta(\lambda, n) \geq 1 - \inf_a \alpha(a).$$

Just as in proposition 1, this condition is automatically satisfied when $\inf_a \alpha(a) \geq 1$, in which case the winning probability increases irrespective of the degree of public/privateness of the collective good.


Proof. Let group i have size $N_i = n$. Just as in the proof of proposition 1, use equation 7 to differentiate $\pi_i$ with respect to $n$, keeping $A$ unchanged for the moment. We already know that this derivative is positive if condition 11 holds. Put another way, we have learned so far that $\pi(A, n)$ is increasing in $n$ for fixed $A$ (assuming condition 11 applies).

Now we have to allow for the change in equilibrium $A$. Notice that our exercise so far allows us to conclude that

$$\sum_{j=1}^{G} \pi(A, N_j)$$

has increased, evaluated at the earlier value of $A$ and $N_j$ for $j \neq i$, and the new value of $N_i$. Therefore, using equation 10, the new equilibrium value of $A$ must increase. But this proves that, for every group other than i, the equilibrium win probability must strictly fall.
[Figure 1. Horizontal axis: elasticity of marginal cost with respect to effort ($\alpha$); vertical axis: degree of publicness of the collective good ($\lambda$). Above the graphed line, the Olson result is overturned: an increase in group size raises group effectiveness. Below the line, the Olson result may or may not hold, depending on group size.]


Because win probabilities sum to one, this completes the 
proof. (The other particular implications follow just as in 
proposition 1.) Q.E.D. 

For expository convenience, assume for the moment that the elasticity of marginal cost of effort, $\alpha$, is constant. If the cost function is itself of the constant-elasticity form $v(a) = a^{1+\alpha}$, it is easy to verify that the elasticity of marginal cost will also be constant and exactly equal to $\alpha$. Then one can graphically view the region for which the Olson result is overturned (see Figure 1). The figure plots different elasticities of marginal cost ($\alpha$) on the horizontal axis, and different degrees of publicness of the collective good ($\lambda$) on the vertical axis.

Let us suppose that we know that all group sizes are at least a certain minimum; call this $n_{\min}$. Figure 1 invokes condition 11 for this minimum group size, and this is used to divide the diagram into two regions. For any combination of parameters in the upper region, the Olson result must be overturned. That is, for any starting size (not lower than $n_{\min}$), an increase in group size must increase group effectiveness. In contrast, for parametric configurations in the lower region, the Olson result may or may not be valid, depending on how large the starting group size is (the larger is this starting size, the more likely it is that the result is invalid, even in this region). The only point that guarantees the Olson result is the origin of the diagrammatic axes (corresponding to both $\alpha = 0$ and $\lambda = 0$), which may be interpreted as a linear cost function together with a fully private prize. No other configuration of parameters can guarantee the Olson result for arbitrary group sizes.

Notice that when the elasticity of marginal cost 
attains the value one (corresponding to a quadratic 
cost function for effort), the group size paradox is 
“fully” reversed, and remains so for all higher elastic- 
ities. Irrespective of the characteristics of the collec- 
tive prize or the starting group size, larger groups are 
then more effective. Figure 1 graphically displays this 
by yielding all space to the upper region to the “right” 
of the point where $\alpha = 1$.
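The dividing line in Figure 1 can be traced numerically from condition 11. The following sketch is illustrative only and rests on an assumption not restated in this section: that $\theta(\lambda, n) = \lambda P / [\lambda P + (1-\lambda)M/n]$, the share of the per-capita reward that does not shrink with group size. The function names and the values of $P$, $M$, and $n_{\min}$ are hypothetical.

```python
def theta(lam, n, P=1.0, M=1.0):
    """Assumed definition: public share of the per-capita reward."""
    w = lam * P + (1.0 - lam) * M / n
    return lam * P / w


def olson_overturned(alpha, lam, n_min, P=1.0, M=1.0):
    """Condition 11 evaluated at the minimum group size, constant elasticity alpha."""
    return theta(lam, n_min, P, M) >= 1.0 - alpha


def boundary_lambda(alpha, n_min, P=1.0, M=1.0):
    """Smallest lambda satisfying condition 11 for a given alpha (the graphed line)."""
    if alpha >= 1.0:
        return 0.0  # condition 11 holds for every lambda in [0, 1]
    k = (1.0 - alpha) * M / n_min
    return k / (alpha * P + k)


if __name__ == "__main__":
    n_min = 5  # hypothetical minimum group size
    for alpha in (0.0, 0.25, 0.5, 0.75, 1.0):
        print(alpha, round(boundary_lambda(alpha, n_min), 4))
```

Under these assumptions the boundary falls from $\lambda = 1$ at $\alpha = 0$ to $\lambda = 0$ at $\alpha = 1$, and a larger $n_{\min}$ pulls the whole line downward, which matches the remark that larger starting sizes make the Olson result less likely to hold.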
Discussion


Propositions 1 and 2 stand in sharp contrast to the 
common wisdom that collective action increases or 
decreases with group size, depending on whether the 
collective good is purely public or purely private, 
respectively. Instead, we prove that there is no inverse relation between collective action and group size except under either of two extreme assumptions: (1) individual indifference curves are straight lines (i.e., $\alpha(a) = 0$), or (2) $\alpha(a) \in [0, 1]$ and the collective good is purely private (i.e., $\lambda = 0$).

Indeed, if $\alpha \geq 1$, the Olson assertion is fully overturned. Collective action increases with group size even
when the prize is purely private, and this is true a 
fortiori when some degree of publicness is introduced. 
Moreover, even if the cost function exhibits lower 
elasticity (but nevertheless continues to be strictly 
convex), proposition 1 argues that there will always be 
some vector of group sizes for which the Olson thesis is 
false.11 In summary, the private/public distinction mat- 
ters, to be sure, but may be entirely swamped by 
considerations that involve the disutility cost of effort 
supply. This offers new insights regarding the Olson 
thesis, especially in light of our argument that some 
curvature in the cost function is empirically likely to be 
the rule rather than the exception. 

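To obtain further intuition for the propositions, we drop the subscripts in equation 12 and rewrite it as

$$\frac{N}{\pi}\,\frac{d\pi}{dN} = \frac{\alpha(a) - \bigl(1 - \theta(\lambda, N)\bigr)}{\alpha(a) + \dfrac{\pi}{1-\pi}}.$$
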
This expression states that the proportional change in the size of collective action induced by a given percentage increase in group size is proportional to the difference between the elasticity of the marginal rate of substitution and the elasticity of the value of the collective good with respect to group size. Notice that the reciprocal expression $1/\alpha(a)$ is the elasticity of individual effort with respect to the marginal compensating reward. Thus, low values of $\alpha$ correspond to extremely elastic responses of individual effort supply to small variations in the marginal reward. Therefore, the propositions say that collective action will decrease or increase with group size depending on whether the proportional effect on the reward $(1 - \theta)$ does or does


11 Indeed, if $\alpha(a) = (1 - \lambda)M/[\lambda P + (1 - \lambda)M]$, the Olson thesis

with group size for all $\alpha(a) \geq 0$ and $\lambda \in [0, 1]$, a result known since Chamberlin (1974); see also Riaz, Shogren, and Johnson 1995.

13 To see why $1 - \theta(\lambda, N)$ serves as a measure of the latter elasticity, note that

$$\frac{\partial w}{\partial N}\,\frac{N}{w} = -\frac{(1-\lambda)(M/N)}{\lambda P + (1-\lambda)(M/N)} = -\bigl(1 - \theta(\lambda, N)\bigr).$$

Therefore, the degree of perceived privateness $(1 - \theta)$ can also be interpreted as the elasticity of $w$ (the reward) with respect to group size.
not induce a sufficiently elastic response in individual
effort supply. 

The sign and value of $\alpha$ play a critical role in our result. Which assumptions about $\alpha$ are most plausible?
Chamberlin (1974) introduced the distinction among 
three categories of collective goods: inferior, normal, 
and superior. A collective good is inferior if the 
individual responds to an increase of one unit in effort 
by the rest of the group with a reduction in personal 
effort of no less than one unit; it is normal if the 
personal reduction is strictly less than one unit; it is 
superior when the personal response is increased ef- 
fort. 

In order to see which assumptions would give rise to each category of goods, we differentiate $a_i$ with respect to $A_{-i}$ in the best response function, equation 7, where $A_{-i} \equiv A_i - a_i$. After performing this differentiation and dividing each side by the corresponding side of equation 7, we find that

$$\frac{da_i}{dA_{-i}} = -\frac{1}{1 + \alpha(a_i)\,\dfrac{A}{2a_i}}.$$

It follows that the collective good is normal if and only if $\alpha(a) > 0$. Therefore, our main result simply requires the collective good to be normal in the sense of Chamberlin.14


Another Notion of Group Effectiveness 


A second definition of group effectiveness relates 
group size to per-capita payoffs. This relationship 
cannot, in general, be predicted by the change in 
winning probabilities (as described in propositions 1 
and 2). For instance, when the good is purely private, it 
may be true that larger groups have a higher win 
probability (if $\alpha > 1$). Moreover, as is well known
(Chamberlin 1974), they also put in lower effort per 
capita. Yet, larger numbers do diminish the per-capita 
value of the prize. Therefore, holding constant the 
overall value of the prize, large groups are at some 
intrinsic disadvantage in terms of payoffs. 

If the good is purely public, then this disadvantage 
vanishes altogether, and we are simply left with the two 
positive effects for large groups. This informal discus- 
sion suggests that our second notion of effectiveness 
may be more closely tied to the private/public distinction. 


PROPOSITION 3. In equilibrium, the expected utility of each player increases with group size when the collective good is purely public ($\lambda = 1$) and decreases when it is purely private ($\lambda = 0$).

Proof. Write individual expected utility in an equilibrium as

$$\pi(A, N_i)\,w(\lambda, N_i) - v(a_i),$$



14 This result represents a substantial exploration of the observation made by Taylor (1987, chap. 2) regarding the critical role of flatness in the individual indifference curves between reward and effort.
where $A$ satisfies the equilibrium condition 10. Again, treating $N_i$ as a continuous variable, differentiate this expression with respect to $N_i$ but keep $A$ fixed. The idea is that we are moving over a cross-section of groups in a given equilibrium. If we carry out this exercise and manipulate the results a bit, we see that

$$\frac{\partial u_i}{\partial N_i} = \frac{\pi(A, N_i)\,w(\lambda, N_i)}{N_i}\left\{\theta(\lambda, N_i) - \left[1 - \frac{1 - \pi(A, N_i)}{N_i}\right]\left[1 - \frac{N_i}{\pi(A, N_i)}\,\frac{\partial \pi(A, N_i)}{\partial N_i}\right]\right\}. \qquad (13)$$

The sign of this derivative depends on whether the product of the two brackets is smaller or larger than $\theta(\lambda, N_i)$. Since $N_i \geq 1$, it is plain that the first bracket is positive and does not exceed 1. As for the second bracket, we know, paraphrasing Chamberlin (1974) (see note 12), that it is always positive.

When the collective good is purely private, we have $\lambda = 0$ and $\theta(\lambda, N_i) = \theta(0, N_i) = 0$. Consequently, the derivative in equation 13 is strictly negative: In equilibrium, members of larger groups attain lower levels of per-capita utility than members of smaller groups.

When the collective good is purely public, we have $\lambda = 1$ and $\theta(\lambda, N_i) = \theta(1, N_i) = 1$. Moreover, by proposition 1, win probabilities rise with group size. It follows that the value of the second square bracket in equation 13 does not exceed 1. Consequently, when the good is purely public, the derivative in equation 13 is positive: Members of larger groups attain higher levels of per-capita utility than members of smaller groups. Q.E.D.

The argument behind this result is quite straightfor- 
ward. Consider the case of a pure public good. By 
proposition 1 we know that, provided it is a normal 
good, the larger the group, the smaller is the individual 
effort, but the higher is the level of collective action.
Since the size of the group does not reduce the 
availability of the collective good to individual mem- 
bers, membership in a larger group has the effect of 
increasing the benefit and reducing the costs. Larger 
groups are more desirable on all counts. When the 
good is purely private, an increase in group size still has 
the effect of reducing individual effort (but the effect on 
win probabilities is ambiguous). This effect possibly 
enhances individual utilities, but (as the proposition 
shows) it is never enough to counteract the fall in 
per-capita benefit created by larger group size. 

Combining propositions 1 (or 2) and 3, we see that 
there are situations in which collective actions and 
utilities do not move in the same direction, so that our 
two notions of group effectiveness are really distinct. 
For instance, when $\inf_a \alpha(a) > 1$ and the collective
good is purely private, larger groups contribute more 
resources and therefore enjoy larger win probabilities 
(proposition 1). By proposition 3, however, they must 
have lower payoffs at the individual level. 

Notice that proposition 3 is not as comprehensive as proposition 1, in that it does not characterize group utilities in the intermediate cases. Observe, however, that whenever the good is not purely private (i.e., $\lambda \in (0, 1]$), $\theta$ tends to 1 as $N_i$ becomes large. It can be easily seen from equation 13 that, for sufficiently large $N_i$, this expression is strictly positive, so that further increases in group size will increase the equilibrium utility of members. It follows that whenever the collective good has some public content, large groups may do better (depending on the configuration of group sizes in society).


CONCLUSION 


The common wisdom concerning Olson’s thesis is that 
small groups are more effective, even when the collec- 
tive prize is fully private, but that this relationship is 
overturned when the prize is fully public. By explicitly
modeling the costs of effort, as well as situations in 
which the prize has mixed characteristics, we not only 
extend the common wisdom but also show that there 
are cases in which it is false. 

In particular, if marginal costs rise sufficiently fast 
with contributions, even when the prize is purely 
private, large groups have higher win probabilities than 
small groups (propositions 1 and 2). The Olson result is 
critically dependent on the linearity of cost functions, 
an unrealistic assumption that we question here. 

When a good is fully private, however, large groups 
have lower per-capita payoffs than smaller groups, if we 
control for the overall size of the prize (proposition 3). 
This payoff-based notion of effectiveness is different 
from the win-based notion that has received attention, 
but it certainly should be considered in theories of
group formation. We do not pursue it here, but some 
remarks may be useful. 

First, it is unclear whether group formation occurs 
exclusively on the basis of per-capita payoffs, or 
whether effectiveness in the sense of win probabilities 
(the perception of being successful) also is a factor. If 
we view firms as an instance of groups, we see that this 
ambiguity is closely related to the age-old question of 
what firms “maximize”: profits, size and presence, 
other public perceptions of success, or some combina- 
tion of these? To the extent that these other factors 
also matter, win-based notions of effectiveness enter 
into the theory of group formation, and there is no 
guarantee that society will be splintered into small 
Olson-style lobbies. 

Second, even if per-capita payoffs are the sole crite- 
rion for group formation, proposition 3 (unfortunately) 
throws little light on the issue of small versus large 
group selection. The reason is that the proposition is 
unequivocal only in the extreme cases of pure private- 
ness or publicness. Just how much privateness is re- 
quired (under this criterion) for the small group effect 
to dominate remains an interesting and open question. 

Finally, it is important to remember that many 
groups are defined by their ideal points, and there may 
be little or no room for group formation. So there is no 
necessary contradiction between the possibility that 
large groups have low per-capita payoffs yet still exist. 
We analyze party discipline in the House of Representatives between 1947 and 1998. The effects of party pressures can be represented in a spatial model by allowing each party to have its own cutting line on roll call votes. Adding a second cutting line makes, at best, a marginal improvement over the standard single-line model. Analysis of legislators who switch parties shows, however, that party discipline is manifest in the location of the legislator’s ideal point. In contrast to our approach, we find that the Snyder-Groseclose method of estimating the influence of party discipline is biased toward exaggerating party effects.


scholarly investigation of how political parties and their leaders influence legislative institutions and
behavior (see Aldrich 1995; Cox and McCubbins 1993;
Rohde 1991; Sinclair 1995). Much of the focus is on the 
collective action problems that are inherent in the 
legislative and electoral processes. Cox and McCubbins 
(1993), who conceptualize political parties as cartels 
that direct legislative activity to enhance the collective 
electoral fortunes of their members, provide a typical 
variant on this theme but by no means the only one. 
The primary function of such a cartel is to build a 
collective reputation on which its members can run. 
They argue, however, that without strong leadership 
members have individual incentives to engage in legis- 
lative activities (such as pork) that diminish the collec- 
tive reputation. 

The focus on collective action has generated much 
interest in the cohesiveness of parties as floor coali- 
tions.1 The principal prediction is that a party produces 
a more cohesive coalition than would be possible if 
members were to act on their individual preferences. 
Rohde (1991) uses evidence of greater party cohesion 
since 1975 to demonstrate an increasing role of party in 
the postreform House. Aldrich, Berger, and Rohde 
(1999) choose party voting as the main dependent 
variable to test the predictions of “conditional party 
government,” and Cox and McCubbins (1993) use 
member support on leadership votes to test for the role 
of leaders in creating voting coalitions. Furthermore, 
some scholars, including Rohde (1991), see a reassertion of party strength behind the growing cohesiveness and polarization of congressional parties since the mid-1970s (for alternative explanations, see King 1998; McCarty, Poole, and Rosenthal 1997; and Poole and Rosenthal 1984).

As Krehbiel (1993, 1998) points out, however, the
patterns of behavior uncovered in these empirical 
studies are consistent with both theories of strong, 
influential parties and nonpartisan models in which 
member preferences are sorted along party lines. This 
dilemma is exacerbated by the difficulty of measuring 
legislative preferences. Ideally, some exogenous measure of member preferences should be used to test
party theories. Voting behavior under the null hypoth- 
esis of no party influence then could be compared with 
actual voting behavior. The problem is that the usual 
measures of legislative preferences are derived from 
the voting behavior itself. 

We attempt to untie this Gordian knot. We begin by 
reviewing the evidence from work on congressional roll 
call votes that hypothesizes sincere spatial voting. We 
discuss how this evidence suggests the presence of 
some party discipline. We next examine the party 
discipline model of Snyder and Groseclose (2000) and 
argue that their method both seriously biases the 
estimate of ideal points for ideological moderates and overestimates the extent of party discipline. We provide a compelling theoretical illustration of the bias. A more extensive technical discussion can be found in McCarty, Poole, and Rosenthal (2000, Appendix B).

To assess party discipline (or pressure) properly, we 
propose an alternative approach. The basic idea is very 
simple. We start with Krehbiel’s (1993, 1998) proposal 
that the spatial model of purely preference-based 
voting is the appropriate benchmark for evaluation of 
models that incorporate party effects. In one dimen- 
sion, the spatial model asserts that, on each roll call, 
“yea” and “nay” voters are separated by a cutpoint on 
the liberal-conservative continuum. If the Republicans 
apply pressure to their membership, some moderates 
to the left of the “sincere” cutpoint will vote with the 
conservative wing. Republicans will have a cutpoint to 
the left of the sincere cutpoint, and Democrats will 
have a cutpoint to the right of it. 

To illustrate this point, we assume some overlap 
between the parties: Certain Democrats, historically
largely southerners, are more conservative than the 
most liberal Republicans. A single cutpoint model 
would always predict that either some Republicans 
vote with the majority of Democrats or vice versa. But 
if enough pressure were applied to turn a roll call into 
a straight party vote, the voting pattern would be 
captured better by a Democratic cutpoint to the right 
of the most conservative Democrat and a Republican 
cutpoint to the left of the most liberal Republican. 
More generally, when one or both parties apply pres- 
sure, the voting patterns should look as if there were 
separate cutpoints for each party, with the Democratic
cutpoint being to the right of the Republican cutpoint. 
Therefore, if pressure is important, we should find a 
better fit to the data when we estimate two cutpoints 
rather than one. We will return to this point in
discussing Figure 4. 
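The comparison between one cutpoint and two can be sketched with a toy roll call. The code below is a simplified illustration, not the optimal classification procedure used in the article: it takes the legislators' positions as given rather than estimating them, and it assumes both party cutpoints share the same polarity on a roll call. The positions, party labels, and votes are hypothetical.

```python
def candidate_cuts(positions):
    """Cutpoints to try: one beyond each end and one between adjacent positions."""
    pts = sorted(set(positions))
    mids = [(a + b) / 2.0 for a, b in zip(pts, pts[1:])]
    return [pts[0] - 1.0] + mids + [pts[-1] + 1.0]


def errors(positions, votes, cut, left_vote):
    """Count misclassified votes when everyone left of `cut` votes `left_vote`."""
    wrong = 0
    for x, v in zip(positions, votes):
        predicted = left_vote if x < cut else 1 - left_vote
        wrong += predicted != v
    return wrong


def best_single_cut(positions, votes):
    """Minimum classification errors with one cutpoint for the whole chamber."""
    return min(errors(positions, votes, c, lv)
               for c in candidate_cuts(positions) for lv in (0, 1))


def best_party_cuts(positions, parties, votes):
    """Minimum classification errors with a separate cutpoint for each party."""
    best = None
    for lv in (0, 1):  # same polarity for both parties (an assumption)
        total = 0
        for party in sorted(set(parties)):
            pos = [x for x, p in zip(positions, parties) if p == party]
            vts = [v for v, p in zip(votes, parties) if p == party]
            total += min(errors(pos, vts, c, lv) for c in candidate_cuts(pos))
        best = total if best is None else min(best, total)
    return best


# Hypothetical chamber, ordered from liberal (1) to conservative (8),
# with one Democrat (position 5) to the right of one Republican (position 4).
positions = [1, 2, 3, 4, 5, 6, 7, 8]
parties = ["D", "D", "D", "R", "D", "R", "R", "R"]
votes = [1, 1, 1, 0, 1, 0, 0, 0]  # straight party vote: Democrats yea

single_errors = best_single_cut(positions, votes)
party_errors = best_party_cuts(positions, parties, votes)
```

In this example the straight party vote cannot be classified perfectly by any single cutpoint, whereas a Democratic cutpoint to the right of position 5 and a Republican cutpoint to the left of position 4 classify every vote. The gain from the second cutpoint is the kind of improvement the text proposes to measure.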

To keep the analysis as simple as possible, we use 
nonparametric optimal classification analysis; legislator 
ideal points and roll call cutpoints are jointly rank 
ordered to maximize predictive success on roll call 
votes (Poole 2000). By classifying the voting of each 
party independently and then comparing the results to 
classifying both parties together, we can evaluate the
maximum possible improvement in correct classifica- 
tion attributable to party discipline. An advantage of 
the cutpoint approach is that it does not require any 
assumptions about which specific roll calls are subject 
to party pressure. A particular advantage of the non- 
parametric approach is that it assumes only that the 
amount of pressure applied to individual members 
does not change the order of their induced ideal points. 
It does not require making parametric assumptions 
about how pressure varies with the ideal point of the 
individual member, such as equal pressure being ap- 
plied to all. On the basis of our optimal classification 
analysis, we conclude that allowing for party discipline 
affords only a very marginal improvement over the sincere spatial model, particularly in recent Congresses.

Where, then, is party discipline? We argue that the 
main influence of party discipline is not on the votes on 
specific roll calls but on the choice of ideal point made 
by the representative. The smoking gun is provided by 
the great change in ideological position demonstrated 
by the few legislators who have switched parties.
Wayne Morse and Strom Thurmond are two well- 
known examples in the postwar Senate. The Democrats 
who defected to the Republicans after the 1994 elec- 
tion made equally dramatic shifts. Our finding that 
parties shape ideal points ends our hunt for party 
discipline in roll call voting.


INDEPENDENT VOTING ON THE FLOOR: 
THE EVIDENCE FROM THE SPATIAL 
MODEL 


The standard spatial model provides a benchmark 
approach to independent floor voting. Poole and 
Rosenthal (1991, 1997) demonstrate that the model is 
quite successful in accounting for floor decisions. With 
two dimensions, one can correctly predict roughly 85%
of the individual decisions—even on close roll calls— 
between 1789 and 1985. McCarty, Poole, and 
Rosenthal (1997, 7) report additional results for 1947-95.2 In recent Congresses, a one-dimensional model
classifies nearly 90% of the individual decisions. 

The spatial estimates strongly suggest that party 
influence underpins much of this remarkable classifi- 
cation success. First, in Congresses in which voting is 
largely one dimensional, party-line votes are along the 
main dimension. The distribution of ideal points is strongly bimodal. The two parties appear as two very distinct “clouds” that barely overlap, particularly in recent years. (As an illustration, see McCarty, Poole, and Rosenthal 1997, 11.) The presence of a “channel”
between the clouds suggests that party affiliation may 
discipline the roll call voting behavior of members. The 
main dimension of political conflict clearly appears to 
reflect partisan conflict. Parties perhaps also influence 
their members’ votes on specific roll calls. 

Second, in Congresses in which voting is two dimen- 
sional, there also are two distinct clouds separated by a 
channel. Party-line votes are no longer on the main 
dimension but are a blend of the first and second 
dimensions. (See Poole and Rosenthal 1991, 233, or
1997, 44, for an example.) An interpretation of such 
plots is that ideal points projected onto roughly a 45° 
line represent the ideological (liberal-conservative) di- 
mension. The orthogonal projection, roughly at —45°, 
represents a party loyalty or valence dimension. Most 
votes occur along the main, 0° dimension. On these 
votes, the legislator’s decision depends both on ideol- 
ogy and on party loyalty. 

Although the spatial model shows that the structure 
of voting coalitions in Congress coincides strongly with 
party affiliation, it does not prove that party per se has 
any influence on voting behavior. Party-line voting is, 
of course, consistent with both strong party models and 
ideological models in which preferences are sorted by 
parties. In the sections that follow, we review a recent 
attempt to separate partisan effects from preferences 
and then propose a method of our own. 


THE SNYDER-GROSECLOSE MODEL OF 
PARTY DISCIPLINE 


One inherent problem in identifying the effects of party 
is that we observe only behavior, which is presumably a 
mix of individual preferences and party influence. This 
problem is particularly acute with congressional voting 
data. If party discipline is exercised on floor votes, the 
ideal points estimated on the assumption of indepen- 
dent spatial voting may be very biased estimates of 
legislator preferences. If party influences these estimates, it is inappropriate to use them as a control for
preferences when testing for a party effect. Snyder and


2 These authors use the Poole and Rosenthal (1991, 1997) NOMINATE methods. Both NOMINATE and the Heckman and Snyder (1997) method are parametric. The results of the two approaches are very similar, particularly on the first and second dimensions.

3 The spatial model does, however, strongly outperform a model of straight party voting.
Groseclose (2000) note this potential for bias. They propose a method for estimating unbiased ideal points and then using them to estimate the effect of party discipline.4

The basics of the one-dimensional Snyder-Groseclose model are as follows. On roll call j, legislator i, if a Republican, has an induced ideal point $x_{ij} = x_i$; if a Democrat, legislator i has an induced ideal point $x_{ij} = x_i + \gamma_j$. In other words, the true ideal points of the Democrats, the $x_i$, are displaced by the amount of party pressure given by $\gamma_j$.5 It turns out that only the relative amount of party pressure matters in the model, so the ideal points of the Republicans can just be given by their true values. For the difference in pressure to be consistent with discipline, we would expect that pressure must move Democrats in a liberal direction relative to Republicans. Thus, pressure works to increase the separation of the parties. If preferences are viewed on a scale such that the left end is liberal and the right end conservative, then we would expect $\gamma_j$ to have a negative sign.

Snyder and Groseclose argue that, because there is 
little need to apply party discipline on votes not 
expected to be close, ideological position taking should 
occur on lopsided votes. These votes, such as those 
with margins greater than 65 to 35, could be used to 
estimate the true ideal points. On these votes, the $\gamma_j$ would be zero. The true ideal points could then be used to estimate the $\gamma_j$ on close votes, such as those with margins less than 65 to 35.

In brief, their procedure is as follows. Stage 1: Use votes with margins greater than 65 to 35 to estimate the ideal points, $x_i$. Stage 2: On the remaining votes, for each roll call j, estimate the following ordinary least-squares (OLS) model:

$$Y_{ij} = \beta_0 + \beta_1 x_i + \beta_2 D_i, \qquad (1)$$

where $D_i = 1$ if legislator i is a Democrat, 0 if Republican, and $Y_{ij} = 1$ if i votes “yea,” 0 if i votes “nay.” Equation 1 is equivalent to $Y_{ij} = \beta_0 + \beta_1 x_{ij}$ when

$$\gamma_j = \beta_2/\beta_1. \qquad (2)$$

To see this, note that for Democrats equation 1 implies $Y_{ij} = \beta_0 + \beta_1 x_i + \beta_2 = \beta_0 + \beta_1(x_i + \beta_2/\beta_1)$. As we noted


dimensions, but the unidimensional case gives the intuition of their more
general model.


above, the party pressure model predicts a negative estimate for $\gamma_j$ when preferences are scaled with Democrats on the left (as we assume they are). Therefore, the two estimated $\beta$s should be of opposite sign.

This method is likely to generate the inference that
party pressure is substantial even when all voting is
preference based. Consider, for example, a six-member 
legislature with the party affiliations and spatial pref- 
erences given in Figure 1. If all voting in this legislature 
is spatial without error, there are only twelve possible 
voting configurations, which are given in Figure 2. 

Stage 1 of the Snyder-Groseclose method estimates 
a preference score using only voting patterns 1-10. But 
voters 3 and 4 cast identical votes in each of these 
patterns, so any scaling procedure will estimate them as 
having the same position. Thus, stage 1 provides biased 
estimates of the preferences of moderates. There is not 
enough information in the lopsided votes to discrimi- 
nate “left” moderates from “right” moderates. The 
preference ordering that maximizes the classification of 
votes is shown in Figure 3. 

In stage 2, the preferences in Figure 3 and party 
affiliation are used to explain vote patterns 11 and 12. 
The votes of legislators 1, 2, 5, and 6 are correctly 
classified on the basis of the preference estimates, but 
the votes of legislators 3 and 4 cannot be. Yet, since 3 
and 4 are members of different parties, adding the 
party variable to the model increases its explanatory 
power even though voting is purely preference driven. 
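The six-member example can be checked by brute force. The following is a minimal sketch, not the authors' code: the party labels are an assumption inferred from the text (no party overlap, and legislators 3 and 4 in different parties), and the names `spatial_patterns`, `is_lopsided`, and `min_errors` are illustrative. It enumerates every errorless voting pattern, separates lopsided from close votes, and counts classification errors on the close votes when legislators 3 and 4 are given the tied stage 1 position.

```python
# Sketch of the six-member example; party labels are assumed, not taken from Figure 1.
N = 6
PARTY = ["D", "D", "D", "R", "R", "R"]  # assumption: legislators 1-3 versus 4-6


def spatial_patterns(n):
    """Every vote vector that errorless one-dimensional voting can produce."""
    patterns = []
    for cut in range(n + 1):      # legislators with index < cut lie left of the cutpoint
        for left_vote in (1, 0):  # 1 = the left side votes yea, 0 = it votes nay
            votes = tuple(left_vote if i < cut else 1 - left_vote for i in range(n))
            if votes not in patterns:
                patterns.append(votes)
    return patterns


def is_lopsided(votes):
    """True when the majority exceeds 65 percent (integer arithmetic)."""
    majority = max(sum(votes), len(votes) - sum(votes))
    return 100 * majority > 65 * len(votes)


def min_errors(positions, votes):
    """Fewest classification errors over all cutpoints and both polarities."""
    distinct = sorted(set(positions))
    cuts = [distinct[0] - 1.0]
    cuts += [(a + b) / 2 for a, b in zip(distinct, distinct[1:])]
    cuts.append(distinct[-1] + 1.0)
    best = len(votes)
    for cut in cuts:
        for left_vote in (1, 0):
            predicted = [left_vote if p < cut else 1 - left_vote for p in positions]
            best = min(best, sum(p != v for p, v in zip(predicted, votes)))
    return best


patterns = spatial_patterns(N)
lopsided = [v for v in patterns if is_lopsided(v)]
close = [v for v in patterns if not is_lopsided(v)]

# Legislators 3 and 4 (indices 2 and 3) never differ on a lopsided vote,
# so stage 1 can only place them at the same position.
tied_on_lopsided = all(v[2] == v[3] for v in lopsided)
stage1_positions = [1, 2, 3.5, 3.5, 5, 6]

errors_stage1 = [min_errors(stage1_positions, v) for v in close]
errors_true = [min_errors([1, 2, 3, 4, 5, 6], v) for v in close]

# On the close votes each party votes as a bloc, so a party dummy fits them exactly.
party_fits_close = all(
    len({vote for p, vote in zip(PARTY, v) if p == party}) == 1
    for v in close
    for party in ("D", "R")
)
```

With the tied positions, each close vote carries one unavoidable error that the party label removes, even though every vote in the sketch is generated by preferences alone.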

Our example extends to larger legislatures. In gen- 
eral, with perfect spatial voting, a first stage based only 
on lopsided votes will produce identical preference 
estimates for all members in the interval between the 
35th and 65th percentiles. The second stage will pro- 
duce a spurious party effect so long as party and 
ideology are correlated within this interval.6 We now 
present an alternative procedure that maintains the 
essential features of the Snyder-Groseclose model of 


discipline. 
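The extension to larger legislatures can be illustrated the same way. This is a sketch under the stated assumption of errorless one-dimensional voting; `tied_block` is a hypothetical helper name. It lists the legislators whose votes coincide with the median member's on every lopsided roll call, which are the members a first stage based on lopsided votes cannot tell apart.

```python
def tied_block(n, threshold_pct=65):
    """Indices (0-based, left to right) of legislators who vote identically
    on every lopsided roll call under errorless spatial voting."""
    # A cut at c puts c legislators on the left side of the cutpoint.
    lopsided_cuts = [c for c in range(n + 1) if 100 * max(c, n - c) > threshold_pct * n]

    def profile(i):
        # With the left side voting yea, legislator i votes yea exactly when i < c.
        # The opposite polarity flips every vote, so it cannot break a tie.
        return tuple(i < c for c in lopsided_cuts)

    median = n // 2
    return [i for i in range(n) if profile(i) == profile(median)]


block_100 = tied_block(100)  # a hypothetical 100-member legislature
block_6 = tied_block(6)      # the six-member example
```

In the 100-member case the tied block runs from index 34 to index 65, roughly the interval between the 35th and 65th percentiles; in the six-member case it contains only legislators 3 and 4.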


A NONPARAMETRIC MODEL 


All specifications of a spatial model of voting have two 
critical elements: ideal points for the legislators and 


6 Given the assumptions of no voting error and no overlap of 
preferences between the parties, this example is somewhat special. In 
McCarty, Poole, and Rosenthal 2000, Appendix B, we present Monte 
Carlo evidence that shows how this result extends to large legisla- 
tures in which, as in the Snyder and Groseclose approach, there is 
some error in voting and the distribution of preferences of each party 

FIGURE 2. Perfect Spatial Voting in a Six-Member Legislature 
[Figure: Vote on Roll Call] 


cutpoints (or separating hyperplanes) for the roll calls. 
The Snyder-Groseclose model, with a discipline pa- 
rameter for each roll call, is isomorphic with one in 
which each party has its own cutting line (see Appendix 
A). That is, moving the ideal points for all Democrats 
to the left by a magnitude $\gamma_j$ is equivalent to moving the 
cutpoint for Democrats to the right by the same 
amount. Party discipline generally involves getting 
moderates to vote with extremists.7 Therefore, if there 
is party discipline, the cutpoint for the Democrats 
should be to the right of the cutpoint for Republicans.8 

Consider a one-dimensional spatial configuration. 


7 Poole and Rosenthal (1997, 155-7) document that there are very 
few “both ends against the middle” votes in which extremists defect. 
8 The predicted order of cutpoints is equivalent to the prediction that 
$\gamma$ is negative. 

When the cutpoint is constrained to be the same for 
both parties, this produces the standard spatial model. 
For example, in Figure 4, with a common cutpoint, 
there are three classification errors: legislators 3, 11, 
and 15. When each party can have its own cutpoint, this 
produces a model that allows for party discipline. 
Moderate Democrats to the right of some Republicans 
can vote with the majority of their party. Moderate 
Republicans to the left of some Democrats can vote 
with the majority of their party. The best cutpoint for 
the Republicans in Figure 4 remains the common 
cutpoint (the solid line). Legislator 15 is the only 
Republican classification error. But the best cutpoint 
for the Democrats is to the right of the common 
cutpoint (the dashed line), which leaves only legislator 
3 as a classification error. Rather than estimate either 

FIGURE 3. Preference Order Based on Lopsided Votes 
[Figure: legislators in estimated order 1, 2, 3 and 4 (tied), 5, 6] 
Note: Since voter 3 and voter 4 voted identically on lopsided votes, any estimation procedure will generate identical ideal points. 





the one- or the two-cutpoint model via a metric 
technique, such as Poole and Rosenthal’s (1991) 
NOMINATE or the Heckman and Snyder (1997) 
method, one can simply find the joint rank order of 
legislators and cutpoints that will minimize classifica- 
tion error. Poole (2000) presents an efficient algorithm 
that very closely approximates the global maximum in 
correct classification.9 Note that this method, in con- 
trast to equation 1, does not require a uniform adjust- 
ment in the ideal points of all party members. Only 
moderates need to be disciplined. All that is required is 
a displacement of the cutpoint. 
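The difference between a common cutpoint and party-specific cutpoints can be sketched for a fixed rank order. This is not Poole's (2000) algorithm, which also estimates the rank order; it is a brute-force search over cutpoints with the order held fixed. The party labels and votes below are hypothetical, chosen only to reproduce the error pattern the text describes for Figure 4 (legislators 3, 11, and 15 misclassified by the common cutpoint).

```python
def count_errors(votes, members, cut, left_vote):
    """Errors among `members` (indices into the rank order) when everyone
    ranked before position `cut` is predicted to vote `left_vote`."""
    return sum(votes[i] != (left_vote if i < cut else 1 - left_vote) for i in members)


def best_cut(votes, members, polarities=(1, 0)):
    """Return (errors, cut, left_vote) with the fewest errors.
    Ties are resolved by tuple order, so the leftmost cut wins."""
    n = len(votes)
    return min(
        (count_errors(votes, members, cut, lv), cut, lv)
        for cut in range(n + 1)
        for lv in polarities
    )


# Hypothetical 17-member legislature in left-to-right rank order.
parties = list("DDDDDDDRRRDDRRRRR")
votes = [1, 1, 0, 1, 1, 1, 1, 1, 0, 0, 1, 0, 0, 0, 1, 0, 0]  # 1 = yea

everyone = range(len(votes))
democrats = [i for i, p in enumerate(parties) if p == "D"]
republicans = [i for i, p in enumerate(parties) if p == "R"]

common = best_cut(votes, everyone)
# Party-specific cutpoints keep the polarity of the common cutpoint.
dem = best_cut(votes, democrats, polarities=(common[2],))
rep = best_cut(votes, republicans, polarities=(common[2],))

two_cutpoint_errors = dem[0] + rep[0]
```

In this toy case the common cutpoint falls after the eighth legislator with three errors, while the Democratic cutpoint moves right to fall after the eleventh, leaving two errors in total.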

Our empirical analysis involves not only testing the 
implications of our methodological critique of Snyder 
and Groseclose but also testing the implications of 
their model of party discipline with our two-cutpoint 
model. We begin by stating three methodological pre- 
dictions. Each is consistent with the mismeasurement 
of preferences in the Snyder-Groseclose framework 
under the hypothesis of purely spatial voting. In only 
one case is the prediction also consistent with their 
theoretical model. Therefore, verification of these re- 
lationships illustrates the inability of Snyder-Grose- 
close to distinguish party pressure from mismeasure- 
ment of preferences. The three methodological 
predictions are as follows. 


PREDICTION 1. Estimate the rank order of ideal points by 
one-dimensional optimal classification first using all 
roll call votes and then using only lopsided votes. The 
correlation between the two rank orders will be greater 
for extremists (the first and last one-third of the all-votes 
distribution) than for moderates (the middle one-third). 
This prediction is consistent with the Snyder-Groseclose 
assertion that party pressure primarily affects moderates, 
but it also follows from our claim that the ideal points of 
moderates are inaccurately recovered when only lopsided 
votes are used to estimate ideal points. 

PREDICTION 2. Similarly, when the rank order is estimated 
first on all roll call votes and second on only close 
votes, the correlation between the two rankings will be 
greater for moderates than for extremists. The motiva- 
tion for this prediction is similar to that of the first. If 
there is preference-based voting, the ideal points of 
extremists will be inaccurately recovered if only close 
votes are used to estimate ideal points. This prediction 
is inconsistent with the Snyder and Groseclose model, 
which implicitly assumes that extremists will have 
9 Although the underlying assumptions are very different, in one 
dimension this method is essentially equivalent to classical Guttman 
scaling. 


preference estimates on pressured votes similar to those 
on unpressured votes. 


PREDICTION 3. The correlation between the two rank 
orders for moderates will be higher if there is prefer- 
ence-based voting, lower if there is party discipline. 
The reason is that, if there is discipline only on close 
votes, as claimed by Snyder and Groseclose, the all- 
votes estimates will mix preference-based lopsided votes 
and disciplined close votes. The estimates of close votes 
will have more distortion of the true ideal points. 


We now turn to testing hypotheses from the party 
discipline model. In all cases, the null hypothesis of 
preference-based voting predicts no difference. 


HYPOTHESIS 1. Classification should be substantially 
higher with a two-point model than with a one-point 
model. Note that classification cannot be lower with 
the two-point model. 

HYPOTHESIS 2. The improvements in classification should 
be greater on close votes. Since the Snyder-Groseclose 
model assumes that party pressure is applied mainly on close 
votes, the incremental predictive power of the two- 
cutpoint model should be higher on those votes. 


HYPOTHESIS 3. The rank order of the legislators should 
disclose more separation of the parties in the one- than 
in the two-cutpoint model because the former ignores 
party discipline. Moving Democrats to the left and 
Republicans to the right would pick up some of the 
effects of party pressure. In contrast, in the two-cutpoint 
model, each legislator’s ideal point can take on its true 
rank order position, because the cutpoints can pick up 
the effects of party discipline. 

HYPOTHESIS 4. The separation of the cutpoints should be 
greater on close votes. The identifying assumption of 
the Snyder-Groseclose model is that party pressures are 
more likely on close votes. Therefore, under their 
assumptions, the distance between the Democratic and 
the Republican cutpoint should be greatest on those 
votes. 


HYPOTHESIS 5. The estimated cutpoint for the Democrats 
should be to the right of the estimated cutpoint for the 
Republicans. 

FIGURE 4. Cutpoint Models 
[Figure: legislators 1 through 17 in rank order, with cutpoints marked between legislators 8 and 9 and between legislators 11 and 12] 

It should be noted that some instances of party 
pressure may be masked. Consider a legislature with no 
party overlap: all Democrats are to the left of all 
Republicans. Suppose that, were there no pressure, a 
Republican bill would be rejected by a majority com- 
posed of all Democrats and some moderate Republican 
defectors. If the Republicans then apply pressure 
to the defectors, the resulting party-line vote will 
appear to be consistent with preference-based voting. 
When ideal points are estimated correctly, the true 
explanatory power of party may be masked. Indeed, 
when there is no overlap in the distribution of party 
ideal points and there is errorless spatial voting, it is 
impossible to identify party pressure effects. This mask- 
ing is inherent to spatial analysis. It would confound 
the Snyder-Groseclose model as well as our optimal 
classification method. Albeit important, the question 
we can ask is limited to whether allowing for party 
discipline can improve on a purely 
preference-based model.10 

With our optimal classification method, it is possible 
to calculate an upper bound for the amount by which 
party pressure can increase vote classification. This 
bound depends on the overlap between the two parties. 
This upper bound represents the classification on a 
strict party vote of a two-cutpoint model (perfect 
classification) minus the classification of a strict party 
vote using a single cutpoint.11 When there is no overlap 
between the parties, a single cutpoint correctly classi- 
fies a strict party vote, so there can be no classification 
gain for the two-cutpoint model. The greater the party 
overlap, the worse a one-cutpoint model does in ex- 
plaining a strict party vote. Thus, the maximum classi- 
fication gain increases in the overlap. 
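The upper bound described here can be computed directly from the party labels in rank order. The sketch below is illustrative (`max_pressure_gain` is a hypothetical name, and the two label sequences are made up): on a strict party vote the two-cutpoint model makes no errors, so the bound equals the error rate of the best single cutpoint on that vote.

```python
def max_pressure_gain(parties):
    """Upper bound on the classification gain from party-specific cutpoints,
    as a share of legislators.

    `parties` lists party labels ("D" or "R") in left-to-right rank order.
    """
    n = len(parties)
    # Strict party vote: every Democrat votes yea, every Republican votes nay.
    votes = [1 if p == "D" else 0 for p in parties]
    single_cutpoint_errors = min(
        sum(v != (left_vote if i < cut else 1 - left_vote) for i, v in enumerate(votes))
        for cut in range(n + 1)
        for left_vote in (1, 0)
    )
    # The two-cutpoint model classifies a strict party vote perfectly,
    # so its gain over one cutpoint is the single-cutpoint error rate.
    return single_cutpoint_errors / n


no_overlap = max_pressure_gain(list("DDDRRR"))    # parties fully separated
some_overlap = max_pressure_gain(list("DDRDRR"))  # one Republican left of one Democrat
```

With no overlap the bound is zero; swapping one adjacent Democrat and Republican raises it to one legislator in six.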

If we use the configurations of preferences that 
emerge from optimal one-cutpoint classification to 
measure overlap, the maximum classification gain from 
cutpoints consistent with party pressure (i.e., D > R) 


10 This focus is consistent with a key point of Krehbiel (1998). He 
argues that the main empirical question should not be whether 
parties influence legislative behavior but whether partisan models 
represent a substantial improvement over those that assume auton- 
omous legislators. 

11 Pressure beyond that necessary to generate a strict party vote 
cannot further increase classification. 

ranges from 0 in the 80th House (where there is zero 
overlap) to 16% in the 92d House. The average upper 
bound over all the Congresses we analyze is 5%. It is 
important to remember that these upper bounds are 
simply for roll calls consistent with party pressure (i.e., 
Democratic cutpoint to the right of the Republican 
cutpoint). Perfect classification is the upper bound if 
we allow other cutpoint configurations (e.g., the Re- 
publican cutpoint on the right). Furthermore, as we 
discuss below, optimal classification with a single cut- 
point will underestimate party overlap, which would 
lead to the underestimation of these upper bounds. 


TESTS USING THE NONPARAMETRIC 
MODEL 


Classification with One Cutpoint 


We begin with the three predictions concerning the 
correlation of ideal points, which we tested with actual 
data. We first performed optimal one-cutpoint classi- 
fication using all roll call House votes from the 80th 
through the 105th Congress. If the basic spatial model 
is correct, this procedure should produce a rank order 
of legislator ideal points that is very close to the true 
order. Next, we did optimal classification using only 
lopsided votes, or margins greater than 65 to 35. 
Finally, we did optimal classification using only close 
votes, or margins of 65 to 35 or less. 

We then computed Spearman correlations between 
the lopsided and the all-votes rank orders for leftwing- 
ers, the one-third of legislators farthest to the Left in 
the all-votes classification; for moderates, the middle 
one-third; and for rightwingers, the one-third farthest 
to the Right. We expect these correlations to be high 
for the extremes but low for the moderates because the 
lopsided votes provide little information about the 
ideal points of moderates (prediction 1). Conversely, in 

TABLE 1. Average Spearman Rank-Order Correlations of Legislator Ideal Points from Optimal 
Classification Analyses 

              Estimates for Lopsided Votes        Estimates for Close Votes 
              versus All Votes                    versus All Votes 
Houses        Left     Middle   Right             Left     Middle   Right 
              Third    Third    Third             Third    Third    Third 
80th-90th     .86               .91                        .97      .51 
(1947-68)     (.07)    (.30)    (.05)             (.13)    (.03)    (.17) 
91st-105th    .94      .77      .94               .54      .97      .60 
(1969-98)     (.01)    (.07)    (.07)             (.14)    (.02)    (.17) 
80th-105th    .90      .63      .93               .56      .97      .56 
(1947-98)     (.07)    (.28)    (.08)             (.14)    (.03)    (.17) 

Note: In each House, each 1/3 represents an N of at least 145. Actual N's are typically slightly larger because some districts have two distinct 
representatives serving in a given House as a result of deaths, replacements, etc. The averages are then computed as unweighted averages across the 
indicated set of Houses. Standard deviations are in parentheses. 


the case of close votes and all votes, we expect the 
correlations to be high for moderates but low for the 
extremes.12 
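The comparison by thirds can be sketched as follows. This is an illustration only: the two rank orders are made up, and the helper names `ranks`, `spearman`, and `spearman_by_thirds` are hypothetical. The Spearman correlation is computed as the Pearson correlation of ranks, separately for the left, middle, and right thirds of the all-votes ordering.

```python
def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(a, b):
    """Pearson correlation of the ranks (undefined if either input is constant)."""
    ra, rb = ranks(a), ranks(b)
    n = len(ra)
    mean_a, mean_b = sum(ra) / n, sum(rb) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(ra, rb))
    var_a = sum((x - mean_a) ** 2 for x in ra)
    var_b = sum((y - mean_b) ** 2 for y in rb)
    return cov / (var_a * var_b) ** 0.5


def spearman_by_thirds(all_votes_rank, subset_rank):
    """Correlate the two orderings within each third of the all-votes ordering."""
    n = len(all_votes_rank)
    order = sorted(range(n), key=lambda i: all_votes_rank[i])
    third = n // 3
    groups = {
        "left": order[:third],
        "middle": order[third:n - third],
        "right": order[n - third:],
    }
    return {
        name: spearman([all_votes_rank[i] for i in idx], [subset_rank[i] for i in idx])
        for name, idx in groups.items()
    }


# Made-up nine-member example: the lopsided-votes order scrambles only the middle.
all_votes = [1, 2, 3, 4, 5, 6, 7, 8, 9]
lopsided_only = [1, 2, 3, 6, 5, 4, 7, 8, 9]
by_third = spearman_by_thirds(all_votes, lopsided_only)
```

In this toy case the extremes correlate perfectly while the middle third is reversed, the qualitative pattern that prediction 1 anticipates for the lopsided-all comparison.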

The hypothesized patterns occur, as shown in Table 
1. Indeed, for the lopsided-all comparison, in every 
postwar House but one, the correlation for moderates 
is lower than for leftwingers and rightwingers.13 Table 
1 indicates that the middle correlation is particularly 
low in the years preceding passage of the major civil 
rights bills of the 1960s. In this period, an important 
second dimension (Poole and Rosenthal 1997) con- 
founds the recovery of moderate positions on the first 
dimension. When the second dimension vanishes, even 
the middle correlations are reasonably high because 
the “errors” in voting provide some information about 
moderates. That is, for example, a relatively liberal 
moderate is still less likely to vote with the rightwingers 
than is a relatively conservative moderate, even on 
lopsided votes. Nonetheless, in accord with prediction 
1, correlations for moderates are lower than for ex- 
tremists.14 
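The finding that the moderates' correlation is the lowest in all but one of the 26 Houses can be assessed with a one-sided sign test. A minimal sketch, assuming a fair-coin null under which a "success" in any House has probability one-half (the function name is illustrative):

```python
from math import comb


def sign_test_p_value(successes, trials):
    """One-sided probability of at least `successes` in `trials` fair coin flips."""
    tail = sum(comb(trials, k) for k in range(successes, trials + 1))
    return tail / 2 ** trials


# 25 successes in 26 Houses: (26 + 1) / 2**26
p_value = sign_test_p_value(25, 26)
```

The tail contains only 27 of the 2^26 equally likely outcomes, so the p-value is about 4 x 10^-7.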

As predicted, these results reverse for the close-all 
comparison. The moderates always produce a correla- 
tion above 0.9. The leftwinger and rightwinger corre- 
lations are always below 0.9, usually much below, and 
in one case the correlation is negative. 

The close-all correlations for moderates are strik- 
ingly high, as predicted by preference-based voting but 
not by voting subject to party discipline (prediction 3). 
If the party discipline effect were important, we would 
expect lower rank order correlations, particularly for 
the House before 1980, when there was still consider- 
able overlap in the ideal point distributions of the two 
parties. 

12 We focus on the rank-order correlation results since they are most 
consistent with our optimal classification approach. When we con- 
ducted each of these experiments using standard correlations, we 
found little substantive difference. 

13 A simple sign test for the observation of 25 successes in 26 trials 
has a p-value < $10^{-6}$. 

14 Since hypothesis 1 may be consistent with either party or prefer- 
ence voting, we generated Monte Carlo data that imposed prefer- 
ence voting without party discipline. These results, listed in columns 
i and j of Table A1 in McCarty, Poole, and Rosenthal 2000, show that 


under pure preference voting the correlation between the true and 
estimated preferences is lower when only lopsided votes are used. 





Classification with Two Cutpoints 


We now assess the ability of a party discipline model to 
improve on a preference-based model. Our criterion is 
percentage of votes correctly classified. 

To find the highest classification possible for a party 
discipline model, there is a simple solution: Classify 
each party separately. This allows the cutpoint on each 
roll call to adjust to pressures internal to the party. 
Because the cutpoints can adjust, the true intraparty 
rank order of the ideal points can be found. The 
classification from this model then can be compared to 
that of a single-cutpoint model. 

The outcome of this exercise appears in Figure 5, 
which shows results for a one-, two-, and six-dimen- 
sional model. The latter parallels the high dimension- 
ality used by Snyder and Groseclose in their empirical 
work. For multiple dimensions, the cutpoint is replaced 
with a separating hyperplane. 

In the case of one dimension, it is apparent that a 
two-party model adds little, particularly in recent Con- 
gresses. The improvement in the earlier Houses is at 
the level that results when a two-dimensional model 
with one cutpoint is used. The two cutpoints allow for 
southern Democrats to vote with northern Democrats 
on some issues as well as a coalition of conservative 
Republicans and southern Democrats in opposition to 
liberal Republicans and northern Democrats. Since the 
Democrats were the majority party in the conservative 
coalition era, these votes demonstrate a breakdown in 
party discipline that is exactly opposite the basic as- 
sumption of the Snyder-Groseclose model. 

In the case of two or six dimensions, when two (as 
against one) separating hyperplanes are allowed, there 
is even less improvement than in the one-dimensional 
case. The improvement is almost always less than 1% 
for all postwar Houses. The reason is that, with one 
dimension, “party” picks up some effects that can be 
accounted for just as well by a multidimensional 
preference-based voting model. The strength of the 
results in Figure 5 is further emphasized by two 
observations. First, some of the increase in fit is simply 
noise fitting due to the extra degrees of freedom. 


Second, classifying each party separately allows for 
“both ends against the middle” voting, when liberal 




The Hunt for Party Discipline in Congress September 2001 


FIGURE 5. Correct Classification in One, Two, and Six Dimensions 

Democrats and conservative Republicans vote to- 
gether. This last problem and other considerations led 
us to adopt a slightly different approach. 

The remainder of the analysis in this section follows 
a two-stage procedure. First, using optimal classifica- 
tion, we estimate a one-dimensional spatial model that 
has a single cutpoint, common to both parties. Second, 
holding the rank order positions of the legislators 
constant at the positions produced by step 1, we then 
estimate separate cutpoints for the two parties. The 
two cutpoints must be placed to maintain polarity. That 
is, unlike the separate scalings reported in Figure 5, we 
did not consider improving classification by allowing 
moderates to be opposed by extremists at both ends of 
the spectrum. Bob Barr and Maxine Waters cannot 
vote together against Connie Morella. This constraint 
is fully consistent with the Snyder-Groseclose ap- 
proach, which calls for an order-preserving shift in a 
party’s ideal point distribution but not for a flip-flop. 

The motivation for this two-step approach is that it is 
not possible to estimate jointly a single order for the 
legislators and two cutpoints for each roll call. The 
reason is that the rank order of the legislators within 
each party is pinned down only by the cutpoints for that 
party. Therefore, it is impossible to rank order either 
the legislators of a party or the cutpoints for that party 
with those for the other party. In contrast, once we fix 
the rank order of the legislators, we can estimate 
separate cutpoints and test theoretical predictions 
about them. We cannot directly test hypothesis 3, 




however, that preferences will show less party overlap 
in a one-point model than in a two-point model. That 
hypothesis could be tested only indirectly, by our test of 
prediction 3. 

To justify holding the legislators constant, we com- 
puted within-party Spearman correlations between the 
rankings of the single-cutpoint model and the rankings 
when optimal classification is applied to the party 
separately. Recall that this separate classification is 
consistent with a party pressure model: there is a true 
underlying order of ideal points, but cutpoints are 
adjusted to reflect party pressure. As Table 2 shows, 
these correlations are remarkably high. For both par- 
ties, they have exceeded 0.95 since the mid-1960s. 
(Before then, some correlations were lower due to an 
important second dimension, as noted earlier.) The 
single-cutpoint ratings, particularly for the past 30 
years, are likely to provide accurate rankings of the 
“true” ideal points within each party. 

Note that Table 2 informs us that the relative order 
of legislators within each party is insensitive to whether 
we just assume pure preference-based voting or explic- 
itly account for party pressure. The results do not rule 
out party pressure; rather, consistent with equation 1, 
party pressure is unlikely to change the relative order 
of induced ideal points. The results also do not rule out 
polarization due to party pressures, and the lack of 
overlap observed in the 1990s may be the outcome of 
party pressure. We will return to this point. 

The two-cutpoint model creates only minor gains in 

TABLE 2. Correlations of Legislator Ideal 
Points from One- and Two-Cutpoint Models 

             Average Within-Party Rank Order Correlations 
Houses       Democrats    Republicans 
80th-90th    .94          .93 
(1947-68)    (.06)        (.03) 
91st-105th   .99          .98 
(1969-98)    (.01)        (.01) 
80th-105th   .97          .96 
(1947-98)    (.05)        (.03) 

Note: The averages are unweighted averages across the indicated set 
of Houses. Standard deviations are in parentheses. 


classification of all roll call votes, which is evident in 
Figure 6. As the second dimension has diminished in 
importance, these gains have declined to less than 
0.5% in the last eight Congresses. In other words, 
adding a second cutpoint typically allows correct clas- 
sification of only an additional 2 of the 435 represen- 
tatives (assuming full turnout). Note that (1) the 
classification must get better with a second cutpoint, 
(2) the second cutpoint can just fit noise in the data 
(see Poole and Rosenthal 1997, 156), and (3) much of 
the improvement in classification occurs from using 
two cutpoints that have the Democratic cutpoint coun- 
terhypothesis to (i.e., left of) the Republican cutpoint, 
as will be seen in Table 3. An improvement of less than 
1 percent is very minor. Hypothesis 1 is not sup- 
ported. 

Figure 7 shows the gains for the two-cutpoint model 
for close and lopsided roll calls. It contains a little good 
news for party pressure advocates. The classification 
gain is greater on close votes than on all roll calls, but 
only since the mid-1960s. The evidence for the earlier 
Congresses reinforces our contention that the larger 
improvements in classification for those years, shown in 
Figure 5, are the work of a second dimension. If party 
discipline were at work, then the gain should not occur 
on lopsided roll calls, and in later Congresses there is 
systematically a greater gain on close than on lopsided 
votes. Some of the gain on close votes, however, must 
result from nondiscipline factors—such as noise fit- 
ting—that affect lopsided as well as close votes. The 
increase in the gain on close versus lopsided votes is 
roughly 1 percent, which suggests that party pressures 
are changing only about four votes per roll call on the 
close votes. At best, hypothesis 2 is weakly supported. 

Another hypothesis derived from the party pressure 
model is also weakly supported. To test hypothesis 4, 
we computed for each House the average of the 


15 An improvement of 0.5% may well be statistically significant. In 
column (k) of Table A1 in McCarty, Poole, and Rosenthal 2000, 
which shows simulations for preference-based voting, classification 
gains are shown for various one-dimensional specifications. In the 
first three (low overlap) rows, similar to actual overlap in the past 
eight Congresses, the gains range from 0.10% to 0.26%, all consid- 
erably less than 0.5%. Of course, the gains from “fitting” an extra 
hyperplane in a multidimensional model would be expected to be 
even higher. In any event, an improvement of 0.5% may lack 
substantive import. 

difference between the rank of the Democratic cut- 
point and the rank of the Republican cutpoint and then 
divided by the number of legislators in the House. This 
procedure normalized the difference in the rank orders 
to a -1 to +1 scale, so that the Houses could be more 
easily compared. We used the difference rather than 
the magnitude of the difference between the ranks 
because the party pressure model predicts that the 
Democratic cutpoint will be greater than the Republi- 
can cutpoint (D > R). 
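The normalization is simple arithmetic, sketched below with hypothetical cutpoint ranks (the function names and the example values are illustrative, not taken from the data). Dividing each difference by the House size before averaging is equivalent to averaging first and dividing afterward.

```python
def normalized_rank_difference(dem_cut_rank, rep_cut_rank, n_legislators):
    """Signed difference on a -1 to +1 scale; positive means D > R."""
    return (dem_cut_rank - rep_cut_rank) / n_legislators


def average_difference(cut_ranks, n_legislators):
    """Mean normalized difference over (dem_cut_rank, rep_cut_rank) pairs."""
    diffs = [normalized_rank_difference(d, r, n_legislators) for d, r in cut_ranks]
    return sum(diffs) / len(diffs)


HOUSE_SIZE = 435
# Hypothetical roll calls: one with D > R, one with equal ranks, one with R > D.
example_roll_calls = [(230, 210), (215, 215), (200, 210)]
average = average_difference(example_roll_calls, HOUSE_SIZE)

# A gap of 9 ranks in a 435-member House is a normalized difference of about .02.
nine_ranks = normalized_rank_difference(220, 211, HOUSE_SIZE)
```

Keeping the sign, rather than the magnitude, lets roll calls with the Republican cutpoint on the right offset those that fit the party pressure prediction.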

We classified all roll calls into three types. For the 
first type, in line with hypothesis 5, the Democratic 
cutpoint is greater than the Republican cutpoint (D > 
R). Note that whenever there is some overlap in the 
ideal point ranks of the two parties, straight party votes 
are counted as D > R. The second type is clearly 
counterhypothesis roll calls, that is, R > D. Finally, for 
many roll calls (see Table 3) the relative location of the 
two party cutpoints is ambiguous, and we term this 
third type “undecided.” Note that cutpoints interior to 
the legislators of a party can be identified for only a 
subset of roll calls.16 A portion of our analysis will be 
restricted to such roll calls.17 

When the ideal point distributions of the two parties 
have no overlap, as happened in the 80th House 
(1947-48), we cannot identify any roll calls as D > R, 
so the average difference must be less than zero. In 
contrast, when there is substantial party overlap, as in 
the 1970s, the party pressure model predicts that the 
average difference for close votes will be greater than 
zero and be greater than the average difference for 
lopsided roll calls. The average difference for lopsided 
roll calls should be near zero. The results, computed 
for all roll calls with interior cutpoints in both parties, 
appear in Figure 8. 

The average difference for the close votes is indeed 
above zero for 19 of the 26 Houses. Since the 91st 
House, however, the average difference has been very 
close to zero, hovering around .02, or an average 
difference of about 8 or 9 ranks. In only three Houses, 
all in the two-dimensional 1950s and 1960s, does the 
figure exceed 0.1, or 10% of the House membership. 
To benchmark this difference, note that the normalized 
difference or overlap between the third rightmost 
Democrat and the third leftmost Republican averages 
46% of the House membership for the 26 Houses we 
analyze; it exceeds 32% in all but the 80th, 84th, and 
100th to 105th Houses. Moreover, note that this aver- 
age difference is highly biased in favor of the party 
pressure model in that it does not include undecided 
roll calls, for example, votes on which the Republicans 
were unanimous but the Democratic cutpoint was to 
the left of the leftmost Republican. On such roll calls, 
most likely party discipline broke down among the 
Democrats, so that D < R. 

The average difference for the lopsided roll calls is 


16 A cutpoint is interior whenever at least one legislator is to the left 
of it and one legislator is to the right. Otherwise, the cutpoint is 
exterior. 

17 In Appendix B, we outline our procedure for determining roll call 
cutpoints, classifying roll calls into the three categories, and comput- 
ing the differences in ranks. 

FIGURE 6. Classification Gain of Two Cutpoints versus One Cutpoint for All Roll Call Votes 
Note: The classification gains are for a one-dimensional voting model. All representatives were scaled together, as in the “common” scaling of Figure 4. 
With the ideal point orders from the “common” scaling held fixed, a separate cutpoint was then estimated for each party. Comparison to Figure 4 shows 
that the classification gains are similar to those in the “two-party” scalings where each party has an independent rank order of ideal points as well as a 
cutpoint. 

FIGURE 7. Classification Gain of Two Cutpoints versus One Cutpoint for Close and 
Lopsided Votes 

FIGURE 8. Normalized Differences in Ranks between Party Cutpoints for Close and 
. 437/437. The “average difference” is the average of the differences between the normalized rank of the Democrat cutpoint and the normalized 
Republican cutpoint. 





negative for all 26 Houses. The negative sign probably 
reflects “both ends against the middle” voting. If the six 
most liberal Democrats and the six most conservative 
Republicans cast protest votes on final passage and 
these are the only negative votes, with fixed polarity, 
one of the party cutpoints will be near an end of the 
dimension, and the other party cutpoint will be near 
the middle. Therefore, the difference in ranks will be 
negative and large in magnitude. The negative differ- 
ences can reflect a few conservative Republicans and a 
few liberal Democrats voting against a lopsided major- 
ity. 

Hypothesis 5, which predicts that the Democratic 
cutpoint will be to the right of the Republican cutpoint, 
is not supported, as shown in Table 3. The pattern, 
except for the no or low overlap Congresses (80th and 
103d-105th), is quite stable, so we present results in 


TABLE 3. Order of Cutpoints on Close 
Roll Calls 

House               D > R 
81st-102d           41.7% 
80th, 103d-105th    18.0% 
All (80th-105th)    38.1% 







tabular form. Recall that low overlap means there will 
be very few or no roll calls with D > R. Even when 
there is overlap, the pattern runs counter to the 
Snyder-Groseclose model; R > D roll calls outnumber 
the straight party D > R roll calls by more than three 
to two. 

Table 3 lends much less support than Figure 8 to the 
party pressure model because, in many of 
the Houses, scores for a handful of southern Democrats 
fell in the midst of the Republican scores, and those of 
a handful of liberal Republicans fell in the range of 
Democratic scores. Therefore, on strict party or near 
strict party votes D > R, and the difference in ranks 
was quite large. The difference was smaller in magni- 
tude on counterhypothesis R > D votes, but such votes 
are typically a majority of the roll calls.18 

Some of the counterhypothesis R > D votes almost 
certainly indicate a true breakdown in party discipline. 
A breakdown can occur, for example, when the major- 
ity is subject to a few defections by its own moderates 
but offers bills or makes promises that buy the support 
of moderates in the opposite party. The seduction of 
minority party moderates is a scenario that seems to fit 
the two Gingrich Houses, in which, in the single- 
cutpoint analysis, the modal cutpoint fell interior to the 
Democratic Party (see McCarty, Poole, and Rosenthal 


18 In addition, the ordinal comparisons involve some roll calls with
exterior cutpoints. See Appendix B.

FIGURE 9. Ranks Shifted to Satisfy Average D = 

1997, 12). (The two Gingrich Houses are the last two
points in every plot.)

These results about cutpoints are, however, subject 
to the warning that the single-cutpoint estimation of 
ideal point ranks may show too much separation of the 
parties. We therefore calculated how far the ideal 
points of Republicans would have to shift leftward to 
reach the average difference for lopsided roll calls of 
zero. In that case, a new version of Figure 8 would have 
a flat line through zero for lopsided votes. That is, the 
shift would force the average pattern for lopsided votes 
to match the theoretical level in the Snyder-Groseclose 
model.19 

The results of this exercise are shown in Figure 9. 
When the lopsided vote difference is just slightly 
negative, as in the late 1960s, very few ranks need to be 
shifted. In these cases the close vote difference is near 
zero, and R > D roll calls outnumber D > R, so the 
Snyder-Groseclose model is not supported. When the 
lopsided vote difference is sharply negative (see Figure 
8), in the late 1940s and in the 1990s, many ranks have 
to be shifted to force the lopsided votes to show a zero 
average. In the most recent Congresses in our time
series, the change is on the order of 100 ranks, or about half





19 We thank Tim Groseclose for suggesting the adjustment. The
algorithm we developed to implement the suggestion is as follows. If
the average difference in ranks for lopsided roll calls is nonnegative,
then no shift is required. Otherwise, shift every Republican leftward
by a number of ranks equal to the absolute value of the average
difference in ranks. This procedure implicitly assumes that the ranks
are interval measurements. By shifting the Republicans leftward, we are
compressing the space. In the original estimates, the unnormalized space
will extend from 1 to N, where N is the number of scaled legislators in
the House. In the shifted estimates, the space will run from 1 to N - A,
where A is the number of ranks shifted.
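
The adjustment described in this footnote can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the function name `shift_republicans`, the list inputs, and the example ranks are all hypothetical.

```python
from statistics import mean

def shift_republicans(rep_ranks, lopsided_diffs):
    """Shift Republican ranks leftward when the average difference in
    ranks on lopsided roll calls is negative (hypothetical helper).

    rep_ranks      -- unnormalized ranks of Republican legislators
    lopsided_diffs -- per-roll-call differences in cutpoint ranks
    Returns the (possibly shifted) ranks and the number of ranks shifted.
    """
    avg_diff = mean(lopsided_diffs)
    if avg_diff >= 0:
        # Nonnegative average difference: no shift is required.
        return list(rep_ranks), 0
    shift = -avg_diff  # magnitude of the negative average difference
    return [rank - shift for rank in rep_ranks], shift

# Illustrative (made-up) numbers: the average lopsided difference is -20.
shifted, ranks_shifted = shift_republicans([300, 350, 400], [-20, -10, -30])
```

Treating the ranks as numbers that can be shifted by a common amount is the interval-measurement assumption the footnote itself flags.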
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R Condition on Lopsided Roll Calls 




















the Democratic membership. A shift of 100 ranks, 
which would place the “true” ideal points of the most 
moderate Republicans in the middle of the Democratic 
Party, is seriously lacking in face validity. The amount 
of overlap in the ideal point distribution is simply too 
great to make a party pressure model credible. In fact, 
the amount of shifting needed matches the decrease in 
party polarization in the postwar period and its in- 
crease since the late 1960s (McCarty, Poole and 
Rosenthal 1997) as measured by NOMINATE scores. 
The increasing separation of the parties is, in our view, 
much more likely to reflect fundamental political 
changes, such as a large increase in southern Republican
representatives, than an increase in party discipline
within Congress. 

Because our initial ideal point distribution has 
greater face validity than the shifted distribution, we 
use the initial distribution to ask whether party disci- 
pline makes a difference in outcomes. We assess this 
in two ways. (1) We assume the true cutpoint is the 
minority cutpoint. Pressured voters are those majority 
party voters with ideal points between the minority and 
majority cutpoints. The benchmark is that all pressure 
is exerted by the majority party. Would the outcome 
have changed if their votes were reversed? (2) We 
assume that the true cutpoint is the average of the two 
party cutpoints, which reflects equal pressure exerted 
by both parties. Pressured voters are those with ideal 
points between their party cutpoint and the average. 
Would the outcome have changed if their votes were 
reversed? 
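
The two counterfactuals can be made concrete with a small Python sketch. It is a hedged illustration, not the procedure actually run for the article: the function `outcome_changes`, the data layout, the simple-majority passage rule, and the nine-member example are assumptions introduced here.

```python
def outcome_changes(ranks, parties, votes, cutpoints, majority, equal_pressure=False):
    """Return True if reversing the pressured voters' votes flips the outcome.

    ranks     -- ideal point ranks, one per legislator
    parties   -- party label per legislator (two parties assumed)
    votes     -- 1 for yea, 0 for nay
    cutpoints -- dict mapping party label to that party's cutpoint rank
    majority  -- label of the majority party
    """
    minority = next(p for p in cutpoints if p != majority)
    if equal_pressure:
        # Assumption (2): the true cutpoint is the average of the party cutpoints.
        true_cut = (cutpoints[majority] + cutpoints[minority]) / 2
        pressured_parties = set(cutpoints)
    else:
        # Assumption (1): the true cutpoint is the minority party's cutpoint.
        true_cut = cutpoints[minority]
        pressured_parties = {majority}

    counterfactual = []
    for rank, party, vote in zip(ranks, parties, votes):
        low, high = sorted((cutpoints[party], true_cut))
        pressured = party in pressured_parties and low < rank < high
        counterfactual.append(1 - vote if pressured else vote)

    # Simple-majority passage is assumed for this illustration.
    passed = sum(votes) > len(votes) / 2
    passed_without_pressure = sum(counterfactual) > len(counterfactual) / 2
    return passed != passed_without_pressure

# Made-up example: six Democrats (ranks 1-6) vote yea, three Republicans vote nay.
ranks = list(range(1, 10))
parties = ["D"] * 6 + ["R"] * 3
votes = [1] * 6 + [0] * 3
cutpoints = {"D": 6.5, "R": 3.5}
majority_only = outcome_changes(ranks, parties, votes, cutpoints, "D")
both_parties = outcome_changes(ranks, parties, votes, cutpoints, "D", equal_pressure=True)
```

In this toy case the outcome flips under the first assumption but not under the second, which mirrors why the first assumption yields the larger percentage.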

The results vary substantially from one Congress to
the next, which in part is a function of the separation of
party ideal points. We find that, averaged across Congresses,
discipline makes a difference, under the first
assumption, on 16.97% of close roll calls (std. dev.
9.17%) and, under the second assumption, on 11.07% (std.
dev. 8.03%). These numbers are substantial, but they
are well below the proportion of significant t-statistics
reported by Snyder and Groseclose. Moreover, they
are almost certainly overestimates. One qualification is
that the first assumption is extreme, since only the
majority party exerts pressure. Another is that some of
the pressured voters might not have changed their
votes even if pressure were removed. This is because
under the null hypothesis of a single cutpoint, errors in
voting will result in some legislators who are on the
“yea” side of the cutpoint who vote “nay,” and vice
versa. Similarly, under the alternative hypothesis of two
cutpoints, there will be two types of legislators between
the cutpoints: those who are pressured and those who
vote with their party for idiosyncratic reasons. The first
assumption mistakenly counts both types of legislators
as pressured.

TABLE 4. Rank Order Positions of Legislators in Their Old and New Party

[Table body not recoverable from the source. Column headings include First Congress in New Party and Normalized Rank (In Old Party, In New Party).]

Note: The orderings were normalized to 0-1 by dividing the raw ranks by 472 for the Senate and 2,326 for the House.

This section, in summary, has established that allow- 
ing for party discipline does not make an important 
contribution to classification; and those improvements 
in classification that do occur are, more frequently than 
not, the result of using cutpoints that are inconsistent 
with the party pressure model.


IDEAL POINT CHANGES IN PARTY 
SWITCHERS 


There is little evidence that many ideal points are 
displaced on individual votes, but there is very substan- 
tial evidence that party affiliation has a strong influence 
on ideal points. To see this, we again used the proce- 
dure of Poole (2000) to obtain rank orders of the ideal 
points in separate estimations for the House and 





Senate. This time we pooled all roll calls from 1947 to
1998.20 Each member was constrained to have a constant
ideological position in his or her career, except
that party switchers were allowed to have two positions, 
one before and one after the switch. There were 472 
senators and 2,326 representatives (counting the party 
switchers as two individuals). The orderings were nor- 
malized to 0-1 by dividing the raw ranks by 472 for the 
Senate and 2,326 for the House. 

When legislators switch from Republican to Demo- 
crat, they should have a lower rank. The reverse should 
hold for Democrat to Republican switchers. There 
were 19 legislators who both switched parties and 
remained in the same chamber in the period of our 
analysis. They are listed in Table 4. In 18 of 19 cases the 
rank changed as expected. The exception is Strom 
Thurmond, whose slightly more moderate position as a 
Republican reflects his more moderate views on race 
relations in the past 20 years. A simple sign test is 
overwhelmingly significant. Induced ideal points respond
to party affiliation.21
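
The sign test can be checked with an exact binomial calculation. The sketch below is a minimal illustration that assumes a null hypothesis in which each switcher is equally likely to move in either direction; the function name `sign_test_p_value` is hypothetical, and the article does not say which variant of the test was used.

```python
from math import comb

def sign_test_p_value(successes, trials):
    """One-sided exact p-value: probability of at least `successes`
    moves in the expected direction out of `trials`, assuming each
    direction is equally likely under the null."""
    tail = sum(comb(trials, k) for k in range(successes, trials + 1))
    return tail / 2 ** trials

# 18 of the 19 switchers moved in the expected direction.
p_value = sign_test_p_value(18, 19)
```

Under these assumptions the tail probability is 20 / 2^19, which is below .0001.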

We have shown that party switchers generally move 


20 In an earlier version, we used the McCarty, Poole, and Rosenthal
(1997) DW-NOMINATE procedure to obtain metric estimates of
the magnitude of changes induced by party switching. The metric
assumptions in the NOMINATE procedure lead to sensible results;
for example, there is less distance between the median and the 9th
decile in the Gingrich Houses than between the 1st decile and the
median. We conducted the metric analysis in two dimensions.
Switchers from R to D were expected to become more negative on
the first dimension and more positive on the second, and vice versa
for D to R switchers. All movement on both dimensions was in accord
with these expectations. For more details, the earlier version can be
accessed at http://voteview.ub.edu/d011 pdf. 

21 These results are consistent with Nokken (2000), who also finds
significant changes in congressional behavior following a party
switch.

in the theoretically expected direction. Do they move
very much? The average rank movement was 0.28, or a 
jump over more than one-fourth of all legislators 
serving in the period. To benchmark this movement, 
we reran the analysis for the House allowing two 
positions not only for the party switchers but also for 
some legislators who never changed party. Specifically, 
we picked in the legislator file every 500th legislator 
among moderates (i.e., those with ideal points between 
—0.3 and +0.3) who served in at least two Houses. 
There were 15 such representatives, matching the 
number of actual switchers in the House. For each 
group of 15 we computed the average partisan switch. 
That is, for Democrats the switch was just the change in 
the coordinate, as Democratic switchers are expected
to increase their rank. For Republicans, we used the 
negative of the change. Actual switchers moved sub- 
stantially, a change in normalized rank of 0.281. On 
average, nonswitchers barely budged, moving only 
0.026 in rank. The (one-tail) t-statistic for the differ- 
ences in the means indicates a high level of statistical 
significance (p < .0001). 

This evidence is consistent with a party effect, but a 
couple of caveats are in order. First, because it is silent 
on the mechanism that generates this effect, the source 
may not be internal to the legislature. Switchers, after 
all, have to adapt to a new set of primary constituents 
and contributors as well as legislative leaders. Second, 
party switchers are obviously not a random sample of 
all legislators. In the 104th House, southern Democrats 
switched to the Republican Party for a reason: They 
wanted to reflect the increasingly conservative temperament
of their district.23 Therefore, selection bias
precludes us from suggesting that the shift in ideal
points is an unbiased estimate of party pressure. But 
even if the selection bias were severe, it is telling that 
changing party labels was deemed necessary to reflect 
changing district sentiment.24 Third, the estimates 
based on party switchers are almost certainly an up- 
wardly biased measure of the average amount of party 
discipline. Among those who do not switch there is 
probably more congruence between their personal/ 
constituency position and the party’s desires.25 In par- 


22 The representatives selected were Boland (D-MA), Johnson (D-CA),
J. Melcher (D-MT), Button (R-NY), Fallon (D-MD), Traficant
(D-OH), Matthews (D-FL), Morella (R-MD), Fountain (D-NC),
Taft (R-OH), Lloyd (R-UT), Kasten (R-WI), Haley (D-FL),
T. Corcoran (R-IL), and Zion (R-IN).

23 Yet, studies show that constituency changes do not have much
effect on the ideal points of legislators. See Poole and Romer 1993
for House redistricting and Doberman 1997 for House members who
moved to the Senate.

24 Levitt (1996) provides some indication of the relative effects of
party versus constituency factors in determining the ideal points of
switchers. For the Senate, Levitt models each senator’s ideal point
(as proxied by ADA rating) as a weighted average of personal
ideology, overall state characteristics, support group characteristics,
and the “national party line.” Although all four of these factors may
change for switchers, the main changes are likely to be in the new
national party line and in the new support group that is relevant to
campaign funding and primaries. Levitt’s results put about equal
weight on these two factors. Therefore, about half the change in the
ideal point would reflect forces internal to Congress.

25 We thank Larry Bartels for this observation.
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ticular, representatives close to the party median are 
likely to vote “correctly” without any discipline. 


CONCLUSION 


In the past decade, theorizing about the influence of 
parties and leaders on legislative behavior has out- 
stripped progress in solving difficult methodological 
and measurement problems necessary to test these 
theories. We have addressed the problems associated 
with distinguishing party effects from a null hypothesis 
of individual preference-driven behavior. We began by 
demonstrating the unattractiveness of regression-based 
procedures, such as that of Snyder and Groseclose. We 
find that these methods of estimating the effects of 
party discipline on individual roll call votes are biased 
toward exaggerating the party influence. To remedy 
these statistical problems, we incorporated the theoret- 
ical insight of Snyder and Groseclose into the spatial 
model of voting, which we estimated nonparametri- 
cally. We found that empirically a party discipline 
approach makes, at best, a marginal improvement over 
the standard spatial model. 

We do not conclude, however, that party is irrele- 
vant. Voting behavior changes fairly dramatically when 
members change parties. Party discipline, we conclude, 
is manifest in the location of the legislator’s ideal point
in the standard spatial model. It is not a strategic
variable manipulated by party whips from one roll call
to another but a part of a legislator’s overall environment
that forms her induced preferences. The “discipline”
that leads a legislator to choose a spatial location
may result as much from external pressures of
campaign donors and primary voters as from the 
internal pressures of the congressional party. 

Thus, the evidence we present does not suggest that 
a resurgence of party or party-induced institutional 
change is responsible for the greater cohesiveness of 
parties and polarized politics in Congress. Distinct 
cutting lines (or separating hyperplanes) for Demo- 
crats and Republicans do not add substantially to the 
classification success of the spatial model in the period 
after World War II. Indeed, the incremental classification
success of the second cutting line has fallen
throughout this period, both during the period of
declining polarization (1947 to about 1975) and during
its more recent resurgence.26


APPENDIX A: SHIFTS IN IDEAL POINTS 


Let $z_{jy}$ and $z_{jn}$ be the “yea” and “nay” outcomes of roll call j.
In both the Heckman-Snyder and NOMINATE methods for
estimating the spatial model, the nonrandom portion of the
utility a legislator i has for roll call outcome $z_j \in \{z_{jy}, z_{jn}\}$
can be expressed as:

$U_{ij} = f(d_{ij})$, (A-1)

where f is a negative monotonic function, and $d_{ij}$ denotes the
Euclidean distance from $x_i$, i's ideal point, to $z_j$.


26 On polarization in Congress, see Fiorina 1999; King 1998; Lowry
and Shipan 2000; McCarty, Poole, and Rosenthal 1997; and Poole
and Rosenthal 1984, 1997, 1999.

Now let the “party-pressured” ideological coordinates for
Democrats equal $x_i + y_j$. We obtain:

$d_{ijy} = \lVert (x_i + y_j) - z_{jy} \rVert$. (A-2)

But this expression for distance is identical to the expression
we would have if the ideal point were unchanged but the
“yea” roll call outcome were changed to $z_{jy} - y_j$. The
distance to $z_{jn}$ would also be unaffected if it were also
changed to $z_{jn} - y_j$. Shifting both roll call outcomes by $y_j$
also shifts the midpoint $(z_{jy} + z_{jn})/2$ by $y_j$. So, for example,
a leftward shift in the ideal points for all Democrats is 
equivalent to a rightward shift in the outcome locations and 
midpoint for Democrats. The argument extends readily to 
multidimensional shifts. Since for every ideal point shift there 
is an equivalent outcome shift, neither Heckman-Snyder nor 
NOMINATE can discriminate between a model in which a 
party alters ideal points on a roll call and one in which each 
party has its own midpoint or separating hyperplane on each 
roll call. 
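
The equivalence argued in this paragraph is easy to verify numerically. The short Python check below is an illustration only; the two-dimensional coordinates and the variable names are made up for the example.

```python
import math

# Hypothetical two-dimensional coordinates chosen only for illustration.
ideal_point = (0.2, -0.1)   # legislator's unpressured ideal point
pressure = (-0.3, 0.05)     # party-induced shift in the ideal point
yea = (0.5, 0.4)            # "yea" outcome location
nay = (-0.6, 0.1)           # "nay" outcome location

# Option 1: shift the ideal point, leave the outcomes alone.
pressured_point = tuple(x + y for x, y in zip(ideal_point, pressure))
dist_yea_shift_point = math.dist(pressured_point, yea)
dist_nay_shift_point = math.dist(pressured_point, nay)

# Option 2: leave the ideal point alone, shift both outcomes the other way.
shifted_yea = tuple(z - y for z, y in zip(yea, pressure))
shifted_nay = tuple(z - y for z, y in zip(nay, pressure))
dist_yea_shift_outcome = math.dist(ideal_point, shifted_yea)
dist_nay_shift_outcome = math.dist(ideal_point, shifted_nay)
```

Both options give the same pair of distances, so a method that sees only the distances cannot tell the two apart.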

Now consider the more general situation in which the 
amount of pressure is not equal for all members but the 
pressured ideal points maintain the same order as the 
original members, and the magnitude of the pressure, for 
Democrats, is increasing in spatial position. Moderates are 
pressured more than liberals. Since the pressure is not 
uniform, the shift in ideal points can no longer be captured by 
a simple shift in outcome locations. Nonetheless, in the map 
from the pressured ideal points back to the original ideal 
points, there will continue to be a point at which a party 
member is indifferent between voting “yea” and “nay.” Let 
this point be the pressured midpoint for the party on the roll 
call. Optimal classification should be reasonably robust in 
identifying the pressured midpoint as long as the form of 
pressure does not depart too strongly from uniform pressure. 


APPENDIX B: PROCEDURE FOR 
COMPUTING THE DIFFERENCE IN 
CUTPOINT RANKS 


We used the following procedure to determine roll call
cutpoints, classify roll calls into the three categories, and
compute the differences in ranks.

1. Optimally classify all legislators using a single cutpoint. 
Rank order the legislators from 1 to N, starting at the left. 

2. Estimate the two-cutpoint model for roll calls using the 
rank order of legislators from step 1. (Note that the 
estimation must “maintain polarity”: Classification is op- 
timal subject to making the same prediction for Demo- 
crats and Republicans to the left of their party’s cutpoint.) 

3. Every interior Democratic cutpoint must be between two
Democratic legislators. Let their ranks be i and j. The rank
of the roll call cutpoint is then given as $c_D = (i + j)/2$.
When the cutpoint is to the right of the rightmost Democrat,
denote the cutpoint by $c_D = d_R$ = rank of rightmost
Democrat. When the cutpoint is to the left of the leftmost
Democrat, denote the cutpoint by 1. The Republicans are
treated similarly; when the cutpoint is to the left of the
leftmost Republican, denote the cutpoint by $c_R = r_L$ =
rank of leftmost Republican, and to the right of the
rightmost Republican, denote the cutpoint by N.

4. Score the roll call as follows.
a. If $c_D = 1$ and $c_R > r_L$, or if $1 < c_D < d_R$ and $c_R > r_L$
and $c_D < c_R$, score the roll call D < R.
b. If $1 < c_D$ and $c_R < N$ and $c_D > c_R$, score the roll call
D > R.
c. Otherwise, the roll call is “undecided.”

5. For roll calls with interior cutpoints in both parties, the
difference in ranks is $c_D - c_R$. Roll calls with one or more
party cutpoints exterior are excluded from the difference
in ranks computations (Figure 8). (Thus, more roll calls
are included in the ordinal comparisons under step 4.)
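
Steps 4 and 5 can be sketched as two small Python functions. This is one reading of the scoring rules, offered as an illustration rather than the authors' code: the names `score_roll_call` and `rank_difference`, the argument names, and the example ranks are assumptions.

```python
def score_roll_call(c_d, c_r, d_right, r_left, n):
    """Classify a roll call by the order of the party cutpoints (step 4).

    c_d, c_r -- Democratic and Republican cutpoint ranks
    d_right  -- rank of the rightmost Democrat
    r_left   -- rank of the leftmost Republican
    n        -- number of scaled legislators
    """
    if (c_d == 1 and c_r > r_left) or (
        1 < c_d < d_right and c_r > r_left and c_d < c_r
    ):
        return "D < R"
    if 1 < c_d and c_r < n and c_d > c_r:
        return "D > R"
    return "undecided"

def rank_difference(c_d, c_r, d_right, r_left, n):
    """Difference in ranks (step 5), or None if either cutpoint is exterior."""
    if 1 < c_d < d_right and r_left < c_r < n:
        return c_d - c_r
    return None

# Made-up House: 435 members, rightmost Democrat at rank 260,
# leftmost Republican at rank 180.
first = score_roll_call(200.5, 250.5, 260, 180, 435)
second = score_roll_call(240.5, 210.5, 260, 180, 435)
third = score_roll_call(1, 180, 260, 180, 435)
gap = rank_difference(240.5, 210.5, 260, 180, 435)
exterior = rank_difference(260, 180, 260, 180, 435)
```

Keeping the ordinal score and the rank difference in separate functions reflects the point of step 5: a roll call with an exterior cutpoint can still be scored in step 4 even though it contributes no rank difference.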
Estimating Party Influence on Roll Call Voting: Regression Coefficients versus Classification Success


JAMES M. SNYDER, JR. Massachusetts Institute of Technology 
TIM GROSECLOSE Hoover Institution and Stanford University 


moderates often vote with the extremists; and (2) 


mness in voting; as a consequence, on a lopsided vote 
our data set includes some roll calls that require a 


supermajority for passage; for these we define a 50%-50% roll call as lopsided. We also show that the 
classification-success method of McCarty, Poole, and Rosenthal severely understates the presence of party 
influence. Furthermore, we show that a proper interpretation of some of their results reveals a significant 


amount of party influence in Congress. 


McCarty, Poole, and Rosenthal (2001) do three 

things. First, they critique the estimation strategy 
proposed in Snyder and Groseclose (2000), arguing 
that it overstates the actual amount of party influence 
on roll call voting. Second, they propose a different 
approach for estimating party influence, which com- 
pares the classification success achieved using a one- 
cutline model and a two-cutline model that allows 
parties to have distinct cutlines on each roll call. Third, 
they study congressional party switchers and show that 
they quickly change their roll call voting behavior, 
moving sharply away from the center of their old party 
and toward the center of their new party. This is 
interpreted as evidence that “party discipline... is 
manifest in the location of the legislator’s ideal point” 
(McCarty, Poole, and Rosenthal 2001, 686). 

In this response, we do four things. First, we counter 
the main criticism of our estimation strategy, showing 
that it does not measure poorly the ideal points of 
moderates. Second, we discuss an independent mea- 
sure of ideal points, which is not derived from roll calls, 
that produces results remarkably similar to those ob- 
tained when we estimate ideal points using lopsided 
roll calls. Third, we show that the basic conclusions in 
our earlier article are not driven by functional form. 
Specifically, we obtain similar results whether we use a 
linear model (as in our earlier article) or Poole and 
Rosenthal’s W-NOMINATE model. Finally, we show 
that the method proposed by McCarty, Poole, and 
Rosenthal is severely biased toward not finding any 
party influence. It will often fail to detect such influ- 
ence even when it is quite strong. In fact, some simple 
calculations suggest that there is no contradiction 
between the findings reported by McCarty, Poole, and 
Rosenthal and our earlier article. 


I “The Hunt for Party Discipline in Congress,” 


James M. Snyder, Jr., 1s Professor of Political Science and Econom- 
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LOPSIDED ROLL CALLS PROVIDE AN 
ACCURATE MEASURE OF IDEAL POINTS 


We will first respond to McCarty, Poole, and 
Rosenthal’s key example, which describes conditions 
that cause our method to yield poor ideal point esti- 
mates for moderate legislators. We will show that these 
conditions are not empirically relevant. Next, we will 
present independent evidence that supports our use of 
lopsided roll calls to estimate ideal points. We then will 
show that some of the puzzling findings reported by 
McCarty, Poole, and Rosenthal are actually consistent 
with the existence of party influence. 


Congressional Roll Call Voting Is
Probabilistic


McCarty, Poole, and Rosenthal (2001, Figure 1) 
present a simple example to show how the lopsided roll 
call estimation strategy we propose cannot distinguish 
among moderate legislators.1 Because their example
assumes errorless voting, there cannot be any roll calls 
on which, say, legislator 3 votes on the left side of the 
issue but legislator 1 votes on the right side. In practice, 
however, such cases occur frequently. To demonstrate, 
we gathered data for the 90th-105th Congresses (1967- 
98) and constructed a list of moderate House members 
based on Heckman and Snyder’s (1997) linear factor 
model.2 Specifically, we rank representatives by their
first-dimension scores and call them “moderate” if they 
fall between the 45th and 55th percentiles.3 

We then examined all roll calls on which 68-72% of 


1 See Snyder 1992 for an earlier study of a closely related phenom- 
enon. 

2 Using Poole and Rosenthal’s W-NOMINATE scores to rank 
representatives produces similar results. 

3 If McCarty, Poole, and Rosenthal are correct, then ideal point
estimates should be based on all roll calls (not just lopsided roll
calls), so the ranks shown on the horizontal axis are based on all roll
calls. Of course, if McCarty, Poole, and Rosenthal are correct and
voting is errorless, then scores from metric scaling techniques,
including the linear model and NOMINATE, are inaccurate, since
these methods are based on probabilistic voting models. We would
prefer to use the ranks produced by McCarty, Poole, and Rosenthal's
one-cutline method, but do not have them. In any case, we strongly
doubt it would affect the results, given the high correlations typically
found among various first-dimension scalings.
the House voted on the winning side. On the one hand,
if the House fits McCarty, Poole, and Rosenthal’s 
example—a one-dimensional spatial model with error- 
less voting—then a moderate member will never vote 
on the losing side. On the other hand, if the House fits 
a world in which all legislators vote completely ran- 
domly, then each moderate will vote on the losing side 
about 30% of the time. Our findings show that the 
actual world of the House is halfway between these two 
hypothetical worlds. In the Congresses sampled, moderates
voted on the losing side about 16% of the time.4
Thus, unlike the McCarty, Poole, and Rosenthal exam- 
ple, moderates frequently vote on the losing side of an 
issue (about half as often as extremists do). Therefore, 
our strategy of estimating ideal points only from lop- 
sided votes can make distinctions among moderate 
legislators. 

Next, if the McCarty, Poole, and Rosenthal example
is empirically accurate, and congressional roll call 
voting is not probabilistic, then using only lopsided roll 
calls will produce the same ideal point estimate for all 
legislators with an ideal point between the 35th and 
65th percentiles. Thus, if we order legislators according 
to their ideal point rank on the horizontal axis and plot 
the estimated ideal points based on lopsided roll calls 
on the vertical axis, the resulting graph will appear as in 
Figure 1A.5 The actual pattern is quite different, how- 
ever. For instance, in Figure 1B we plot for the 103d 
House (1993-94) the rank of representatives’ ideal 
points, on the horizontal axis, and the ideal point 
estimates based on lopsided roll calls, on the vertical 
axis.6 The relationship is nothing like the inverted 
S-shape that appears in Figure 1A. Most important, the 
slope of the figure in the moderate range of ideal 
points is far from horizontal. Thus, it is very different 
from the prediction of the McCarty, Poole, and 
Rosenthal example. In fact, the slope is much more 
similar to the slope that occurs in the two extremist 
ranges of the figure than it is to being horizontal. This 
is exactly the result that one should expect if our 
method estimates moderate and extremist ideal points 
with approximate equal accuracy. The patterns are 
similar in other Congresses.7

These results show that congressional roll call voting 
is best described by a probabilistic rather than an 


4 The percentages range from 8.4% in the 100th Congress to 22.6%
in the 95th Congress. On average, there were 58 roll calls with
68-72% of the House on the majority side.

5 This assumes the number of roll calls with cutlines to the left of the
35th percentile is approximately equal to the number with cutlines to
the right of the 65th percentile.

6 Again, the ranks are based on the Heckman and Snyder (1997)
linear factor scores computed using all roll calls.

7 Another implication of the McCarty, Poole, and Rosenthal example
is that our method should (incorrectly) find similar amounts of
party influence across all first-dimension issues. In fact, in our earlier
article we found that party influence is frequent on items such as
budget bills, tax bills, and rules on bills, but almost nonexistent on
issues such as abortion, school prayer, and gun control. (Examination
of classification success and cutline angles shows that these are all
basically first-dimension issues.) This heterogeneity strongly suggests
that forces other than preferences influence representatives’ roll call
voting on at least some issues, and party is probably one of these
forces.




September 2001 


errorless voting model. They also show that when 
voting is probabilistic, moderates do not all vote alike,
even when the cutline is far from the center of distri- 
bution. In fact, the law of large numbers implies that if 
voting is probabilistic, and if there are “enough” roll 
calls, then all legislators’ ideal points can be accurately 
estimated, regardless of the distribution of cutlines. 
How many is enough? In our earlier article we ran a 
number of simulations and found that 200 or even 100 
lopsided roll calls is usually sufficient to obtain accurate
estimates of both moderates and extremists (Snyder 
and Groseclose 2000, Appendix A).8 

In fact, in previous work, Poole and Rosenthal argue 
forcefully that congressional roll call voting looks prob- 
abilistic. They are also highly critical of empirical 
applications of deterministic voting models. For exam- 
ple: “Perfect (errorless) spatial voting is not the appro- 
priate benchmark with which to compare scaling 
results. ... In a nutshell, while roll call voting is nicely 
captured by a spatial model, there is significant non- 
spatial, random-looking behavior” (Rosenthal 1992, 
31-2). Similarly: “The null model of errorless voting in 
seven or fewer dimensions is not even remotely sup- 
ported by the data” (Poole and Rosenthal 1991, 956). 
More recently: “Even with two dimensions, however, 
about 15 percent of the individual votes fail to fit a 
simple structure....A probabilistic model accounts 
for this structure” (Poole and Rosenthal 1997, 20). We 


agree. 

Finally, the McCarty, Poole, and Rosenthal example
ignores another factor that allows our method to 
distinguish among moderates on lopsided roll calls. 
This is that some roll calls in our sample, such as a 
suspension of the rules of the House and veto over- 
rides, require a supermajority. On these we define a 
lopsided roll call as one where the winning side re- 
ceived less than 51.7% or more than 81.7% of the vote. 
Thus, for these roll calls a 50%-50% division qualifies
as lopsided. Consequently, in most Congresses there 
are actually several lopsided roll calls that have cutlines 
within the group of moderate ideal points. 


An Independent Preference Measure: The 
NPAT Survey 


Ansolabehere, Snyder, and Stewart (n.d.) use survey 
data from the National Political Awareness Test 
(NPAT) to measure the ideal points of House mem- 
bers and test for the independent influence of party on 
roll call voting. They study the 103d—105th Congresses 
and find a large amount of party influence. More 
important, they find that the results obtained using 
NPAT-based estimates are very similar to the results 
obtained using lopsided roll calls.9





8 As Snyder (1992) points out, one problem with interest group
ratings is that they are usually based on only 10-25 roll calls. With so
few, the distribution of scores may be quite sensitive to the distribution
of cutlines.

9 The NPAT survey is administered by Project Vote Smart, which
can be found at http://www.vote-smart.org. See Erikson and Wright
(1997) and Ansolabehere, Snyder, and Stewart (n.d.) for details
about the survey.

FIGURE 1. McCarty, Poole, and Rosenthal’s Example
A: How Ideal Points and Ranks Should Appear if the Example Is Accurate
B: How Ideal Points and Ranks Actually Appear (103d Congress)
[Plots not reproduced. Horizontal axis: Rank of Roll Call Score. Vertical axis: ideal point estimate based on lopsided roll calls.]






Here, we present some other comparisons between 
the ideal point estimates based on NPAT and lopsided 
roll calls. As in our earlier article, we use Heckman and 
Snyder’s (1997) linear factor model to estimate repre- 
sentatives’ ideal points. We applied this model to five 
different sets of data: the 1996 and 1998 NPAT surveys 
and the roll call voting records of the 103d, 104th, and
105th Congress. We also conducted a parallel analysis
using Poole and Rosenthal’s W-NOMINATE model in 
place of the linear model. 

We focus on the first dimension. Let $X_{ALL}$ be the
ideal point estimates based on all roll calls, let $X_{LOP}$ be
the estimates based on lopsided roll calls, and let $X_{NPAT}$
be the estimates based on the NPAT responses. We
match the ideal points derived from the 1996 NPAT
with the roll calls for the 103d and 104th Congress, and
those derived from the 1998 NPAT with the roll calls
for the 105th Congress.10

Figure 2 shows plots of $X_{ALL}$ versus $X_{NPAT}$ and $X_{LOP}$
versus $X_{NPAT}$ for the 103d and 105th Congress (the
plots for the 104th Congress are similar). The relation-
ship between $X_{NPAT}$ and $X_{ALL}$ is strongly positive, but
it exhibits a large “jump,” which is clearly related to
party affiliation. In contrast, the relationship between
$X_{LOP}$ and $X_{NPAT}$ is not only strong and positive but also
quite smooth, nearly linear in fact. Clearly, if $X_{NPAT}$
represents an accurate estimate of members’ first-
dimension ideal points, then $X_{LOP}$ does as well.

Table 1 presents correlations among the variables
$X_{ALL}$, $X_{LOP}$, and $X_{NPAT}$. The correlations are high,
which is not surprising in view of Figure 2. More
important, in each Congress the correlation between
$X_{NPAT}$ and $X_{LOP}$ is noticeably higher than the correla-
tion between $X_{NPAT}$ and $X_{ALL}$. This is true even for
“moderate” legislators.11 It is also true whether the
estimates of $X_{ALL}$, $X_{LOP}$, and $X_{NPAT}$ are obtained using
the linear factor model or W-NOMINATE (compare
the top and bottom halves of the table).
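
The comparison summarized in Table 1 can be sketched in a few lines of Python. This is an illustration only, not the authors' code or data: the synthetic scores, the size of the party "jump," the noise levels, and the function name `npat_correlations` are all assumptions made for demonstration. The sketch correlates an NPAT-style score with an all-roll-call score and with a lopsided-roll-call score, for all members and for "moderates" (the middle 50% of the NPAT-style score).

```python
import numpy as np

def npat_correlations(x_npat, x_all, x_lop, moderates_only=False):
    """Correlate an NPAT-style score with two roll-call-based scores."""
    x_npat, x_all, x_lop = (np.asarray(v, dtype=float) for v in (x_npat, x_all, x_lop))
    if moderates_only:
        # Keep the middle 50%: drop the upper and lower quartiles of x_npat.
        lo, hi = np.percentile(x_npat, [25, 75])
        keep = (x_npat >= lo) & (x_npat <= hi)
        x_npat, x_all, x_lop = x_npat[keep], x_all[keep], x_lop[keep]
    return {
        "npat_all": float(np.corrcoef(x_npat, x_all)[0, 1]),
        "npat_lop": float(np.corrcoef(x_npat, x_lop)[0, 1]),
    }

# Hypothetical data: a latent preference, a party label correlated with it,
# and a party "jump" that contaminates only the all-roll-call score.
rng = np.random.default_rng(0)
n = 2000
prefs = rng.normal(size=n)
republican = (prefs + 0.5 * rng.normal(size=n) > 0).astype(float)
x_npat = prefs + 0.1 * rng.normal(size=n)
x_lop = prefs + 0.1 * rng.normal(size=n)
x_all = prefs + 1.0 * republican + 0.1 * rng.normal(size=n)

everyone = npat_correlations(x_npat, x_all, x_lop)
moderates = npat_correlations(x_npat, x_all, x_lop, moderates_only=True)
print(everyone)
print(moderates)
```

Under these assumed inputs the party jump lowers the correlation with the all-roll-call score relative to the lopsided-roll-call score, which is the qualitative pattern the text describes.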

The conclusion is simple: The ideal point estimates 





10 The ideal points estimated using NPAT are quite stable over time.
For the 135 representatives who completed both the 1996 and 1998
NPAT, the correlation between the two scores is more than .97.

11 Moderates are those whose $X_{NPAT}$ score lies in the middle 50%,
obtained by eliminating the upper and lower quartiles.
FIGURE 2. Ideal Point Estimates Using NPAT, All Roll Calls, and Lopsided Roll Calls
obtained using lopsided roll calls are very similar to
those obtained using the NPAT survey. In fact, the 
estimates derived from the NPAT survey are more 
similar to those based on lopsided roll calls than they 





TABLE 1. Comparison of Ideal Point
Estimates, Linear Model and W-NOMINATE

                                 Corr. between             Corr. between
                                 $X_{NPAT}$ and $X_{ALL}$  $X_{NPAT}$ and $X_{LOP}$
Linear Model, All Reps.
  103d Congress (N = 132)             .92                       .96
  104th Congress (N = 170)            .92                       .94
  105th Congress (N = 176)            .92                       .94
Linear Model, Moderates
  103d Congress (N = 65)              .85                       .90
  104th Congress (N = 84)             .82                       .85
  105th Congress (N = 87)             .84                       .86
W-NOMINATE, All Reps.
  103d Congress (N = 132)             .91                       .95
  104th Congress (N = 170)            .90                       .96
  105th Congress (N = 176)            .90                       .95
W-NOMINATE, Moderates
  103d Congress (N = 65)              .80                       .92
  104th Congress (N = 82)             .81                       .90
  105th Congress (N = 86)             .80                       .87
TABLE 2. Comparison of Party Influence Estimates, Linear Model and W-NOMINATE









are to those based on close votes. Moreover, this is true 
for moderate representatives as well as extremists. 

These results are strongly contrary to what we would 
expect if there were no party influence on any roll calls. 
In a world with no party influence, we would expect 
$X_{ALL}$ to be a better estimate of members’ true ideal
points than $X_{LOP}$, if only because it is based on a much
larger sample. So, assuming that the NPAT responses 
yield a good measure of ideal points, the results in 
Table 2 and Figure 2 strongly suggest the presence of 
some party influence (or some other influence that is 
correlated with party). Something is “distorting” mem- 
bers’ choices on roll calls that are not lopsided. 

Finally, it is important to note that the distribution of 
implied “cutlines” on the NPAT questions is relatively 
uniform. Thus, it is not true that there is a high 
correlation between $X_{NPAT}$ and $X_{LOP}$ simply because
NPAT has few questions with cutlines near the ideal 
points of moderates. NPAT includes many items on 
which respondents split nearly evenly in their answers. 
For example, in 1996 nearly one-third of the items (88 
out of 268) were “close” questions, to which 35-65% of 
the representatives answered “yes.” 13 


12 It is possible that many respondents tailored their NPAT re-
sponses in order to make themselves appear more moderate than
they really are. We find it remarkable, however, that the first-
dimension scores recovered from the NPAT responses are so similar
to those recovered from the lopsided roll calls. It seems that a more
likely explanation for the close correspondence is that both are fairly
good measures of representatives’ actual (revealed) ideal points.
13 This ignores extremely lopsided questions to which more than 95%
answered “yes” or “no.”


Dependent Variable = $X_{ALL}$

                    103d Congress                104th Congress               105th Congress
Linear Model
$X_{NPAT}$     [illegible]    -        -       .38      -        -       .42      -         -
                 (.02)                         (.02)                     (.02)
$X_{LOP}$          -         .51  [illegible]   -       .45      .46      -   [illegible]  .44
                            (.02)    (.01)             (.02)    (.01)           (.02)     (.01)
Republican       1.24       1.03     1.13      1.33    1.16     1.16     1.23    1.10     1.17
                 (.04)      (.04)    (.03)     (.03)   (.04)    (.02)    (.04)   (.05)    (.03)
Constant          .62        .51      .87       .54     .47      .54      .57     .51      .56
                 (.03)      (.02)    (.01)     (.02)   (.02)    (.01)    (.02)   (.02)    (.01)
N                 132        132      427       170     170      426      176     176      425
$R^2$             .98        .98      .98       .98     .99      .99      .98     .98      .98
W-NOMINATE
$X_{NPAT}$


MANY FINDINGS OF McCARTY, POOLE, 
AND ROSENTHAL ARE CONSISTENT WITH 
PARTY INFLUENCE 


McCarty, Poole, and Rosenthal claim that their Figure 
8 implies very little party influence on close votes. For 
instance, they note that since the 91st House (1969-70) 
the average difference between Republican and Dem- 
ocratic cutlines on close roll calls has been about .02, or 
a distance of only about 8 or 9 out of 435 ideal points. 
But their results for lopsided roll calls tell a very 
different story: The average distance between the 
Democratic and Republican cutlines is -.15. This
translates into a distance of about 65 ideal points of 
House members. Most interesting is the sign of this 
measure, which indicates negative party influence. That 
is, their findings for lopsided roll calls are consistent 
with, for instance, a Democratic leader who pressures 
the rank-and-file Democrats to vote for the conserva- 
tive alternative. Also interesting is the fact that the 
party influence that McCarty, Poole, and Rosenthal 
find on lopsided roll calls is many times the effect they 
find on close roll calls. 
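
The conversions in this paragraph are consistent with multiplying the cutline gap by the size of the House. The short Python sketch below reproduces that arithmetic. The scaling rule is inferred from the figures quoted in the text (.02 against 8 or 9 ideal points, .15 against about 65) rather than taken from McCarty, Poole, and Rosenthal, and the function name is hypothetical.

```python
HOUSE_SIZE = 435  # number of ideal points (members) in the House

def cutline_gap_in_ranks(gap, chamber_size=HOUSE_SIZE):
    """Express a cutline gap, read as a share of the rank scale, in ranks."""
    return abs(gap) * chamber_size

close_ranks = cutline_gap_in_ranks(0.02)      # about 8.7 ranks
lopsided_ranks = cutline_gap_in_ranks(-0.15)  # about 65 ranks
ratio = lopsided_ranks / close_ranks          # lopsided gap relative to close gap

print(round(close_ranks, 1), round(lopsided_ranks, 1), round(ratio, 1))
```

On this reading, the measured gap on lopsided roll calls is between seven and eight times the gap on close roll calls.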

These findings are puzzling. In two important re-
spects they contradict the way it seems a rational party
leader would act. First, they suggest that party leaders
concentrate their resources not on close roll calls but
on lopsided roll calls. Second, they suggest that on
lopsided roll calls leaders pressure legislators to vote
against their party, not with it.

14 Alternatively, as McCarty, Poole, and Rosenthal note, this result
could occur if Democratic leaders pressure the Republican rank-and-
file to vote in the liberal direction. Yet, for the effect to cause a [...]
Republicans more strongly than [...] matter what the reason for the
negative sign, it indicates that the rank-and-file is pressured to vote
against the direction its party prefers.

The most likely explanation for these puzzling find- 
ings is not that party leaders are acting irrationally but 
that McCarty, Poole, and Rosenthal use an inappro-
priate baseline in their analysis. They note that a
two-cutline method cannot identify the overlap in ideal 
points of the two parties. Consequently, to measure 
this overlap, McCarty, Poole, and Rosenthal use a 
one-cutline model in the first stage of their method. 
This forms the baseline used in the second stage to 
measure party influence. They implicitly assume that 
the baseline reflects no party influence, but they use all 
roll call votes to form this baseline. If parties exert 
some influence on some of these roll calls, then the 
baseline reflects ideal points that are shifted from their 
true values by an amount that corresponds to the 
average degree of party influence. Thus, if party influ-
ence exists, then the baseline is tainted with that
influence and cannot be used to measure it. Using this 
baseline to assess party influence on a subset of roll 
calls will understate the true degree of influence when 
it exists, and there will be a spurious finding of negative 
influence when it does not exist. 

We believe this is exactly what happens in McCarty, 
Poole, and Rosenthal’s Figure 8. Since the true amount 
of party influence on lopsided roll calls is zero, or at 
least below the average (baseline) amount, the method 
spuriously shows negative influence. Next, although the 
method shows positive influence on close roll calls, it 
shows not the true degree but the difference between 
the true degree and the average (baseline) degree. 
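
The baseline argument can be made concrete with a stylized calculation. The Python sketch below assumes, purely for illustration, that the one-cutline baseline absorbs the average party influence across all roll calls and that measured influence on any subset equals true influence minus that average. The additive form, the function name, and the input values are assumptions, not estimates from either article.

```python
def measured_influence(true_close, true_lopsided, share_close):
    """Influence measured against a baseline that pools all roll calls.

    true_close, true_lopsided: hypothetical true party influence on each type.
    share_close: hypothetical share of roll calls that are close.
    """
    baseline = share_close * true_close + (1.0 - share_close) * true_lopsided
    return {
        "baseline": baseline,
        "close": true_close - baseline,        # understates true influence
        "lopsided": true_lopsided - baseline,  # spuriously negative
    }

result = measured_influence(true_close=0.20, true_lopsided=0.0, share_close=0.5)
print(result)
```

With no true influence on lopsided roll calls, the measured value there is negative, while the measured value on close roll calls is smaller than the true value. The difference between the two measured values still equals the true difference.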

It is likely that McCarty, Poole, and Rosenthal’s 
Figure 9 does reflect a better baseline and, therefore, 
approximately captures the true extent of party influ- 
ence in Congress. In that figure McCarty, Poole, and 
Rosenthal have shifted ideal points so that on lopsided 
roll calls the two parties have identical cutlines. This 
effectively sets the baseline as the ideal points that one 
would estimate from a sample of lopsided roll calls. 
Figure 9, like our results, indicates a large amount of 
party influence in Congress. For the average Congress 
it indicates that party influence shifts ideal points by 
about 50 ranks. If no overlap exists between the parties, 
then this is about half the effect of pulling the median 
member of the House to the ideal point of the median 
of the majority caucus. If there is significant overlap, 
and the parties are distributed fairly uniformly within 
the overlap region, then 50 ranks translates into about 
one-quarter the effect of pulling the median of the 
House to the median of the majority party. 

McCarty, Poole, and Rosenthal (2001) note that in 
the most recent Congresses “the order of change is of 
100 ranks, or about half the Democratic membership” 
(p. 684). They note that this places “the most moderate 
Republicans in the middle of the Democratic Party” 
(p. 684), which “is seriously lacking in face validity” 
(p. 684). Four points should be noted in response. 
First, this is the maximum effect they find among
modern Congresses, and it occurs only recently, in
Congresses noted for their partisanship. Second, a shift
of 100 ranks does not necessarily imply that the major-
ity party is pulling its members 100 ranks. It could arise
because, say, the majority party pulls its members 60
ranks in one direction, and the minority party pulls its
members 40 ranks in the other direction. Third, a shift
of 100 ranks, even if only one party is exerting pressure, 
is not overly large. For instance, if there is enough 
overlap, it is about two-thirds the effect of a “strong” 
party, that is, a party that can make its most moderate 
member vote like the median of the party. Fourth, 
the effect is smaller than McCarty, Poole, and 
Rosenthal’s own estimate of the effect of a switch in 
party. Specifically, when a switch occurs, the switcher’s 
ideal point shifts on average 28% of the total ranks, 
which in the House translates to 122 legislators. 


THE LINEAR FUNCTIONAL FORM 
DOES NOT DRIVE OUR RESULTS: 
A COMPARISON WITH W-NOMINATE 


We now show that the basic conclusions in our 
earlier article are not simply the result of the spe- 
cific functional form used in that analysis. We do this 
by comparing the linear model with Poole and 
Rosenthal’s W-NOMINATE model. We also complete 
the analysis of the previous section by showing that the 
estimates of party influence are the same whether we 
use the lopsided roll calls or the NPAT responses to 
estimate ideal points. 
Consider estimates of the following equations: 


$X_{ALL} = b_0 + b_1 X_{LOP} + c \cdot \mathrm{Republican} + e$; and
$X_{ALL} = b_0 + b_1 X_{NPAT} + c \cdot \mathrm{Republican} + e$.


Table 2 presents the estimates of these equa-
tions for the 103d-105th Congress. In the top half of
the table, $X_{ALL}$, $X_{LOP}$, and $X_{NPAT}$ are all estimated
using the linear factor model. In the bottom half,
W-NOMINATE is used. To facilitate comparisons, we
standardize $X_{ALL}$, $X_{LOP}$, and $X_{NPAT}$ so that each has a
mean of zero and a standard deviation of one in each
Congress. Thus, the estimates of $b_1$ represent the
expected change in $X_{ALL}$, measured in standard devi-
ations, produced by a change of one standard deviation
in $X_{NPAT}$ or $X_{LOP}$. We do not standardize the Repub-
lican dummy variable, so the estimates of $c$ represent
the expected change in $X_{ALL}$, measured in standard
deviations, of switching parties.
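
A minimal sketch of this regression setup follows, using ordinary least squares in NumPy. It is not the authors' estimation code: the data are synthetic, the function names `standardize` and `party_influence_ols` are hypothetical, and standard errors are omitted. The preference measure passed in plays the role of either $X_{LOP}$ or $X_{NPAT}$.

```python
import numpy as np

def standardize(x):
    """Rescale to mean zero and standard deviation one."""
    x = np.asarray(x, dtype=float)
    return (x - x.mean()) / x.std()

def party_influence_ols(x_all, x_pref, republican):
    """Regress standardized X_ALL on a standardized preference measure
    and an unstandardized Republican dummy; return b0, b1, and c."""
    y = standardize(x_all)
    x = standardize(x_pref)
    r = np.asarray(republican, dtype=float)
    design = np.column_stack([np.ones_like(x), x, r])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return {"b0": float(coef[0]), "b1": float(coef[1]), "c": float(coef[2])}

# Hypothetical data: preferences differ by party, and the all-roll-call
# score carries an additional party shift of 0.8 raw units.
rng = np.random.default_rng(0)
n = 400
republican = rng.integers(0, 2, size=n)
prefs = rng.normal(loc=republican * 1.0, scale=1.0, size=n)
x_all_raw = prefs + 0.8 * republican + rng.normal(scale=0.1, size=n)

estimates = party_influence_ols(x_all_raw, prefs, republican)
print(estimates)
```

Because the dependent variable is standardized, the coefficient on the dummy is read in standard deviations of the all-roll-call score, as in the text.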





15 To see this, consider the following example. Suppose there are 218
Republicans and 217 Democrats. Suppose that the 167 most liberal
members are Democrats. For the rest, Republicans and Democrats
alternate in their ranks. That is, the 168th most liberal member is a
Republican, the 169th most liberal is a Democrat, and so on. This
means that the 267th most liberal member is a Democrat, and all
remaining members are Republicans. The median of the Republi-
cans is the 326th most liberal member. To achieve “strong” party
influence, the Republican leaders must make the 168th member vote
like the 326th member, a shift of 158 ranks. A shift of 100 ranks is a
little less than two-thirds this effect.
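
The example in this note can be checked by building the chamber directly. The Python sketch below follows the note's description. The one added assumption is that, with an even number of Republicans (218), the median is taken to be the 109th most liberal Republican, which is the choice that reproduces the 326th rank stated in the note.

```python
N_REP, N_DEM = 218, 217
SOLID_DEM = 167  # the most liberal members, all Democrats

# Ranks run from most liberal (1) to most conservative (435).
party = ["D"] * SOLID_DEM
remaining_dem = N_DEM - SOLID_DEM
for _ in range(remaining_dem):
    party += ["R", "D"]                      # Republicans and Democrats alternate
party += ["R"] * (N_REP - remaining_dem)     # everyone else is a Republican

rep_ranks = [i + 1 for i, p in enumerate(party) if p == "R"]
dem_ranks = [i + 1 for i, p in enumerate(party) if p == "D"]

most_moderate_rep = rep_ranks[0]
median_rep = rep_ranks[(len(rep_ranks) - 1) // 2]  # 109th of 218 (assumed convention)
strong_party_shift = median_rep - most_moderate_rep

print(most_moderate_rep, dem_ranks[-1], median_rep, strong_party_shift)
print(round(100 / strong_party_shift, 3))
```

The ratio of a 100-rank shift to the "strong" party shift comes out slightly below two-thirds, as the note says.
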
The findings are straightforward. First, the estimates
of party influence are similar whether we use $X_{LOP}$ or
$X_{NPAT}$ to control for preferences. If anything, party
influence appears even more important when $X_{NPAT}$ is
used. This is true for both the linear model and the
W-NOMINATE model. Second, the estimated effect
of party influence is substantively large and highly
statistically significant regardless of whether we use the
linear or W-NOMINATE functional forms. The W-
NOMINATE model produces smaller estimates of
party influence, but the qualitative conclusions are the
same. Finally, the third, sixth, and ninth columns in
Table 2 show that the estimates for the restricted
sample of representatives with NPAT scores are simi-
lar to the estimates for the full sample of all represen-
tatives with scores based on roll calls.16


A FOCUS ON CLASSIFICATION SUCCESS 
UNDERSTATES PARTY INFLUENCE 


We noted above a difficulty with the baseline used by 
McCarty, Poole, and Rosenthal to measure party in- 
fluence. Even if this problem is ignored, however, their 
method is flawed because it will frequently miss party 
influence, even when it is present. A simple example 
reveals the problem. The example shows that even if 
the majority party exerts a large amount of influence— 
enough to move policy from the chamber median to 
the party median, sometimes called a “strong” party 
influence—adding a second cutline typically will lead to 
only a tiny gain in classification success. In fact, in 
typical cases the gain will be similar to or even less than 
the 1-2% gain reported by McCarty, Poole, and 
Rosenthal. In many cases, adding a second cutline will 
not improve classification success at all, despite the 
presence of substantial party influence. 

Consider voting over one “canonical” bill, which pits
the majority party median against the floor median. We
will calculate the classification success using a one-
cutline model. Classification success using a two-cut-
line model (one for each party) will be 100%.17

There are two parties, $L$ and $R$. Each has a symmet-
ric, triangular distribution of ideal points, and for
simplicity we assume the triangles are similar.18 With-


16 The full sample does not contain exactly 435 observations because
of intrasession turnover.

17 In this example, we assume that voting is deterministic, which we
do for analytic convenience only (as do McCarty, Poole, and
Rosenthal in their example on page 675). Similar results hold when
voting is probabilistic. We cannot find simple closed-form solutions
under probabilistic voting, even for simple examples, so we must run
simulations. These show that the basic conclusions of this section
hold, sometimes even more strongly, when voting is probabilistic.
There are two countervailing forces. First, if parties must allocate
their “influence” before seeing the member-specific idiosyncratic
shocks, then they will want to pad their majorities, which increases
the amount of measurable party influence. This will not be true, of
course, if parties can observe the member-specific idiosyncratic
shocks before allocating their influence. Second, voting “errors” will
be especially frequent among legislators close to the cutline, hence
among many of the influenced legislators, so adding a second cutline
will not improve classification success as much as it does under
deterministic voting.

18 We employ a triangular distribution, because it is an analytically 
out further loss of generality, fix the support of the $R$
distribution at $[0, 1]$. Let the support of the $L$ distribu-
tion be $[l_0, l_1]$, with $l_0 < 0$. Let $N_R$ be the measure of
the set of legislators in $R$, let $N_L$ be the measure of the
set of legislators in $L$, and let $\psi = N_L/N_R < 1$ (so $R$ is
the majority party). Normalize by setting $N_R = 1$. Thus,
the density of the $R$ distribution, $f_R$, is given by
$f_R(z) = 4z$ for $z < 1/2$ and by $f_R(z) = 4(1 - z)$ for $z > 1/2$.
Denote the cumulative distribution functions of the
ideal point distributions by $F_L$ and $F_R$; so, $F_L(l_0) = 0$,
$F_L(l_1) = \psi$, $F_R(0) = 0$, and $F_R(1) = 1$. Denote the
“overlap” between the parties by $\phi$. For convenience
we assume that the median of party $L$ is to the left of
all members of party $R$ and that the median of party $R$
is to the right of all members of party $L$. That is, we
assume $(l_0 + l_1)/2 \le 0$ and $l_1 \le 1/2$. When $l_1 \le 0$,
$\phi = 0$. When $l_1 > 0$, $\phi$ is equal to the fraction of legislators
whose ideal points lie in $[0, l_1]$, that is,
$\phi = [F_L(l_1) - F_L(0) + F_R(l_1) - F_R(0)]/(1 + \psi)$.19 Denote the cham-
ber median by $\mu$. The status quo is at $\mu$. The bill is at
the median of the $R$ distribution, $1/2$; so the cutline for
legislators who are free from party influence is
$c = (2\mu + 1)/4$.

One other point is crucial, which we denote $c_R$. This
is the point at which, if all members of $R$ with ideal
points in $[c_R, c]$ are persuaded to vote for the legisla-
tion, the bill will exactly pass. So
$[1 - F_R(c_R)] + [\psi - F_L(c)] = F_R(c_R) + F_L(c) = (1 + \psi)/2$.
Note that $F_L(c)$
might be equal to $\psi$, in which case only members of $R$
vote for the bill, and $c_R = F_R^{-1}((1 - \psi)/2)$.

We assume that $R$ only exerts influence on $R$-
affiliated legislators with ideal points in $[c_R, c]$ and
persuades all of them to vote for the bill. It passes, and
policy is moved from the chamber median to the $R$
caucus median, a large shift. Denote the fraction of $R$
affiliates who “switch” their vote as a result of pressure
by $S_R = F_R(c) - F_R(c_R)$.

Let $c_1$ be the optimal cutline using a one-cutline
model. That is, taking the distributions of ideal points
as known, $c_1$ is chosen to minimize the fraction of votes
that are misclassified. Let $M$ be the fraction of misclas-
sified votes using a one-cutline model. Clearly, there is
no misclassification using a two-cutline model: Setting
the $L$ cutline at $c$ and the $R$ cutline at $c_R$ produces a
perfect classification. To calculate $c_1$ and $M$, we must
consider five cases. A few straightforward calculations
yield the following:


1. $\mu \ge l_1$. Then $c_1 = c_R$, and $M = 0$.

2. $\mu < l_1$, $l_1 \le c$, and $l_1/2 \le c_R$. Then $c_1 = c_R$,
and $M = 2(l_1 - c_R)^2/(1 + \psi)$.

3. $\mu < l_1$, $l_1 \le c$, and $l_1/2 > c_R$. Then $c_1 = l_1/2$,
and $M = (l_1^2 - 2c_R^2)/(1 + \psi)$.

4. $\mu < l_1$, $l_1 > c$, and $l_1/2 \le c_R$. Then $c_1 = c_R$,
and $M = 2[(l_1 - c_R)^2 - (l_1 - c)^2]/(1 + \psi)$.

5. $\mu < l_1$, $l_1 > c$, and $l_1/2 > c_R$. Then $c_1 = l_1/2$,
and $M = [l_1^2 - 2c_R^2 - 2(l_1 - c)^2]/(1 + \psi)$.


tractable way to capture the distribution of ideal points typically
produced by empirical (metric) models of roll call voting, such as
Poole and Rosenthal’s NOMINATE, D-NOMINATE, and
W-NOMINATE, and various linear factor models.

19 Given our parameterization, $\psi = (l_1 - l_0)^2$ and $\phi = 4l_1^2/(1 + \psi)$.
FIGURE 3. Increase in Classification Success When Party Influence Exists


Figure 3 gives an example of case 2. 

We can treat two parameters as “primitives”: the
relative size of the minority party, $\psi$, and the degree of
overlap, $\phi$. We can calculate all others, including $M$ and
$S_R$, from these. Rather than present a full characteriza-
tion, we provide a table of calculations to illustrate
what happens in a variety of circumstances. In Table 3,
the columns labeled $N_R$ and $N_L$ show the number of
legislators implied by the value of $\psi$ for a legislature
with 435 members.
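
The calculation from the two primitives can be sketched as follows. This Python sketch is a reconstruction under stated assumptions, not the authors' program: it uses the relations $\psi = (l_1 - l_0)^2$ and $\phi = 4l_1^2/(1 + \psi)$ from note 19, finds the chamber median by bisection, and applies the five cases listed above. The function names are hypothetical, the sketch does not check the assumed restrictions on $l_0$ and $l_1$, and the example passes $\psi$ as the exact ratio 200/235, on the assumption that the $\psi$ values shown in Table 3 are rounded.

```python
import math

def tri_cdf(t):
    """CDF of the symmetric triangular distribution on [0, 1]."""
    if t <= 0.0:
        return 0.0
    if t >= 1.0:
        return 1.0
    return 2.0 * t * t if t <= 0.5 else 1.0 - 2.0 * (1.0 - t) ** 2

def tri_inv(p):
    """Inverse of tri_cdf for p in [0, 1]."""
    return math.sqrt(p / 2.0) if p <= 0.5 else 1.0 - math.sqrt((1.0 - p) / 2.0)

def one_cutline_row(psi, phi):
    """Compute l1, mu, c, c_R, c_1, M, and S_R from psi and phi (phi > 0)."""
    l1 = math.sqrt(phi * (1.0 + psi)) / 2.0  # from phi = 4 l1^2 / (1 + psi)
    width = math.sqrt(psi)                   # from psi = (l1 - l0)^2
    l0 = l1 - width

    def F_R(z):
        return tri_cdf(z)

    def F_L(z):
        return psi * tri_cdf((z - l0) / width)

    # Chamber median: half of the total mass (1 + psi) lies to its left.
    half = (1.0 + psi) / 2.0
    lo, hi = l0, 1.0
    for _ in range(80):
        mid = (lo + hi) / 2.0
        if F_L(mid) + F_R(mid) < half:
            lo = mid
        else:
            hi = mid
    mu = (lo + hi) / 2.0

    c = (2.0 * mu + 1.0) / 4.0        # cutline without party influence
    c_R = tri_inv(half - F_L(c))      # R cutline at which the bill just passes
    S_R = F_R(c) - F_R(c_R)           # share of R induced to switch

    if mu >= l1:
        case, c1, M = 1, c_R, 0.0
    elif l1 <= c and l1 / 2.0 <= c_R:
        case, c1, M = 2, c_R, 2.0 * (l1 - c_R) ** 2 / (1.0 + psi)
    elif l1 <= c:
        case, c1, M = 3, l1 / 2.0, (l1 ** 2 - 2.0 * c_R ** 2) / (1.0 + psi)
    elif l1 / 2.0 <= c_R:
        case, c1 = 4, c_R
        M = 2.0 * ((l1 - c_R) ** 2 - (l1 - c) ** 2) / (1.0 + psi)
    else:
        case, c1 = 5, l1 / 2.0
        M = (l1 ** 2 - 2.0 * c_R ** 2 - 2.0 * (l1 - c) ** 2) / (1.0 + psi)

    return {"l1": l1, "case": case, "mu": mu, "c": c,
            "c_R": c_R, "c_1": c1, "M": M, "S_R": S_R}

row = one_cutline_row(psi=200 / 235, phi=0.25)
print({k: round(v, 3) for k, v in row.items()})
```

For these inputs the sketch returns case 2, with $M$ close to .023 and $S_R$ close to .19, in line with the corresponding row of Table 3.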

The conclusion is simple: $M$ is never very large, even
when there is substantial overlap between the parties
(see the columns labeled $\phi$ and $M$ in Table 3). In many
cases, $M = 0$. In all cases, the amount of influence is
large: 15-20% of the $R$ legislators switch their vote as
a result of party influence, and the policy outcome is
moved from $\mu$ to .5 (see the columns labeled $S_R$ and $\mu$).

What does Table 3 tell us about the results reported 
by McCarty, Poole, and Rosenthal? As they note, even 
on a straight party vote there is an upper bound on the 
amount to which a two-cutline model can increase 
classification over a one-cutline model. They then 
report (McCarty, Poole, and Rosenthal 2001, 678): 
“The average upper bound over all the Congresses we 
analyze is 5%.” Given the parameterization in our 
example, a two-cutline model reduces misclassifica-
tions on a straight party vote by $l_1^2/(1 + \psi) = \phi/4$.
Setting this equal to .05 yields $\phi = .20$. For the even
higher value of $\phi = .25$ (highlighted in Table 3), we see
that $M$ ranges from .000 to .023. So, even if $R$ influences
its members on all close votes, there is only a gain of
0% to 2.3% in classification success under the two-
cutline model. In our earlier article we estimated
statistically significant party influence on about 45% of
votes in the 82d-105th Congress (1951-98). If $R$ influ-
ences its members on only 45% of the close roll calls,
and if we use the 2.3% figure and average across all
close roll calls, the gain in classification success under
the two-cutline model is just 1%.20 This is even less
than what McCarty, Poole, and Rosenthal find. For 
example, in their Figure 7 for the same set of Con- 
gresses, on close votes the average gain in classification 
success from the two-cutline model is about 1.5%. 
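
The arithmetic behind these figures is short enough to verify directly. The Python sketch below restates it; the variable names are illustrative, and the inputs are the values quoted in the text.

```python
# Straight party vote: the two-cutline gain is phi / 4 under the
# parameterization in the text, so an upper bound of .05 implies phi.
upper_bound = 0.05
phi_at_bound = 4 * upper_bound

# Averaging the largest gain in the phi = .25 rows (2.3%) over all close
# roll calls when party influence is present on only 45% of them.
max_gain = 0.023
share_influenced = 0.45
expected_gain = share_influenced * max_gain

print(round(phi_at_bound, 2), round(100 * expected_gain, 2))
```

The averaged gain is about 1%, which is below the roughly 1.5% average gain on close votes cited from McCarty, Poole, and Rosenthal's Figure 7.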


CONCLUSION 


McCarty, Poole, and Rosenthal claim that our strate- 
gy—the use of lopsided roll calls to estimate ideal 
points—may not accurately estimate the ideal points of 
moderate representatives, which would bias our esti- 
mates of party influence. We would agree if that were 
the case, but we believe the use of lopsided roll calls 
does not lead to poor estimates of moderates’ ideal 
points. 

McCarty, Poole, and Rosenthal’s argument rests 
heavily on a simple example with errorless voting. 
We have shown that this assumption is strongly 
contradicted by the evidence, which reveals that con- 
gressional voting contains a fairly large amount of 
“error.” Therefore, it is much more accurate to view 
roll call voting in terms of a low-dimensional probabi- 


20 Even this is probably an overestimate. During the 82d-105th
Congress the average value of $\psi$ was .71, which corresponds to the
bottom of the highlighted rows in Table 3, for which $M = .000$.
TABLE 3. Calculations for Cases 1-5

$\psi$  $N_R$  $N_L$  $\phi$  $l_1$  Case  $\mu$   $c$   $c_R$  $c_1$   $M$   $S_R$
 .85     235    200    .05     .15     1    .19    .35    .19    .19    .000   .17
 .85     235    200    .10     .22     2    .19    .35    .19    .19    .001   .17
 .85     235    200    .15     .26     2    .20    .35    .19    .19    .005   .17
 .85     235    200    .20     .30     2    .21    .36    .19    .19    .013   .18
 .85     235    200    .25     .34     2    .22    .36    .19    .19    .023   .19
 .85     235    200    .30     .37     4    .24    .37    .19    .19    .035   .20
 .85     235    200    .35     .40     5    .25    .37    .20    .20    .045   .20
 .78     245    190    .05     .15     1    .24    .37    .24    .24    .000   .16
 .78     245    190    .10     .21     1    .24    .37    .24    .24    .000   .16
 .78     245    190    .15     .26     2    .24    .37    .24    .24    .001   .16
 .78     245    190    .20     .30     2    .24    .37    .24    .24    .004   .16
 .78     245    190    .25     .33     2    .25    .38    .24    .24    .010   .17
 .78     245    190    .30     .36     2    .26    .38    .24    .24    .018   .18
 .78     245    190    .35     .39     4    .27    .38    .24    .24    .028   .18
 .71     255    180    .05     .15     1    .27    .39    .27    .27    .000   .15
 .71     255    180    .10     .21     1    .27    .39    .27    .27    .000   .15
 .71     255    180    .15     .25     1    .27    .39    .27    .27    .000   .15
 .71     255    180    .20     .29     2    .27    .39    .27    .27    .001   .15
 .71     255    180    .25     .33     2    .28    .39    .27    .27    .004   .16
 .71     255    180    .30     .36     2    .28    .39    .27    .27    .009   .16
 .71     255    180    .35     .39     2    .29    .39    .27    .27    .016   .16

$\psi$ = (size of minority party)/(size of majority party).
$c$ = cutline on bill under sincere voting.
$c_R$ = cutline inside R under party influence ($c$ = cutline inside L).
$c_1$ = cutline chosen to minimize classification errors in one-cutline model.
$M$ = fraction of votes misclassified under one-cutline model.
$S_R$ = fraction of R legislators who change their vote due to party influence.


listic voting model than a deterministic model. Poole 
and Rosenthal themselves have strenuously argued 
in support of this view and have mustered a large 
amount of evidence for it in their important work on 
NOMINATE. In a probabilistic voting framework, we 
can estimate the ideal points of moderates as well as 
extremists, even using a set of roll calls with cutlines 
that rarely fall between moderates. The Monte Carlo 
simulations in our previous article demonstrate this. 
We demonstrate it in another way here, by showing 
that an independent source of data for measuring ideal 
points—the NPAT survey—produces estimates that 
are very close to those produced by our lopsided roll 
call strategy. 

We also show that McCarty, Poole, and Rosenthal’s 
own results, when properly interpreted, actually sup- 
port our earlier conclusions. In particular, their two- 
cutline method reveals that party influence is signifi- 
cant on average close votes. Indeed, it is large enough 
to move the induced ideal point of a legislator across 
about 11% of the House. We also demonstrate that 
McCarty, Poole, and Rosenthal’s “classification im- 
provement” criterion is inappropriate for assessing 
party influence. Even when party influence is large— 
large enough to be consistent with one definition of 
“strong party government”—the improvement in clas- 
sification success often will be as small or even smaller 
than the 1.5% that McCarty, Poole, and Rosenthal 
find. 

Many questions remain open. Perhaps the most 
important concerns the sources of party influence. 





McCarty, Poole, and Rosenthal (2001, 674) argue that 
“the main influence of party discipline is not on the 
votes on specific roll calls but on the choice of ideal 
point made by the representative.” This may be true, 
but it is meaningless in the context of their article. 
After all, their ideal point estimates are defined in 
terms of roll call votes! Because these are the only data 
used to estimate ideal points, it is a contradiction to 
argue that party influence is “manifest in the location 
of the legislator’s ideal point” (p. 686) but that party 
influence has no effect on the legislator’s roll call votes. 
More usefully, McCarty, Poole, and Rosenthal 
(2001, 686) suggest that forces external to Congress, 
such as “external pressures of campaign donors and 
primary races,” are more important than internal 
forces. A rough calculation provides a crude idea about 
the relative magnitude of internal and external forces. 
McCarty, Poole, and Rosenthal note that when repre- 
sentatives switch parties, on average their ideal point 
moves across 28% of the ideal points of the legislature. 
If one accepts our interpretation of their results, then 
influence due to forces within the House causes a 
legislator’s ideal point to move across about 11% of the 
ideal points of the House. This implies that when a 
legislator switches parties, about 39% (11/28) of the 
total movement is caused by party influence within the 
legislature. The remaining 61% is evidently due to 
external factors, such as new primary voters and cam- 
paign donors. Interestingly, this is roughly consistent 
with Levitt’s (1996) study of senatorial voting. Levitt 
finds that senators place approximately equal weight on 
the positions desired by their primary constituents and
the positions desired by their caucus or party leaders. 
Yet, as McCarty, Poole, and Rosenthal note, most 
studies find that changing external circumstances, such 
as redistricting, have little effect on members’ voting 
scores, which suggests little responsiveness to overall 
district preferences. Taken together, these findings 
imply that primary electorates and campaign donors 
have a greater effect than overall constituencies on 
representatives’ roll call voting. Most democratic the- 
orists would probably find this highly disturbing. 
Feminism and Liberalism Reconsidered: The Case of Catharine


MacKinnon 


DENISE SCHAEFFER College of the Holy Cross 


liberal political theories. I highlight the ways in which MacKinnon makes use of several fundamental liberal 
tenets, such as the primacy of individual choice, and then consider how her work contributes to an ongoing 
discussion about the relevance of liberal theory to contemporary feminist concerns. 


Feminist theory has contributed a great deal to our
understanding of the incompleteness of unrecon-
structed liberalism for addressing all aspects of
human life. But the view that liberalism and feminism 
are incompatible has become widespread, especially 
among radical feminists who reject liberalism for offer- 
ing women “a piece of the pie as currently and poison- 
ously baked” (Morgan 1996, 5). When liberalism is 
understood as thoroughly patriarchal, feminism is un- 
derstood as something separate from and beyond 
liberalism. The presentation of liberalism and femi- 
nism as disjunctive—indeed, as a contradiction’— 
raises the question of whether we have become too 
eager to dissociate thoroughly feminism from liberal- 
ism. Once we acknowledge the gender bias in many 
classical liberal tenets as originally formulated, we are 
left with the question of whether liberalism is simply 
one of “the master’s tools [that] will never dismantle 
the master’s house” (Lorde 1984).2 Does liberalism
remain relevant to feminism, or does feminism require 
“a new process of theorizing and a new form of theory” 
(MacKinnon 1989, 116)? The prominent feminist legal 
theorist, Catharine A. MacKinnon, is one of the more 
forceful defenders of this latter claim. Yet, I shall 
argue, certain aspects of the theoretical project she lays 
out in Toward a Feminist Theory of the State rest on 
full potential.

2 “Liberalism” is notoriously difficult to define. I will refer to specific
features that are accepted as intrinsic to the liberal
tradition, which is made up of several strands, without claiming to
address all the features that can be attributed to liberalism. In
general, I focus on liberal values, principles, and theoretical commit-
ments (such as the primacy of individual choice and self-determina-
tion, and the equal moral worth of all human beings based on a
shared human core of dignity), rather than on liberal political


liberal underpinnings. I maintain that liberal theory 
remains crucial to feminist theory’s attempt to articu- 
late fully the possibilities of feminism, even as a 
feminist critique is necessary to articulate fully the 
possibilities of liberalism. 

Although the most vociferous debate in feminist 
theory currently centers on disentangling the complex 
relationship between feminism and postmodernism, 
the relationship between feminism and liberalism con- 
tinues to be analyzed and reassessed as well (see, e.g., 
Allen 1988; Hirschmann 1996; Johnson 1994; Kens- 
inger 1997; Nussbaum 1999; Sinopoli and Hirschmann 
1991; Wendell 1987). Pauline Johnson (1994) has 
identified three stages in the changing relationship 
between feminism and liberalism. First, the classical 
paradigm was problematically applied to women by 
liberal theorists such as Mill. Second, feminist theorists 
exposed the patriarchal biases of the classical para- 
digm. The third stage, in which Johnson locates her 
own work, involves grappling with the following ques- 
tion: “In what terms, namely, are the value commit- 
ments of liberalism, which have underpinned femi- 
nism’s own culture-critique, to be preserved in the light 
of the powerful feminist challenge to the conception of 
politically qualified subjectivity which has traditionally 
underpinned the critical values of liberalism” (1994, 
69)? 

Defenders of liberalism against the feminist critique 
often fail to take into account the specific challenges
posed by Catharine MacKinnon. The exception is
Martha Nussbaum (1999), who responds to several 
aspects of MacKinnon’s critique, although she does not 
address MacKinnon’s critique of liberal epistemology, 
to which I will devote considerable attention. Other 
commentators who address MacKinnon’s work tend to 
ignore her theoretical claims and focus on the more 
limited task of reconciling antipornography legislation 
with the First Amendment (see, e.g., Scoccia 1996). 
Such an approach does not address the broader ques- 
tion of whether the assumptions underlying liberal 
theory are themselves part of the problem. Similarly, 
Easton (1994) examines Britain’s 1986 Public Order 
Act, which prohibits incitement to racial hatred, with a 
view toward prohibiting pornography as incitement to 
sexual hatred. She defends such legislation by appeal- 
ing to the liberal concern for “a more equal and just 
social order” (p. 154). Although this argument appears 
to link the practical political issue with broader theo-
retical questions, it appeals to an overly broad concep-
tion of liberalism—both Marxism and anarchism, for
example, are certainly also concerned with a more
equal and just social order—and disregards the degree 
to which “equality” and “justice” are contested terms. 
While offering useful practical insights, Easton circum- 
vents important theoretical questions about feminism 
and liberalism. 

In light of MacKinnon’s (1987) claim to offer an 
“unmodified” feminism as well as the significant effect 
of her work on both the political and theoretical 
landscapes, it is important to confront the unique 
challenge she poses to liberalism before we can deter- 
mine whether feminism can be useful in revealing not 
only the limitations but also the potential of liberalism. 
I will demonstrate that MacKinnon’s work should be 
understood as part of a conversation within liberalism, 
not as a separate and fundamentally antagonistic en- 
terprise. My intent is neither to render MacKinnon’s 
ideas more palatable or less threatening nor to dismiss 
her equally important debt to Marxism.3 Although
MacKinnon goes out of her way to distinguish her 
feminist theory from both liberalism and Marxism, the 
Marxist dimensions of her work have been widely 
discussed (Brown 1996; Cornell 1991; Ring 1990). Yet, 
the liberal dimensions of her work remain underappre- 
ciated. This results in a largely inaccurate understand- 
ing of MacKinnon’s feminist theory and contributes to 
the common misperception of the relationship between 
contemporary feminism and liberal theory as funda- 
mentally antagonistic. 

I begin with MacKinnon’s theoretical writings rather 
than the more recent Only Words, which is a particular 
application of the feminist method and jurisprudence 
that MacKinnon explains and defends in her earlier 
works. In Toward a Feminist Theory of the State, she 
distinguishes her own position from two other tenden-
cies found in feminist approaches to liberalism.4 The
first charges liberalism with inconsistency, arguing that 
liberal ideals traditionally reserved for the public realm 
must be extended to the private realm (see Okin 1990). 
The second, which grew largely out of the work of 
Carol Gilligan (1982), argues that the virtues tradition- 
ally relegated to the (feminine) private realm should 
replace the traditional (masculine) liberal model of the 
public realm structured around atomistic individualism 
(see Held 1993; Ruddick 1989). Scholars of this per- 
suasion argue that the public sphere should be recast in 
terms of care and connectedness, which would entail a 
revaluation of women’s voices, work, and ways of being 
in the world. 

In contrast, MacKinnon believes our options should 
not be limited to insisting that women are independent 
individuals, on the one hand, or celebrating the fact 
that they are not, on the other. Rather, she is interested 


3 We may broadly distinguish between “liberal” and “Marxist”
feminist theories, but MacKinnon (1989, 39) reminds us that “few are
exclusively liberal or exclusively radical.”

4 Nussbaum (1999) offers a thorough and persuasive defense of
liberalism against these two strands of feminist critique. 
in changing the social conditions that prevent women
from being independent individuals. She criticizes lib- 
eral theory for presupposing that women are self- 
defining individuals, but she directs feminism to create 
conditions that support rather than undermine wom- 
en’s individuality, which liberals erroneously take for 
granted. To insist that women are individuals “will not 
make it so; it will obscure the need to make change so 
that it can be so” (MacKinnon 1987, 59, emphasis in 
original). Historically, liberalism as applied to the 
project of women’s liberation has generally focused 
upon removing the explicit, state-sanctioned barriers 
that kept women from competing in the public sphere. 
MacKinnon sees barriers where others have not. In 
order to make these barriers visible, she challenges 
liberal theory in two ways, criticizing the liberal ten- 
dency toward abstract equality as well as the tendency 
to rely on empiricist assumptions about what it means 
to know social reality. 


MACKINNON’S CRITIQUE OF ABSTRACT 
EQUALITY 


On the surface of her argument, MacKinnon rejects 
the classical liberal conception of equality. She ac- 
knowledges that formal equality has achieved impor- 
tant advances for women (noting that it has been even 
more successful in gaining access for men to women’s 
traditional protections), but she argues that it is insuf- 
ficient for several reasons. First, it is virtually impossi- 
ble to apply the liberal understanding of equality 
(treating likes alike and unlikes differently) to issues of 
sex and gender, since the sexes are socially defined by 
their mutual unlikeness. “Socially, one tells a woman 
from a man by their difference from each other, but a 
woman is legally recognized to be discriminated against 
on the basis of sex only when she can first be said to be 
the same as a man... . Sex equality becomes a contra- 
diction in terms, something of an oxymoron” (1989, 
216). 
Second, MacKinnon argues that gender difference is 
more accurately gender dominance. Norms of mascu- 
linity and femininity vary over time, but the social 
meanings of “man” and “woman” remain linked to a 
hierarchical conception of heterosexuality in which 
dominance and submission define the two. MacKinnon 
maintains that despite the legal distinction between 
rape and sex, which turns on the question of consent, a 
certain degree of force/dominance is actually built into 
our understanding of normal heterosexual behavior. It 
is socially acceptable for men to pressure women for 
sex; only after some (ambiguous) threshold is crossed is 
a wrong committed. We dither over how much pressure 
is too much without questioning why any at all is 
acceptable. When a judge refers to “half-won argu- 
ments in parked cars” as part of the normal dance of 
sexual behavior that the law must protect, MacKinnon 
asks why half-won is enough. “Why not half-lost” 
(1991b, 193)? The roles men and women are expected 
to play in this view are not simply different, but 
inherently unequal. 

The acceptance of hierarchy in sexual relations as a 
natural (and thus innocuous and immutable) difference
between men and women undergirds our tendency to 
accept other “differences” between the sexes. Because 
equality is premised on sameness, this leads to a real 
disparity in social power between men and women, 
with the result that women are rarely “similarly situ- 
ated” to men. Beyond the obvious fact that women get 
pregnant, centuries of economic dependence, differen- 
tial treatment in education, discriminatory legislation, 
and physical/sexual intimidation have created a situa- 
tion in which the average woman is not similarly 
situated to the average man. The playing field is not 
level, and pretending does not make it so. 

Because the average woman is not similarly situated 
to men, purely formal equality can be quite detrimen- 
tal, as the “feminization of poverty” phenomenon 
indicates (see Pearce 1990). Insofar as the sexes are 
socially unequal, when the law treats them neutrally, it 
tends to enforce women’s inequality. For example, the 
move to a gender-neutral criterion for deciding child 
custody did not result in greater equality for women. 
“Men often look like better ‘parents’ under gender- 
neutral rules like level of income and presence of 
nuclear family, because men make more money and (as 
they say) initiate the building of family units. In effect, 
they get preferred because society advantages them 
before they get into court, and law is prohibited from 
taking that preference into account because that would 
mean taking gender into account” (MacKinnon 1991a, 
83-4). 

Therefore, if women are treated by the law in 
abstraction from the fact that they are women, their 
unequal status is rendered invisible and remains un- 
changed. But if they ask to be treated as “women,” they 
provide justification for unequal treatment by admit- 
ting they are different from men. MacKinnon thus 
explains why contemporary feminism appears to be 
divided against itself, why it is said that feminism 
cannot decide whether women want equality or “spe- 
cial treatment.” Feminists are accused of trying to have 
it both ways. The question MacKinnon asks is: Men 
have it both ways, and why is that not considered 
unfair? Her answer is that male privilege is invisible; it 
is not described as having it both ways. For men, there 
is no conflict between the (so-called) neutral and the 
particular. 

For these reasons, MacKinnon (1991b, 184) con- 
cludes that liberalism is insufficient to accomplish 
feminist goals: “Applied to women, liberalism has 
supported state intervention on behalf of women as 
abstract persons with abstract rights, without scrutiniz- 
ing the content of these notions in gendered terms.” 
What such scrutiny entails is not an abandonment of 
liberal principles but a new way of studying social 
reality in order to perceive better where and how it 
contradicts those principles. MacKinnon demonstrates 
that liberalism is insufficient unless supplemented by a 
feminist critique of the gendered social reality that 
liberal neutrality takes as a given, but it does not follow 
that liberalism and feminism are incompatible. Indeed, 
in my view, her analysis suggests that each requires the 
other. After I examine MacKinnon’s epistemological 
critique of liberalism, it will become apparent just how
and why this is the case. 


CHALLENGING THE “FACTS” OF LIFE


In order to correct liberalism’s deficiencies, Mac- 
Kinnon offers a rethinking of the epistemological as- 
sumptions that underlie liberal political theories. The 
liberal approach to sex equality is most helpful in 
redressing inequalities due to stereotyping. Yet, any 
difference that is found to have a factual basis justifies 
differential treatment. The problem, according to 
MacKinnon, is that many “facts” about women that are 
taken to be real differences are not immutable givens; 
they have been produced by the sexual inequality 
embedded in our social arrangements. Women have 
been damaged by sexual inequality; if we take their 
damaged selves to be an accurate picture of what they 
are, they will never be similarly situated to men. 
Stereotypes are “most deeply injurious at the point at 
which they become empirically real” (MacKinnon 
1989, 230). The more “real” the stereotype, the harder 
it is to argue that it must be changed or fixed. “Where 
liberal feminism sees sexism primarily as an illusion or 
myth to be dispelled, an inaccuracy to be corrected, 
true feminism sees the male point of view as funda- 
mental to the male power to create the world in its own 
image, the image of its desires, not just as its delusory 
end product” (MacKinnon 1991b, 183). 

First and foremost, liberalism assumes individual 
agents who participate in but are not constructed by 
the field of social interaction, makers of meaning who 
are not themselves made. This view underlies the 
liberal state’s commitment to neutrality rather than 
interference and its unwillingness to change the social 
conditions perceived as “facts.” But insofar as the state 
remains neutral in the face of a social arrangement that 
is not neutral, male dominance is reinforced. MacKin- 
non charges that liberal neutrality or “objectivity” is 
merely blind to its own bias. She goes so far as to state 
that “male dominance is perhaps the most pervasive 
and tenacious system of power in history . . . it is meta- 
physically nearly perfect. Its point of view is the 
standard for point-of-viewlessness, its particularity the 
meaning of universality” (1989, 116-7). Its metaphys- 
ical near-perfection lies in its coherence; the male 
perspective is so embedded in social reality that it 
becomes almost invisible as anything particular in 
itself. It comes across as “the way things are” (p. 162) 
and thus, in the empiricist view, defines rationality, 
making all accounts of reality that contradict the 
accepted “facts” easily dismissed. It is this virtual 
invisibility of male privilege, rather than force, that 
serves as gender inequality’s best protection. 

Pornography, which disseminates a sexual script 
wherein women are portrayed as sexual objects that 
exist solely to gratify men,5 is a primary channel 





5 MacKinnon defines pornography as the graphic sexually explicit 
subordination of women (as well as men, children, or transsexuals) 
through pictures or words. Her definition does not include all erotic 
or sexually explicit materials, only those that also present women as 
through which sex and domination are fused. MacKin-
non therefore treats pornography as a crucial feminist
issue, arguing that it harms women in three ways. First, 
women are harmed in order to make pornography (just 
as children are harmed in the production of child 
pornography). Second, it perpetuates the image of 
women as sexual objects and thus dehumanizes them, 
undermining both their demands to be treated as full 
human beings and their credibility when they complain 
about sexual violation. “Men treat women as who they 
see women as being. Pornography constructs who that 
is” (1992, 298). It legitimates and promotes gender 
inequality by making that inequality sexual. The more 
widespread pornography use becomes, the more diffi- 
cult it becomes to dissociate sex and degradation. How 
many adolescents absorb their understanding of sex 
from pornography? What effect might this have? 

But the effect of pornography on the social position 
of women is not limited to the dissemination of harm- 
ful views of them. The conditioning effect constitutes 
the third and most important way pornography is 
harmful.6 By appealing not to the intellect but to a
largely unconscious sexual (physical) response, pornog- 
raphy habituates its users to experience a sexual thrill 
from the degradation of others (usually women). Inso- 
far as it is used for sexual gratification, MacKinnon 
argues, pornography must be considered as a material 
component of sex rather than something that reflects 
or depicts sexual experiences.7 Thus, pornography is
more of an action than a communication of ideas, and 
MacKinnon implicitly invokes Mill’s harm principle in 
framing her argument in these terms. Moreover, she 
attempts to redress the harm of pornography through 
legal reform rather than dismissing legal change as 
specious, as would someone who adopts a more thor- 
oughly Marxist perspective. 

By conditioning sexual desire and thus shaping sex- 
ual behavior, pornography does not just distort human 
sexual desires.® It also causes the dominance/submis- 
sion model of sexuality to become an empirical “fact” 
about sexual behavior, which renders the distinction 
between reality and myth or stereotype virtually mean- 


“dehumanized as sexual objects, things, or commodities; enjoying
pain or humiliation or rape; being tied up, cut up, mutilated, bruised
or physically hurt; in postures of sexual submission or servility or
display; reduced to body parts, penetrated by objects or animals, or


intended to distinguish pornography from other sexually explicit
material that does not meet these criteria and therefore is not


sexually discriminatory.

6 I believe it is the most important because it circumvents the
argument that pornographic images are mere speech or representa-
tion of ideas. The Seventh Circuit Court of Appeals acknowledged
the harm that pornography causes to women by disseminating the
view that they are sexual objects, but it then used this admission to
categorize pornography as protected speech that must not be
abridged based on the particular content or viewpoint it expresses.
See American Booksellers v. Hudnut 771 F. 2d 323 (7th Cir. 1985), aff.
mem. 475 U.S. 1001 (1986). 

7 For a more detailed explanation of how pornography is an activity
and not just a representation of ideas, see MacKinnon 1993. 

8 Nussbaum (1999) directs her arguments to this aspect of MacKin-
non’s discussion of pornography.
ingless. MacKinnon is not challenging the dominance/
submission model of heterosexuality as a myth that 
needs to be dispelled; she acknowledges dominance/ 
submission as a reality of sexual behavior and then 
challenges both the authenticity and the justice of that 
reality. The empiricism that informs so much liberal 
theorizing can at most capture a static picture of social 
reality. It fails to detect the contingency of the “facts” 
it perceives, and it thus regards pornography as mere 
representation and dominance/submission as reality. 
When the issue is framed this way, it becomes almost 
impossible to argue against gender dominance, as it is 
taken to be empirically real. This difficulty exemplifies 
what MacKinnon means when she says patriarchy is 
metaphysically nearly perfect. 

MacKinnon charges that most feminist analyses, 
even some that call themselves “radical,” fail to capture 
the complex reality of gender inequality because they 
unwittingly work from within a traditional liberal epis- 
temology. For example, in Gyn/Ecology, Daly (1978) 
discusses the traditional Hindu practice of suttee, in 
which a widow throws herself upon her dead husband’s 
funeral pyre, supposedly out of overwhelming grief and 
a desire to remain pure. Daly proves that the widows 
do not voluntarily throw themselves but are often 
drugged or pushed. By showing that suttee usually is 
not voluntary, Daly believes she is demystifying the 
whole practice. Her analysis suggests that women’s 
consciousness exists outside the prevailing understand- 
ing of their society. Women may be told that life is 
nothing without a husband, but they do not believe it. 
The harm Daly identifies is the explicit coercion of 
suttee, and she assumes that the women’s reality is 
located in the absence of coercion. 

But institutionalized inequality has the power to 
create inequality in its own image. In MacKinnon’s 
view, Daly ignores the material effects of systemic 
gender inequality, which not only hide women’s au- 
thentic response to the death of a husband but also 
shape it. Daly locates the “real” feminine response to 
suttee in women’s resistance, but their resistance is 
neither as universal nor as untouched by social expec- 
tations as Daly suggests. Daly thus closes off the 
possibility of understanding the true power of suttee, 
which is most profoundly manifested in the woman 
who does not have to be pushed, who wants to die 
when her husband dies. This woman, according to 
MacKinnon, is the greatest victim, notwithstanding the 
absence of visible, immediate coercion. A woman who 
jumps is not evidence of lack of coercion, but of 
coercion so fundamental that it has become founda- 
tional to a woman’s self-understanding. 

MacKinnon discerns a similar internalization when 
women subscribe to a social understanding of sexuality 
that tells them they should orient themselves toward 
men’s desires because to do so is to be a woman. What 
is at stake in MacKinnon’s analysis is the degree to 
which the social meaning of gender becomes the 
content of women’s lives. To the extent that women 
become “thingified in the head” (1989, 99), they appear 
to consent to their subjection, and liberalism is hard 
pressed to help them. “In other words, to the extent 
pornography succeeds in constructing social reality, it
becomes invisible as harm” (1992, 299). Moreover, to 
the extent that dominance and submission are eroti- 
cized as the basis of heterosexual relations, it is difficult 
for both men and women to discern the line between 
sex and rape. MacKinnon shares the liberal preoccu- 
pation with choice and consent but tries to make them 
meaningful for women by identifying unacknowledged 


barriers that interfere with their free exercise. 


BEYOND INEQUALITY: MACKINNON’S 
LIBERAL HUMANISM 


MacKinnon clearly draws on Marxist insights in formu- 
lating her critique of liberal idealism, but she then 
distances herself from Marxist critics by charging them 
with oversimplifying social reality as much as liberalism 
does, albeit from the opposite direction. “As liberal 
theory has looked for the truth of women in the mirror 
of nature, left theory has looked for the truth of women 
in the mirror of social materiality” (1989, 122). Like 
liberal idealism, materialism fails to capture the com- 
plex reality of woman as someone who both is and is 
not a product of her conditions. The complexity of this 
reality allows MacKinnon to avoid invoking the con- 
cept of false consciousness. We cannot say that either 
the woman who is explicitly coerced or the woman who 
does not need to be coerced is suffering from false 
consciousness. Both are exhibiting a real response to 
real conditions, and both responses need to be taken 
into account in order to appreciate those conditions. 
Yet, neither is exhibiting a fully real response because 
ultimately both are products of oppressive conditions. 
We have no idea, under these circumstances, what a 
woman’s real response would be. Feminist theory as 
conceptualized by MacKinnon orients itself toward 
both dimensions of women’s reality: the reality of 
women’s inequality and the reality of women as self- 
determining human beings. 

One must take careful note of MacKinnon’s ac- 
knowledgment that patriarchy is only nearly perfect. 
This careful qualification allows MacKinnon to identify 
some critical space from which women can challenge 
the system that has in large part produced them.
Marxists encounter a similar difficulty in trying to 
account for the class consciousness of the proletariat 
under capitalism, a parallel MacKinnon mentions 
(1989, 103-4). Yet, her theory entails a much weaker 
version of this problem because she never presents 
consciousness as a unilinear reflection of material 
conditions. If she did, it would be difficult to justify her 
argument that the empirical “facts” about women are 
not actually true, although they accurately describe 
women as they are and so cannot be dismissed as mere 
stereotypes. Paradoxically, the stereotypes are “true 
and false at the same time” (p. 104), which is why 
women can claim consciousness that existing social 
reality is not theirs and simultaneously argue that they 
have been constructed by that reality. MacKinnon does 
not dismiss patriarchy as mere illusion. “As it justifies 
itself, namely as natural, universal, unchangeable, 
given, and morally correct, it is illusory, but the fact
that it is powerful is not illusion” (p. 100). 

Although sexual inequality is real, feminism is also 
real, and it provides a glimpse of a “shadow world” 
(MacKinnon 1989, 104) of possibilities for women. 
Women are what they are (which is reflected in yet not 
exhausted by what they are said to be), but they are (or 
could be) something more or something else. The 
evidence of women’s reality beyond the reality of their 
experience of inequality is the awareness that they 
could be more than they are allowed to be, that is, its 
evidence is feminism. If sexual inequality were not real, 
feminism would not exist. If it were exhaustive of the 
real, feminism could not exist. Feminism exists at the 
intersection of the reality of women’s oppression and 
their reality as complete human beings and affirms the 
reality of both. The Enlightenment epistemology that is 
central to liberalism—the detached, rational subject 
knowing a separate, static object—is insufficient to 
capture this complex reality. MacKinnon thus requires 
an epistemic position that locates itself within the 
dynamics of the existing social system of gender in-
equality (because women are never “outside” it) and
yet is not wholly determined by that system (because 
women are not reducible to it). Mind and world are 
neither detached nor collapsed but interpenetrated. 

The necessary intersection of the reality of women’s 
inequality and the reality of women as self-determining 
human beings is not always appreciated by MacKin- 
non’s commentators. Often, MacKinnon is presented 
as someone who seeks to sensitize her audience to 
women’s victimization without pointing them beyond
it (Brown 1996, 94; Elshtain 1997, 253; Ring 1987, 470). 
Consider, for example, Cornell’s (1991, 128-9) charac- 
terization: 

MacKinnon is a “realist” in two senses. First, she is a 
“realist” in that she argues for a descriptive methodology 
in which the reality of gender difference, understood as a 
determinate presence that determines individual identity 
as sexualized, is traced and brought to consciousness. 
Thus, she argues against those... who have argued that 
social and legal reality is indeterminate. Secondly, and in a 
related manner, MacKinnon is a realist in the colloquial 
sense that she insists that women face up to reality. We 
must confront what male domination has done to us, 
rather than try to see the world of gender hierarchy 
through rose-colored glasses. 


Cornell suggests that MacKinnon sees women’s op- 
pression as constitutive of women’s reality. This is 
partly accurate insofar as MacKinnon criticizes those 
who abstract from existing inequality to view women as 
essentially free. But what Cornell ignores is that Mac- 
Kinnon herself then abstracts from that inequality in 
order to argue that oppression is not exhaustive of 
women’s reality. If it were, radical feminism would be 
no more than a celebration of victimhood, a worldview 
that denies women’s capacity for choice and self- 
determination. Instead, the emancipatory potential in 
women’s consciousness of the existing inequality is 
affirmed by MacKinnon (1989, 91) as essential to the 
development of feminist consciousness: “Women expe-
rienced the walls that have contained them as walls—
and sometimes walked through them.” She may criti-
cize liberals who focus their attention on protecting the
individuality of the exceptions to the rule without 
questioning why the rule—pervasive factual inequali- 
ties between the sexes—persists, but she cannot help 
but rely on the existence of “exceptions” to the perva- 
siveness of patriarchy in order to advance her own 
arguments. 

Cornell also opposes MacKinnon’s realism to the 
poststructuralist view that social reality is indetermi- 
nate. Clearly, MacKinnon sees the social/sexual objec- 
tification of woman as fairly stable rather than as a 
perpetually shifting text. Although she does emphasize 
the discursive construction of sex and gender and the 
importance of seeing the meaning of “sex” and “wom- 
an” in their social context, she does so for the purpose 
of arguing that this context is a distortion. She disputes 
the social meaning of womanhood not on the grounds 
that identity is inherently indeterminate but because 
there is something to “woman” that is authentic and 
real, and it must be taken into consideration in assess- 
ing whether our social order is just. Moreover, she 
seeks recourse to a position beyond power and history 
from which social reality can be evaluated. In her 
characterization of existing social reality as a distortion 
she implies another reality against which it can be 
measured. Although this reality does not, in MacKin- 
non’s view, simply underlie the cultural production of 
gender, untouched and waiting to be discovered once 
we strip away the layers of stereotypes, her references 
to this reality reveal her liberal, humanist tendencies. 
She posits a shared human core that she considers 
authentic rather than arbitrary, something that is more 
than a mere reflection of a new social order and serves 
as the basis for her arguments on behalf of women as 
individuals of intrinsic dignity and worth. 

MacKinnon does not propose that, apart from a 
shared experience of sexual objectification, women 
share any particular qualities, and she does not argue 
that women would be just like men absent gender 
inequality. If there is such a thing as the “human,” our 
understanding of it cannot be complete until women 
contribute to that understanding. Yet, MacKinnon 
(1989, 187) invokes a liberal humanist vision when she 
frames her principal question in the following terms: 
“Are women human beings or not?” Implicit in her 
challenge is an overriding concern with women’s ca- 
pacity for choice and self-determination. Moreover, 
the reasoning she supplies for her reluctance to spell 
out in advance any concrete details of a shared human 
core reflects liberal concerns about preserving individ- 
ual choice. Critics who deride MacKinnon as paternal- 
istic (Berns 1994; McElroy 1995) assume that she 


9 Because MacKinnon’s theory generalizes about gender, many
commentators accuse her of essentialism, that is, of attributing a
fixed essence in common to all women (see Cornell 1993; Harris
1990). Rapaport (1993) defends MacKinnon against this charge,
demonstrating that she makes only socially and historically contex-
tualized generalizations limited to modern Western industrialized
society. I agree that the generalizations are contextualized, but I also
think MacKinnon employs a general (and liberal) conception of the
human.
desires a state-sponsored, top-down process of social 
transformation in the name of her own vision of the 
good (feminist) society. Yet, MacKinnon explicitly 
rejects such an approach, criticizing those who claim to 
miss women’s voices so much that they (paradoxically) 
“proceed to imagine for them the world they should be 
a part of building” (MacKinnon 1987, 219). 

Rather, women must participate in building this 
world as “real” (free) women, not as walking projec- 
tions of male needs. If civil rights were conceptualized 
in such a way as to recognize the harm to women 
caused by sexual use and abuse, then individual women 
could bring suits against pornographers (and the spe- 
cific harm would have to be proven in each case, 
arguably a very high standard to meet). If pornography 
were actionable as a civil violation, the state would not 
have to ban books or art by decree. In MacKinnon’s 
presentation, the conflict over pornography exists be- 
tween individuals and not between the individual and 
the state. She intends her attack on pornography as an 
expansion of individual rights, rather than an 
abridgement of them.10 

It is particularly important to note MacKinnon’s 
attention to individualism because it is generally be- 
lieved that liberalism and feminism use two different 
units of social analysis, which creates a fundamental 
chasm between them insofar as liberalism is individu- 
alist and radical feminism collectivist.11 That distinc- 
tion is not absolute, however, from the point of view of 
either liberalism or radical feminism. Liberals have 
long acknowledged the importance of individual mem- 
bership in various collectivities to the definition of the 
individual.12 And although MacKinnon argues on 
behalf of women as a group, her antipornography civil 
rights legislation, written with Andrea Dworkin, is 
designed to permit anyone (including any man) who 
has been harmed on the basis of sex through the 
making or use of pornography, as the ordinance defines 
it, to sue. According to MacKinnon, any individual can 
occupy a position of sexual degradation; the antipor- 
nography statute is “gender-neutral in overall design” 
(MacKinnon 1992, 300). Her point is that when the 
recipient of harm is viewed as feminine (understood as 
sexually violable), the harm tends to disappear or get 
explained away. She does not jettison Mill’s harm 
principle; she asks that it be applied more judiciously. 
It is true that this requires looking beyond individual 
cases in order to see how gender operates structurally; 
but the group-level analysis is shown to be necessary 
precisely because of the way gender affects individual 
cases. 

Moreover, unlike some feminists who, inspired by 


10 MacKinnon has criticized Canadian policy for taking a criminal 
approach to pornography, although she commends the Canadians for 
at least recognizing that pornography is a sex equality issue (Mac- 
Kinnon and Dworkin 1996). 

11 Although he does not address MacKinnon, Graham (1994) argues 
that radical feminism is liberal with regard to ends but not with 
regard to means. My analysis of MacKinnon’s rejection of 
paternalism suggests that she, at least, is liberal with regard to means as well. 
12 Johnson (1994) provides an overview of liberalism’s appreciation 
of the interrelatedness of the individual and the group. See also 
Kautz 1995. 




Marx's attempt to eradicate class, seek to eradicate sex 
(Firestone 1970), MacKinnon does not demand an 
enforced obliteration of the gender distinction alto- 
gether, hoping to bring about a substantive sameness. 
Her “clean slate” is not a final goal; it is a new 
beginning from which individuals can and must act, and 
it therefore transcends the distinction between “equal- 
ity of opportunity” and “equality of results.” Some have 
argued that MacKinnon discards such concern about 
fairness in favor of “revenge” (Cornell 1991, 138), but 
this is not the case. The perennial liberal tension 
between freedom and equality may not be resolved by 
MacKinnon, but it is a concern that she incorporates. 


CONNECTING EPISTEMOLOGY AND 
POLITICS 


The problem is that the empiricist epistemology tradi- 
tionally employed by liberal theorists hampers our 
recognition of the barriers that make it virtually impos- 
sible for women to occupy the position of the individual 
who is the basic social unit in liberal theory. MacKin- 
non identifies this position, with the attendant capacity 
for choice and self-determination, as essential to the 
full experience of humanity. She does not, as do some 
other feminist critics, argue on behalf of an alternative 
to the liberal conception of the individual; she employs 
a liberal view of the self as an essential part of her 
argument. Her critique challenges the assumption that 
women presently occupy this position and that they 
should be treated neutrally by the law; her goal, 
however, is to foster social conditions under which such 
a view of women would be justified. For example, only 
when women have speech (only when they are heard 
even when they contradict the speech scripted for 
them) will they be in the position of liberalism’s 
presumed “individual,” whose free speech must be 
protected from government interference. It is from this 
perspective that she launches her attack on pornogra- 
phy. 

Paradoxically, MacKinnon wants liberals to see dif- 
ferently and to see more of what already tends to 
register as contrary to liberal principles. In her presen- 
tation of the complex relationship between conscious- 
ness and material conditions she stretches liberal the- 
ory the farthest, although she detects an appreciation 
for that complexity in Mill’s work ([1859] 1978, [1869] 
1988).13 For example, MacKinnon (1989, 44-5) cites 
favorably his acknowledgment that “women’s consent 
to their place is no less coerced for seeming acquies- 
cent.” Mill also addressed the fact that myths support- 
ing sexual inequality are true and false at the same 
time, which makes it impossible to defend women as 
“just like men.” He recognized the ways in which social 
practices caused a harm to women much more 
profound than formal inequality, even if the remedies he 


13 Ring (1991, 64-84) detects self-contradiction in Mill, resulting 
from his commitment to empiricism, rather than subtlety in his 
treatment of women. But based on Mill’s appreciation of the 
complex relationship between consciousness and material conditions, 
Burgess-Jackson (1995) argues that Mill has been mislabeled as a 
liberal feminist and was in fact a radical feminist. 
proposed did not adequately address that harm.14 The 
principle of individual inviolability supplied him, as it 
ultimately supplies radical feminism, with a standard 
for arguing the injustice of such harm, but that princi- 
ple requires the contributions of a feminist perspective 
(like that of MacKinnon, or perhaps Harriet Taylor) in 
order even to recognize the harm. Liberalism is open to 
seeing harm in new places; the question is whether it is 
open to seeing harm in new ways. MacKinnon has been 
quite successful in arguing that sexual harassment 
constitutes harm, which suggests that mainstream 
American culture is willing to rethink its received 
notions of acceptable sexual behavior without aban- 
doning its dominant liberal framework. 

Despite the influence of Marxism on her work, 
MacKinnon does not contextualize or historicize the 
ideals to which she appeals in defending the impor- 
tance of women’s reality. Some see this as a potential 
contradiction in her work. For example, Ring (1991) 
believes MacKinnon emphasizes relativism and subjec- 
tivism over objectivity, and she defends a version of 
dialectical materialism as superior to MacKinnon’s 
approach. Ring argues that the interaction between 
multiple subjectivities can provide access to objective 
reality. For example, the truth of whether a rape 
occurred can emerge out of the conflict between the 
man’s defensive account of his actions and the woman’s 
outrage at her perceived violation. Both must be 
weighed according to their emotional intensity. Each 
subjective response is valid as a subjective response; 
their clash should yield something greater than the sum 
of its parts. But what if a woman at some level believes 
she deserved what happened to her in a parked car? 
What if the woman’s outrage does not materialize or is 
so tinged with ambivalence that it lacks the kind of 
emotional intensity that would impress a jury? Accord- 
ing to Ring (1991, 201), “if that emotional bedrock of 
rage that I am here assuming accompanies a sense of 
violation, never emerges, the jury will indeed have to 
find that a rape did not occur.” She adds (p. 201): 
“Catharine MacKinnon herself would have to believe 
in the superiority of conventional objectivity, if she 
were to argue that such a legal decision is erroneous.” 
MacKinnon indeed appeals to some “objective” stan- 
dard even as she rejects conventional empiricism. I 
have shown that MacKinnon orients herself not only 
toward a material inequality (that must be taken into 
account in order to correct liberalism’s idealism) but 
also toward an abstract vantage point that cannot 
simply grow out of women’s experience of inequality, 
and to which she attaches fairly traditional liberal 
ideals. 

Presently, “sex equality” may be meaningless insofar 
as its requirements result in a contradiction in terms, 
but MacKinnon is not content to expose its contin- 
gency; rather, she works toward bringing about what 
she refers to as its true, comprehensive meaning (1989, 
242). To better understand what she means by this, we 
must simultaneously draw on and think beyond the 


14 Analyses of Mill’s failings in this regard are provided in Eisenstein 
1989 and Ring 1991. 
traditional liberal vocabulary. Only by revealing the 
illiberal tendencies of liberalism can certain harms be 
made visible. MacKinnon holds liberals up to their own 
standards in order to show them how they fall short. 
Civil rights activists faced a similar challenge; they had 
to recast the humanist ideals of liberalism in order to 
render them more comprehensive. Similarly, MacKin- 
non calls on us to recognize another unacknowledged 
instance of a failure to live up to liberalism’s promise— 
its abstract, humanist, universal promise. She does not 
hold out an alternative promise for women, although 
she does say that it is hard to imagine what social 
reality will look like once that promise is met (1989, 
249). This is not only because true sexual equality has 
never been tried, but also because true sexual equality 
will allow individuals—male and female—to shape the 
social landscape. 

It is not surprising that MacKinnon affirms the 
successes achieved by civil rights activists who worked 
within a liberal framework. “There came a point in 
Black people’s movement for equality in this country 
when slavery stopped being a question of how it could 
be justified and became a question of how it could be 
ended” (MacKinnon 1991a, 90). Just as arguments 
about racial equality shifted from explicit discrimina- 
tion to systemic or endemic racism, so does radical 
feminism turn our attention to how sexism can be 
embedded in the social fabric in ways that may not 
come to light without sensitizing liberalism to its own 
blind spots. According to MacKinnon, liberalism’s 
most dangerous blind spot in the application of its 
principles to women is the persistent presumption that 
they are meant to be sexually used, which undermines 
any individual woman’s chances at a fair trial, an equal 
opportunity, and so on. Her arguments are designed to 
get us to recognize this presumption as a barrier, but 
nowhere does she herself advance an argument that 
barriers as such are wrong. 

Instead, I submit, MacKinnon relies on liberal op- 
position to barriers to individual choice and self- 
determination. Taking as a given that harm to another 
justifies limitation to one’s freedom, she argues only 
that harm to women in the form of sexual objectifica- 
tion is such an instance. This requires more of a change 
in how we see women than in how we conceptualize the 
harm principle. The problem that MacKinnon detects 
arises from the fact that the sexual use and abuse of 
women will not register as harm if it is seen as only 
natural, if indeed “sex is what women are for” (Mac- 
Kinnon 1991b, 191, emphasis in original). The hierar- 
chical conception of gender operates in advance, 
undermining individual women’s accounts of their 


15 In making manifest the illusory quality of some of liberalism’s most 
fundamental assumptions, MacKinnon operates as critic in much the 
same way as Rousseau. She reveals the chains underneath the flowers 
(or hearts and flowers) with a view to moving from illusory to true 
liberty and equality. She even uses language that explicitly recalls the 
opening lines of On the Social Contract (Rousseau [1782] 1968), 
observing that “women are men’s equals, everywhere in chains” 
(1989, 104). We would hardly characterize Rousseau as extrinsic to 
the liberal tradition, which has been profoundly shaped by many of 
his insights (via Kant), even though Rousseau himself had republican 
aims. 

experiences of discrimination and violation by making 
such treatment seem appropriate to them. It certainly 
contributes to the view that pornography’s harm to 
women is less insidious than the effect of pornography 
regulation on pornographers. If we could reach an 
agreement about how best to delineate the harm to 
women caused by the eroticization of dominance and 
submission, there is no inherent reason it could not be 
rectified from within a liberal perspective.16 


CONCLUSION 


If we take liberalism to be “essentially a view about the 
bases of political and social criticism, about what 
counts when we evaluate alternative political and social 
institutions and practices” (Johnston 1994), we see that 
MacKinnon agrees with liberals that choice, consent, 
and protecting individuals from harm by others and 
from state power should “count” fundamentally. She 
also argues that women must “count” in how these 
abstract ideals are given life in a concrete social system. 
She demands that liberals consider soberly the ques- 
tion of whether women are indeed human beings, and 
she spells out the serious implications of the seemingly 
innocuous equivocation, “Yes, but...” This is not to 
reduce her theory to a demand for inclusion. But we 
cannot understand her work in complete opposition to 
liberal theory, and we cannot understand the ways in 
which liberalism may or may not be “feminist,” or 
make sense for women, until we understand the com- 
plexity of her work and appreciate the depth of her 
challenge to liberalism. 

My analysis shows that MacKinnon is part of an 
ongoing conversation. We must not ignore the extent 
to which she reformulates the liberal project, especially 
its idealism. Her work is not merely a mirror or a 
repetition of traditional liberal rhetoric and concepts; it 
is a rearticulation that shapes that which is articulated. 
She neither completely abandons traditional liberal 
theory nor mirrors it. In fact, her relationship to 
liberalism exhibits the same tensions we see in her 
characterization of feminism’s relationship to the sta- 
tus quo of male power—more than a mere reflection 
but not entirely separable either. Whether or not we 
agree with MacKinnon’s presentation of sexuality, her 
presentation of feminism’s method and dilemmas pro- 
vides a particularly fruitful example of both the possi- 
bilities and the difficulties of situated social criticism. 

The tensions that characterize MacKinnon’s rela- 
tionship to liberalism, as a harsh critic as well as a 
fervent adherent, arise from her attempts to compre- 
hend both the concrete (inequality) and abstract 
(equality) of women’s reality. Her vision requires dou- 
ble vision in order to appreciate the concreteness of 
women’s inequality and the “shadow world” of their 
equality. MacKinnon works to create the political 
conditions under which this reality, at present only a 


16 Scocca (1996) provides a detailed defense of the position that a 
ban on violent pornography is consistent with Mill’s harm principle. 
Easton (1994, Hxi) points out that the harm principle has been used 
by both defenders and critics of the right to pornography. 
shadow world, might flourish, and women might 
experience their full humanity. Such conditions do not 
involve an abandonment of liberal principles; on the 
contrary, those principles provide the basis for these 
conditions. For the most part MacKinnon’s rhetoric 
emphasizes the limits rather than the possibilities of 
liberalism, but she also offers modest endorsements; 
she acknowledges, for example, that “legal guarantees 
of equality in liberal regimes provide an opening” for 
social changes (1989, 242). A great deal of feminist 
input is necessary to pry open that opening, but even in 
MacKinnon’s estimation the potential is there. My 
reading of her work is intended to highlight that 
potential and the ways she makes use of it. 

What, then, are we to make of MacKinnon’s claim 
that her feminism is “unmodified” by existing theories? 
This claim applies more accurately to some aspects of 
her work than others. When MacKinnon is rethinking 
the social category “woman,” her inquiry is radically 
new, and she demands that we engage her arguments 
on their own terms without filtering them through 
predetermined categories of what we do and do not 
already “know” about sex. To questions of individual 
freedom, equality, and choice, however, she does not 
offer “unmodified” answers. Rather, she invokes a 
liberal conception of these ideals. Hers is certainly not 
a “liberal feminism” if that phrase is taken to mean the 
application of liberal principles to women, without 
rethinking the category “women.” But it is precisely on 
the basis of these liberal ideals that she argues such a 
rethinking is necessary. 

To conclude either that MacKinnon should be 
lumped in with liberalism or that she abandons it is to 
oversimplify not only her theory but also liberalism. 
Too often the feminist critique turns liberalism into an 
overly static concept. It is obvious that liberalism relies 
on some basic, abstract ontological and political claims, 
but it is also the case that these claims are and must be 
continually articulated with respect to the specific 
questions they are intended to address. This process of 
articulation produces a diverse range of possibilities 
and attaches different meanings and implications to 
those claims. That is why there are and have been many 
disputes about and within liberalism, disputes in which 
feminism has participated and continues to participate. 
When feminism acknowledges this participation rather 
than claim it is proceeding on a different plane alto- 
gether, it can help reconceptualize the plane and 
parameters of liberal theory. Liberal theorists are 
paying more attention than ever to issues of racial, 
sexual, and economic inequality and pondering 
whether liberal theory offers the resources to address 
these inequalities (see, e.g., MacLean and Mills 1983; 
Yack 1996). My analysis lends support to the notion 
that liberalism’s resources are still relevant to the issue 
of sexual inequality. 

MacKinnon can contribute to this ongoing 
reevaluation of liberalism, and once her salience is 
acknowledged, the possibility of a more synergistic relationship 
between feminism and liberalism must also be 
acknowledged, whether or not we accept MacKinnon’s 
specific prescriptions. The reopening of a dialogue 
between feminism and liberalism is necessary, according 
to Johnson (1994, 136): 


If contemporary feminist theory persuades itself that it can 
have no dialogue with the formulations of humanist ideals 
which have rooted themselves (albeit in an incomplete, 
flawed and fragile form) in modern social life, then, it 
inevitably severs itself from a reflective, interpretative 
relationship with the sense of frustrated potentials and 
dissatisfied cultural needs which have, from the first, 
galvanized the modern women’s movement. 


A real dialogue consists of contributions from both 
sides. Feminist theory must recognize the contribution 
of liberal theory to its own enterprise, beyond liberal- 
ism’s historical role. 

I do not suggest that liberal assumptions, ideals, and 
institutions are not contestable and contested; they are 
and should be. I do not claim there is anything in 
liberalism that is “off limits” for feminist interrogation; 
but when the interrogation yields results that partially 
affirm the framework being challenged, these results 
need to be acknowledged in order that the strengths 
and weaknesses of both the critique and the framework 
can be fully appreciated. Obviously, there are risks in 
invoking liberalism’s abstractions, which, as MacKin- 
non shows, can be as much part of the problem as part 
of the solution. But there are risks in disavowing them 
as well. The extreme view that feminism and liberalism 
are incompatible drives an unnecessary wedge in what 
otherwise could be a productive alliance. 


17 For examples of the kind of feminist analysis that proceeds along 
these lines, see Allen (1988) on the issue of privacy and Kiss (1997) 
on the issue of rights. 
[...] while not liberalism in denial or disguise, is engaged in 
dialogue with liberalism (as well as with other theoretical traditions). 


One writes for a reader like Denise Schaeffer. Her 
analysis of my work (Schaeffer 2001) impressively 
integrates its epistemic, legal, political, and social 
dimensions, avoiding common misreadings. Her 
engagement is serious, careful, supple, lucid, and 
respectful of the text—largely an exception and a 
relief, as well as a real contribution. If her article is 
“part of an ongoing conversation” (p. 706) with 
liberalism that she sees taking place in my theory, it is 
a conversation well worth having. 

I criticize the liberal tradition1 for its methodological 
idealism, meaning the extent to which it interprets 
material reality as driven by ideas and morality to the 
neglect of the reverse; for its individualism, meaning its 
failure to grasp systematically the group-based 
determinants of inequality; and for its relative blindness to 
organized power in diverse social forms, so that it often 
effectively sides with dominance. These tendencies are 
traced from epistemology to policy on the substantive 
ground of the status and treatment of women relative 
to men. Any attempt to reinvent me as a born-again 
liberal must contend with this, as Schaeffer, who 
describes much of my analysis unusually well, does with 
only varying success. 

Schaeffer’s project in this article might be variously 
reformulated as to show that liberalism can respond to 
feminist criticism and stay liberal, that liberalism is 
receptive to feminism, or that liberalism would be all 
feminism needed if it was correctly applied to women. 
Certainly, liberalism can be feminist and still be liberal. 
The question is whether that is enough for women. 
Admittedly, John Stuart Mill’s ([1869] 1975) stunningly 
insightful and prescient analysis of women’s roles, in a 
work written with Harriet Taylor that I analyze in some 
depth, has more to offer women than many works that 
purport feminism today. But his work is the only 
classical example drawn on by Schaeffer to any extent. 
Certainly too, some contemporary liberal scholars, 
notably Martha Nussbaum (1999), Steven Schulhofer 
(1998), and Cass Sunstein (1993), contend productively 
with feminist concerns on liberal terrain. But Schaeffer 
does not ask why liberalism as a whole, long ruling 
ideology, needed feminism to notice the humanity of 
women in the first place, and why it has yet to face 
either the facts or the implications of women’s material 
inequality as a group, has not controlled male violence 
societywide, and has not equalized the status of women 
relative to men. If liberalism “inherently” can meet 
feminism’s challenges, having had the chance for some 
time, why hasn’t it? 


Catharine A. MacKinnon is Elizabeth A. Long Professor of Law, 
University of Michigan Law School, Ann Arbor, MI 48109-1215. 


1 Although Schaeffer clearly comprehends that liberalism is a 
classical tradition that underlies much of today’s liberalism and 
conservatism, given the falsehood widely by others concerning my 
work that it is allied with the Right, it probably bears repeating that 
my critique of liberalism is not a defense of conservatism. 

According to Professor Schaeffer, my work “rest[s] 
on liberal underpinnings” (p. 699); specifically, in her 
view, I “share[] the liberal preoccupation with choice 
and consent” (p. 703).* Actually, I do not use or 
theorize either choice or consent as my own—as I do 
equality, for example. My work on consent is devoted 
to critique, to showing that the liberal tradition has not 
meant what it says it means by these terms or delivered 
on its promise where women and sex are concerned. 
While liberal theory uses consent as if it means free- 
dom—to refer to unburdened, open, and unforced 
selection—it often applies the term in situations that 
better demonstrate acquiescence in inequality and 
acceptance of what one has little realistic possibility of 
refusing. Calling inequality freedom promotes inequal- 
ity. To say that my “overriding concern [is] with 
women’s capacity for choice and self-determination” 
(p. 704) is to proceed as if the terms mean what they 
pretend to mean, relabeling in liberal terms the sys- 
temic sex inequality that I criticize in both liberalism 
and the legal and social order to which it is integral—a 
reality that liberalism itself has gone far to rationalize, 
cover up, and legitimate as consensual and chosen. 

Schaeffer at points clearly grasps that I criticize 
liberalism for failing to live up to its own standards. At 
other points she claims that its terms are my terms. To 
point out that liberalism claims to be animated by 
concerns that, when applied, often do not mean in 
reality what many of its well-intentioned practitioners 
purport to mean by them in theory, is not to accept its 
terms as one’s own. My point has been that the social 
conditions under which women could have what liber- 
alism claims we have now, under which its ideas might 
work and its terms might mean what they are supposed 
to mean, would be conditions of sex equality, a goal 
that liberalism also claims to embrace but has done as 
much as any philosophy to preclude, while purporting 
the contrary. I think we are long past the point where 
it makes sense to argue that liberal theory will produce 
sex equality on its own. Whether liberalism would 
recognize its own concepts if they meant in reality what 
they are supposed to mean in theory is unclear, since 
they would no longer support power as currently 
organized. The peculiar genius of liberalism has been 
to seem to accommodate fundamental challenges 


*Ed. Note: The author uses empty brackets ([]) to indicate omitted 
letters in her quotes from Schaeffer's article. 

without fundamentally changing. Schaeffer's work provides 
a kind of illustration. 

In this vein, Professor Schaeffer mistakes my “wom- 
en” for liberalism’s “individuals.” To empower women 
to bring civil claims for sex discrimination, as my work 
on sexual harassment and pornography does, is not to 
treat them as liberalism’s “individuals”— quite the 
contrary. Both sexual harassment and pornography are 
sex discrimination claims; sex discrimination is a 
group-based claim. Women who sue one at a time for sex 
discrimination are suing for harm to them as women, not 
for harm to them as individuals: They are suing as 
members of their group, for injury to themselves in their 
capacity as group members, that is, on the basis of sex. No 
liberal view of the self is buried in this work.2 Deempha- 
sizing the prevalent liberal tendency to elide groups as 
constitutive of individuality, while missing the inherent 
group basis of equality claims (see p. 704) makes it 
possible to minimize the conflict between liberalism and 
my work on the crucial point of unit of analysis. 

No “general (and liberal) conception of the human” 
(p. 704, note 9)—that is, a conception of people 
without gender—animates my work. The view that I 
“posit[] a shared human core that [I] consider[] authen- 
tic rather than arbitrary” (p. 704) is ungrounded in my 
text.3 I do not, and do not need to, posit or assume 
“that there is something to ‘woman’ that is authentic 
and real” prior to or apart from society (p. 704). 
Rather, I argue that by existing social standards and in 
comparison with men—both, by the way, grounded 
and culturally specific standards—women, in their sta- 
tus as members of the group women, are violated and 
unequal. I analyze women and men as socially and 
politically constituted throughout, an approach that 
neither reveals nor presupposes “authentic” or “arbi- 
trary” beings but rather exposes thoroughly socially 
contingent ones. The attempt to shoehorn a feminist 
analysis into a liberal one fails here. 

Liberals often seem unable to interpret the world in 
other than liberal terms. Schaeffer posits that I “seek[] 
recourse to a position beyond power and history from 
which social reality can be evaluated” (p. 704). If I were 
a liberal, saying what liberals translate me to be saying, 
this is what I would be doing. But it is not what I do. My 
analysis is solidly and explicitly grounded throughout in 
the concrete specific substance of women’s experience, 
socially located in time and place, a material ground 
that liberal theorists often dismiss as just the particu- 
lars or just the examples or just the applications. But it 
is the relation between reality and theory that feminism 
changes—a methodology liberals often miss because it 
is so profoundly nonliberal.4 Moreover, my critique of 


2 Actually, contrary to Schaeffer’s assertion (p. 706), much of the 
Black civil rights movement's legal analysis was profoundly 
nonliberal in the same way, although the compromises and tortured 
translations into which it has been forced have largely vitiated that 
foundation or forced it into subtext, much to the detriment of the 
effective use of law for racial equality. Schaeffer also seems unaware, 
or not to have systematically considered, that some lawyers and legal 
activists are or can be Marxists, or otherwise not liberals, including in 
their practice of law. 

3 For my view on this subject, see MacKinnon 1998, 171. 

4 For further discussion, see MacKinnon 2000. 
objectivity presents a systematic critique of proceeding 
any other way. There is no “abstract vantage point” (p. 
705) in my work, nor does Schaeffer indicate or docu- 
ment where one is to be found. She merely posits it. It 
is also to impose a liberal framework where none exists 
to assume that because I assert that some claims about 
women’s reality are factually wrong, I must be “appeal- 
[ing] to some ‘objective’ standard” (p. 705). Because 
liberalism has not yet theorized the factual outside 
objectivity does not mean that I don’t. 

To say that I do not “contextualize or historicize the 
ideals to which [I] appeal[]” (p. 705) is therefore twice 
wrong. My analysis is thoroughly contextual, grounded, 
and substantive, and I do not appeal to ideals as such. 
Liberals tend to assume that a critical analysis has a 
moral or ideal basis and aims at a moral goal because 
their analysis does. Schaeffer is thus correct that I do 
not say why oppression and subordination and inequal- 
ity are wrong (p. 706), but this is because my analysis is 
moral or ideal in neither basis nor purpose. It shows 
that women are a political group—oppressed, subor- 
dinated, and unequal—and explores the contours and 
implications of that reality for theory and politics and 
law. That reality established, anyone is welcome to 
defend or contest it. 

While Schaeffer’s attempt to reclaim my work for 
liberalism is thus at key points overdrawn, her own 
work is part of ongoing efforts to reclaim liberalism as 
a tool of change for women. But it needs to go farther. 
If Schaeffer is correct that there is “no inherent reason”
(p. 706) liberalism cannot end the harm done to
women by the erotization of dominance and subordi- 
nation, she should explain why it has yet to do so. In 
reality, liberalism’s ideology of consent and choice has 
made it impossible effectively to stop the pornography 
industry, even as some individual liberals and conser- 
vatives—both heirs to the liberal tradition—have op- 
posed it because of the harm it does. Professor Schaef- 
fer contends that seeing the harm in sexual 
objectification requires a change in how women, not 
harm, are seen (p. 706), but she does not explain why
the existing liberal concept of harm has so far been 
incapable of seeing women as harmed when they are 
sexually objectified. If Mill’s harm principle exemplifies 
liberalism, as it does for Schaeffer, and this liberalism is 
all women need, then it is time for liberals to start 
applying that principle when it counts, including on the 
pornography issue. I would gladly cite John Stuart Mill 
for opposition to pornography based on its harm. But 
Schaeffer's soothing and uncharacteristically oblique 
reassurances that readers can take my work seriously in 
the abstract but demur on facing or doing anything 
about its tougher substantive realities—at least this is 
how I read those passages⁵—leave me unsettled as to
the benefits for the most violated women of the liberal- 
feminist “alliance” (p. 707) she urges. 


5 “Whether or not we agree with MacKinnon’s presentation of
sexuality...” (p. 706); “whether or not we accept MacKinnon’s
specific prescription...” (p. 707).

An oddly static notion of liberalism pervades Schaeffer’s
analysis, even as her work itself—with others, such
as Zillah Eisenstein’s tellingly titled The Radical Future
of Liberal Feminism (1981)—illustrates its potential
dynamism. In an analysis of points of convergence that 
has many virtues, Schaeffer may underestimate both 
liberalism and women. She seems not to have imagined 
that feminism may be changing liberalism fundamen- 
tally and against its grain, rather than being subsumed 
under it or having been there in it all along. 

Political Theory and Feminist Social Criticism (Contemporary
Political Theory). By Brooke A. Ackerly. New York:
Cambridge University Press, 2000. 234p. $54.95 cloth, 
$19.95 paper. 

Iris Marion Young, University of Chicago

Brooke Ackerly theorizes from the ground up. She has
observed the discussions, critical practices, and organizing
efforts of grassroots groups of poor women in less developed
countries, particularly a local chapter of Save the Children
led by women in Tanjail, Bangladesh, and the Self-Employed
Women’s Association of Gujarat, India. Ackerly gives a
theoretical account of the aims and methods of social criticism
that learns from the circumstances and virtues of these
and other women’s organizations, is generalizable to other
contexts of criticism, and can critically evaluate their practices.
The originality of this book rests in situating a theory of
social criticism in the context of the practice and i
of activist women in some of the poorest places in the world.
She brings important insights to critical social theory in
general.

Ackerly aims to synthesize this perspective with some 
major contemporary Western political theories of democracy 
and social criticism, including those of Michael Walzer, 
Martha Nussbaum, and several theorists of deliberative de- 
mocracy. Each of these contributes to a theory of social 
criticism adequate to the practical requirements of these 
women’s organizations, but all fall short taken separately. 
From the critical synthesis of these theories with the theo- 
retical writings and discursive practices of those she terms 
Third World feminists, Ackerly constructs what she claims is 
a more adequate account of social criticism. 

The category “Third World feminists” includes quite a 
diversity of people. Most of the grassroots activists Ackerly 
identifies are located in the global South, but many of the 
theorists she draws on are North American, such as Kimberly 
Crenshaw, Jane Mansbridge, Elizabeth Spelman, Bell Hooks, 
Patricia Hill Collins, and Susan Bordo. Although the cate- 
gory seems to distinguish a particular form and local context 
of feminist, the definition of Third World feminism comes to 
a universalist commitment to justice and democracy. “Third 
World feminists advocate scrutinizing and challenging exist- 
ing values, practices and norms based neither on the roles 
that the societies have identified for women, nor on the roles 
that societies has identified for men, but rather based on 
ongoing opposition to harmful inequalities in women’s life be 
they a function of sex or other categorization” (p. 57). When 
so broadly defined, the category is in danger of losing any 
ability to distinguish one perspective from others; it also 
seems to lose reference to the particular context and experi- 
ences of women in the developing world. 

From Michael Walzer’s theory of interpretation as social 
criticism Ackerly derives the position that social criticism 
should proceed from within the context of the society it 
criticizes, with an insider’s knowledge of local meanings and 
relations. As do some critics of Walzer, Ackerly finds that his 
theory is not well enough protected from relativism and that 
his appeal to shared understandings fails sufficiently to 
acknowledge social difference. Her resistance to the unity of 
the idea of shared understandings is appropriate, but Ackerly 
does not sufficiently recognize Walzer’s view that societies 
contain a distinction between ideals and actions that opens 
the distance needed for criticism. 

Ackerly finds in Martha Nussbaum’s statement of universal
capability values a better vehicle for taking critical distance 
from local practice, although she reformulates some of these 
values. Nussbaum’s theory falls short, according to Ackerly, 
in provision for democratic consultation, especially with the 
most oppressed segments of the society being criticized. 
Nussbaum would contest this judgment, but Ackerly is correct
to identify a kind of monologism in the method of social
criticism that lists universal human values, which we then
apply to evaluate local situations. 

Ackerly looks to theories of deliberative democracy for a 
more dialogical method, but she finds that these set condi- 
tions of deliberation so ideal that they cannot guide discur- 
sive practices under conditions of structural inequality. In 
place of a theory that presumes equal relations among 
dispassionate deliberators, Ackerly offers an account of de- 
liberative inquiry as an exploratory, educative, expressive, 
and confidence-building process whereby members of grass- 
roots organizations learn together to formulate criticisms of 
the dominative processes that marginalize them. 

In later chapters of the book, the critical processes about 
which Ackerly theorizes are illustrated with extended ac- 
counts of recent international feminist debates. Most inter- 
esting is the analysis of the transnational feminist movement 
to make women’s rights recognized as human rights. Feminist 
criticisms of the exclusion of issues such as violence against 
women from mainstream human rights discourse not only 
succeeded in shifting that discourse but also emboldened 
local women’s organizations in many parts of the world and 
helped create a strong transnational feminist network. De- 
bates within the feminist human rights movement emerged, 
however, over the extent to which even this critical stance 
capitulated too much to an antidiscrimination model of 
human rights that deemphasizes social and economic rights. 
With this analysis of the feminist human rights debate 
Ackerly intends, among other things, to illustrate an aspect of 
Third World feminist social criticism that she finds underem- 
phasized in theories of social criticism, namely, that a process 
of social criticism should have reflective self-criticism built 
into it. 

Ackerly claims to offer a theory of Third World feminist 
social criticism that incorporates the strengths of other 
important theories but avoids their weaknesses. As a con- 
structive project her account is plausible and coherent, but 
we might want to know more about how the new theory 
should be tested. Ackerly applies the theory along the lines I 
have indicated, but she could offer more in the way of 
possible objections to her account and directions for answer- 
ing them. 

Ackerly may paint too optimistic a picture of the ability of 
local women’s organizations to forge networks, learn through 
discussion, and develop skills of critical self-reflection. The 
book aims to speak from the perspective of poor women, but 
it reflects little on how conditions of poverty and the limited 
resources of organizations, as well as the situation of depen- 
dence in which many of these women and organizations often 
find themselves, can stand as powerful impediments to 
self-reflective discussion within a group and the collective 
ability publicly to criticize powerful people and institutions.

This effort to fuse political theory with observation of and 
reflection on grassroots practice in the global South should 
be emulated and expanded. Some of the most important 
claims of justice today come from vital social movements in 
that part of the world, and it is high time that political 
theorists interested in issues of democratic process and social 
criticism incorporate the experience of these movements into
their theoretical world. 

On the Socratic Education: An Introduction to the Shorter
Platonic Dialogues. By Christopher Bruell. Lanham, MD:
Rowman & Littlefield. 240p. $35.00.


Peter C. Emberley, Carleton University 


Christopher Bruell offers a series of interconnected commen- 
taries on sixteen of Plato’s shorter dialogues. Selection was 
based on the criterion of highlighting Socrates’ art of educa- 
tion. Modestly ing that his commentaries are an 
“introduction” to the dialogues, and that they are offered as 
a contribution to contemporary debates on liberal education, 
Bruell reveals himself a subtle exegete and makes a highly 
important contribution to classical scholarship. Attentive to 
every nuance and twist of a dialogue, he displays a discern- 
ment and irony developed only after years of painstaking 
reading and rereading of the dialogues. That Bruell is himself 
an acclaimed teacher, and that his own students have gone on 
to become recognized teachers, offers assurance that he is 
deeply conscious of the Socratic imperative to exhibit conso- 
nance between speeches and deeds. In the preface, Bruell 
proposes that a recovery of the wisdom of these dialogues is 
necessary to answer the question of what makes a life most 
worth living. 

Socrates’ trial and execution for impiety and corruption of 
the young induced Plato to portray him not as a corrupter, 
but as one who truly cared about the young Athenians who 
gravitated toward him. Plato invariably lingers on the effort 
Socrates makes to give the decency chosen by decent people 
its due tribute. This did not mean, however, that Socrates is 
not also portrayed as unsure whether virtue is teachable, 
whether his students have any capacity for being taught, and 
whether anything true can be said about the divine things 
toward which piety is directed. 

In the context of today’s preoccupation with transparency 
and authenticity, and the reduction of liberal education to 
skills or moralistic platitudes, Socratic education must inev- 
itably appear manipulative and presumptuous. Bruell, in 
contrast, defends Socrates’ soulcraft and offers a comprehen- 
sive appreciation of why the tactic of adapting speeches to 
souls is so necessary.

In a piquant aside on the viability of our own political 
regime, which (arguably) takes no responsibility for inculcat- 
ing substantive beliefs, Bruell asks whether it should be a 
matter of concern to us that Socrates, who himself called 
many of his fellow citizens’ beliefs about virtue into question, 
would not have been executed in our own times. Naively 
assuming a harmony of human desires, virtues, and laws, it 
may be that we are failing today to see the dangers unre- 
strained ambitions and longings, or timidity and indifference, 
can precipitate. Socratic education, Bruell demonstrates, 
takes these dangers seriously. It walks the thin line of 
recognizing the need to produce critical awareness of the 
laws without creating contempt for the substantive beliefs 
they embody. 

Socrates’ “errors,” overstatements, use of shame or ambi- 
tion, techniques of awakening consciousness of deficiency in 
those who confidently believe themselves to be complete—all 
are part of his diagnostic art to read and, where necessary, 
cure a soul. Nowhere is such cunning more needed than when
the interlocutor has far-ranging political or philosophical 
ambitions and where, as Bruell’s deft analysis of the Alcibia- 
des shows, despite beauty and most of life’s advantages, 
Alcibiades’ genuine capacity to discharge the offices wisely is 
absent. Whether it is taming Theages, whose passions run to 
love of tyranny, or reshaping the noble exertions of Crito, 
whose friendship leaps too precipitously over lawfulness, 
Socrates weaves the woof or warp of his interlocutors’ diverse
souls into a unified web. The range of interlocutors in Plato’s
dialogues comprise together a universe of the manifold 
manifestation of wisdom in different character, and a chal- 
lenge to that rare individual who can bring together political 
rule and philosophy. Plato’s depiction of Socratic education 
serves as a sustained defense of Socrates against the Athe- 
nian charge that he was corrupting the young. 

The value of Bruell’s portrait of Socrates extends beyond 
classical Athens and sheds light on contemporary platitudes 
that range from cultural relativism, legal positivism, and 
psychological reductionism to how to respond to the seduc- 
tive appeal of sham teachers and the prolixity of those 
academics who lack the self-conscious, moderating reluc- 
tance to teach and open themselves to learning something 
from their students. 

In the style of philosophical commentary, Bruell does not 
eschew any grand theories, but he allows each segment of a 
dialogue to unfold with just the mildest interpretive nudge. 
Nonetheless, certain larger themes do appear, and the author 
is quite insistent that sections of the book must be read in the 
right order. From the plethora of conclusions a reader is
invited to reach in piecing together Bruell’s own innuendos, 
admissions of perplexity, and ironic use of a weak argument, 
one in particular sticks out: the place of the divine in human 
affairs.


Bruell is persistently gnomic about those metaphysical 
ideas for which Plato is widely known: the theory of Forms, 
the recognition of the Good beyond being, and the immor- 
tality of the soul—experiences that other commentators 
argue lie at the heart of Socrates’ reform of piety. One is 
tempted to say that at times Bruell takes nearly a choirboy’s 
glee in exposing Socrates’ impiety, or lifting the veil from
piety, while being too reticent about genuine intimations of 
spiritual transcendence. In his commentary on the Second 
Alcibiades, Bruell focuses on how Alcibiades was persuaded 
not to pray to the gods lest his (unreflected) wishes be 
answered, but in the discussion of the Euthyphro he empha- 
sizes the line of inquiry that would have piety subsumed to an 
intelligible form, and even justice. Appealing to the Sympo- 
sium, Bruell suggests that for Socrates communication with 
the gods is limited to the daemonic and, more properly, to his
own eros. 

Piety does not appear as a significant virtue in Socratic 
education. Bruell’s Socrates, it seems, can tolerate no reli- 
gious transcendence not rendered into theological belief, or 
beliefs not ratified by reason. Although the proposal that 
Socrates ultimately abandons religious piety may be persua- 
sive to individuals for whom experience of the divine must be 
translated into rational propositions, an equally compelling 
interpretation is possible. It may be that Socrates’ qualifica- 
tions constitute not so much an argument against the reality 
and transcendence of the divine, as much as they are a strike 
against the human presumption regarding the extent to which 
the divine can be absorbed into the world. Despite, or 
perhaps because of, the human distance from the gods, 
Socrates may still be contending that experiential openness to 
the divine is essential to the human condition, and that order 
in a polity is based on harmony with the divine measure.

If this option needs still to be explored, the question as to 
whether Bruell’s book on Plato’s dialogues succeeds in its 
ambition of providing a substantive foundation for contem- 
porary debates in liberal education, not to say the answer to 
the question of what makes a life most worth living, may lead 
readers who do not yet feel that they have plumbed the 
source of the tensions and crises in liberal education and 
modernity to reach for an additional text—Saint Augustine’s 
Confessions. 

Culture, Citizenship, and Community: A Contextual Exploration
of Justice as Evenhandedness. By Joseph H. Carens.
Oxford: Oxford University Press, 2000. 284p. $65.00
cloth, $24.95 paper.

Citizenship in Diverse Societies. Edited by Will Kymlicka
and Wayne Norman. Oxford: Oxford University Press, 
2000. 444p. $65.00 cloth, $24.95 paper. 

Rethinking Multiculturalism: Cultural Diversity and Politi- 
cal Theory. By Bhikhu Parekh. Cambridge, MA: Harvard 
University Press, 2000. 379p. $35.00. 


David Miller, Nuffield College, Oxford 


Each of these books is a welcome and distinguished addition 
to the rapidly growing literature on multiculturalism and its 
implications for liberal political theory. The fact that virtually 
all liberal societies are now effectively multicultural raises two 
main problems. The first is the problem of justice, or how far 
liberal societies should deviate from standard principles of 
equal treatment of citizens in order to accommodate the 
demands of cultural minorities, such as granting them special 
rights of various kinds. The second is the problem of cohe- 
sion, or how far respect for cultural difference needs to be 
constrained by the need to maintain a cohesive political 
community, for example, by insisting on a common educa- 
tional curriculum as a preparation for citizenship. All three 
books grapple with these questions both at an abstract level 
and by engaging with a wide range of concrete policy issues 
that have arisen in liberal societies; in this respect each is a
fine example of engaged political theory that is not afraid to 
address controversial issues directly. The accompanying dan- 
ger is that an interest in finding pragmatic solutions may blur 
the underlying principles at stake. I shall ask how far this 
danger is realized in each case. 

Will Kymlicka’s theory of multicultural citizenship occu- 
pies a prominent place in this field and is discussed at some 
length by both Carens and Parekh. In Citizenship in Diverse 
Societies, Kymlicka is represented by a substantial introduc- 
tion (with Wayne Norman) that lays out very helpfully the 
main issues that arise when citizenship becomes differenti- 
ated by group. Kymlicka frequently has been criticized for 
relying on an overly simple classification of cultural minori- 
ties as either immigrant ethnic groups or minority nations, 
and here a more complex classificatory scheme is introduced. 
The editors also borrow Jacob Levy’s eightfold schema for 
classifying cultural rights. 

The remaining essays in the Kymlicka and Norman book 
are directed mainly at specific issues that arise for citizenship 
in culturally plural societies. Two excellent contributions by 
Eamonn Callan and Jeff Spinner-Halev examine education; 
they ask how far religious groups are entitled to insist on
separate schooling for their children and how far common
schooling can properly be required in the name of citizenship.
Democratic institutions are discussed by Jane Mansbridge,
who explores the benefits and costs of “descriptive”
representation (i.e., by someone who shares your characteristics)
for disadvantaged groups, and by Melissa Williams,
who looks at forms of democratic deliberation from the 
perspective of such groups. The legal recognition of religious 
groups is considered by Tariq Modood and Ayelet Shachar; 
the latter addresses in particular the effect of multicultural- 
ism on women, a theme pursued further by Sawitri Saharso. 
Denise Reaume and Pierre Coulombe tackle the problem of 
language rights in a Canadian context. Jacob Levy examines 
different ways of incorporating indigenous law into liberal 
legal systems, and John Borrows looks at forms of citizenship 
that may be appropriate for aboriginal peoples. Graham 
Smith and Rainer Baubock ask how far federal institutions 
can satisfy the demands of national minorities. A more
general article by Jeremy Waldron on civic responsibility 
appears to retreat a little from his earlier celebrations of 
cosmopolitan identity. Overall, this high-quality collection 
should be read by anyone interested in the issue of multicul- 
tural citizenship. 

Culture, Citizenship, and Community is at one level a sustained
attempt to grapple with some of the most difficult issues that 
have arisen in culturally plural societies, but at another level 
it presents and exemplifies a method of doing political theory 
that Carens labels “contextualist.” He claims that we run into 
serious difficulty if we try to resolve practical problems in 
particular societies by appeal to abstract principles alone; 
instead, we must learn a lot about local history and culture 
and must be prepared to find value in the institutional 
solutions that we find on the ground. Contextualism does not 
go all the way: Some principles, particularly regarding human 
rights, are cross-culturally valid.

The main conceptual tool that Carens employs to tackle 
multicultural questions is “justice as evenhandedness,” which 
he contrasts with the more familiar liberal idea of justice as 
neutrality. The latter counsels us to enact laws that apply 
equally to everyone and to ignore culture-based claims for 
special treatment. Justice as evenhandedness, by contrast, 
requires “a sensitive balancing of competing claims for 
recognition and support in matters of culture and identity” 
(p. 12). Sometimes such a balancing entails precisely a policy 
of neutrality, but more often it requires looking for solutions 
that give religious or other cultural groups at least part of 
what they demand. 

In practice, Carens balances interests: He asks how far 
existing law or policy frustrates the desires of a particular 
group and whether those desires correspond to a fundamen- 
tal interest of the members in question. One implication of 
this approach is that questions about the value of particular 
ways of life cannot be avoided. For example, in considering 
whether the Amish should be allowed to withdraw their 
children from high school, Carens argues that we should 
weigh the value of the Amish way of life against the consum- 
erist alternative that becomes less available without a high 
school education. 

In reading the book, which is consistently interesting and 
very well written, I found myself agreeing with most of
Carens’s practical judgments, but I became a little worried 
that the principles behind them remained so unelaborated. 
Because the case for a contextual approach is powerfully 
made and yields dividends in discussions of topics as wide- 
ranging as immigration in Quebec, Muslim religious prac- 
tices, aboriginal self-government, and democracy in Fiji, this 
worry should not be overstated. But it raises the question 
whether a liberal theory of multicultural citizenship is possi- 
ble. 

Among these three volumes, Bhikhu Parekh goes the 
farthest in distancing himself from liberalism. He claims to be 
developing a political theory of multiculturalism and insists 
that liberalism has to be seen as a culturally specific way of 
life rather than a neutral perspective from which the political 
claims of different ways of life can be judged. He writes on 
the basis of practical experience with Britain’s Commission 
for Racial Equality, and his judgments on concrete questions 
are always thoughtful and well informed. But, as with Carens, 
there is a question about the basis on which these judgments 
are made. Parekh values cultural diversity, respect for cul- 
tural difference, and dialogue between cultural majorities and 
minorities. These background values cannot take us very far, 
however, when we have to settle such practical issues as 
whether Sikhs should be exempt from safety-helmet laws, 
whether female circumcision always should be illegal, or
whether the right to free speech extends to cover Salman 
Rushdie’s right to publish The Satanic Verses. It is difficult to
avoid concluding that Parekh relies on liberal principles to
resolve these questions, even while castigating liberalism for
its partisanship. 

This tendency to give with one hand and take back with the 
other is the main flaw of Rethinking Multiculturalism. For
instance, Parekh is quick to expose the limitations and 
“liberal bias” of the UN Declaration of Human Rights, 
arguing that even values such as respect for human life and 
human dignity are culturally relative: “What constitutes 
humiliation or cruelty ... varies with cultures and cannot be
universally legislated” (p. 135). But in considering the case of 
China a few pages later, he is happy to condemn it for 
permitting, inter alia, “some of the worst forms of personal 
humiliation” and “disregard for human dignity.” More gen- 
erally, the intercultural dialogue on which Parekh relies to 
resolve the practical conflicts thrown up by multiculturalism 
can proceed only against the background of substantial 
agreement on basic questions of justice. There is more to be 
said in favor of John Rawls than Parekh allows in his brief 
discussion of that author. 

This weakness notwithstanding, there is much to admire in 
Parekh’s book, not least his willingness to unsettle, in the 
gentlest possible way, some of our complacent assumptions 
about issues such as religious dress, animal slaughter, and 
polygamy. It will no doubt provoke discussion—some of its 
main ideas have been forthrightly criticized in Brian Barry’s 
Culture and Equality (2001)—and, like the other two books, is
essential reading for students of multiculturalism. 


The Covenant Connection: From Federal Theology to Mod- 
ern Federalism. Edited by Daniel J. Elazar and John 
Kincaid. Lanham, MD: Lexington Books, 2000. 352p.
$75.00. 

Christopher M. Duncan, University of Dayton 

Daniel J. Elazar (1934-99) was a prolific scholar and is easily
the single most recognized name in the area of American
federalism studies. During his career, he authored or edited
more than 60 books, produced scores of other publications,
founded and edited the journal Publius, served as a presidential
appointee for three terms on the U.S. Advisory Commission
on Intergovernmental Relations, and committed himself
to professional and public service both in the United States
and Israel. Between 1995 and 1998, he produced the monumental
four-volume series on the covenant tradition in
politics: Covenant and Polity in Biblical Israel (1995), Covenant
and Commonwealth (1996), Covenant and Constitution
(1997), and Covenant and Civil Society (1998). He will forever
be known as the author of American Federalism: A View from
the States (1966), with its ubiquitous conceptualizations of
American political cultures, but it is his work on the linkages
between religious covenant thought and its sociopolitical
manifestations and extensions across political history that
would seem to be his own preferred scholarly legacy, if forced
to choose.

Oddly enough, although this volume was just published last 
year, the essays collected here by Elazar and Kincaid serve in 
the capacity of “prequel” to the work produced at the close of 
Elazar’s life and career, as well as to some of the very strong 
work of the other contributors. In 1975, Elazar established 
two workshops to explore the covenant tradition, one at 
Temple University in Philadelphia and the other at Bar-Ilan 
in Israel. The essays in this volume represent work done in 
the former setting and were to have been presented 15 years
ago but for various reasons were not. The larger goal of the
workshops themselves was to “restore the covenantal tradition
to the same status as that of the natural law tradition in
the study of political life and thought” (p. x).

The published work that followed these initial efforts over 
the last twenty years has certainly moved us closer to that 
goal, but the essays in this volume are important mostly for 
the promise they demonstrate of what was to come later. 
Although many of the contributions possess an integrity of 
their own, their value consists primarily as an introduction to
the subject(s) and should not be used by serious students 
except to augment or begin their reading of the later or 
coterminous work of many of the authors, such as J. Wayne 
Baker, Thomas O. Hueglin, Donald Lutz, Charles S. McCoy, 
and James W. Skillen, to name a few. 

Elazar conceptualizes the work at hand as a project of 
recovery and resuscitation of the relationship between the 
covenantal tradition and American federalism and constitu- 
tionalism. To accomplish this task, the authors explore both 
thematically and in loosely chronological order the political 
history, theory, and theology of early Calvinism across the 
Reformation to the Puritans—both English and colonial— 
and the relationship among their notions of a covenanted 
community, the social contract theorists, and American fed- 
eralism. The Covenant Connection falls prey along the way to 
many of the difficulties of edited volumes in general, such as 
rough transitions, a degree of repetition, unevenness among 
chapters, intermittent thinness and thickness regarding the 
overall argument, and occasionally eclectic decisions regard- 
ing inclusion, but its chief and ironically unavoidable weak- 
ness is that it could not benefit from the literature of the last 
twenty years, much of which was produced by the authors 
themselves. 

Others are better suited than I to judge the precision of the 
historical and theological exegesis regarding covenant theol- 
ogy and to comprehend the full meaning of passages like the 
following, from Charles Butler’s essay: “The Reformed Au-
ini ded like Calvin to the realist view of Adam’s
headship of the race—rejected Catharinus’s suggestion, but
as thought on the Covenant of Grace made the representa-
tive principle of covenanted federal headship more common
through Christ’s being so presented...” (p. 105). Neverthe-
less, the general thrust of the work is very persuasive. In vastly
simplified form, the argument rests on the following sort of 
progression. The Latin foedus, which serves as the root for 
the English federal, translated originally as covenant. Cove- 
nant discourse as opposed to, say, liberal or rights-based 
discourse in turn yielded what Lutz calls in its later American 
form communitarianism (p. 225). In its theological sense, a 
covenant was, according to Torrance, “a promise binding two
people or parties to love one another unconditionally” (p. 
146). 


Elazar explains the general linkage between that concept 
and political theory in the following manner: “Just as God 
limits the exercise of His omnipotence by entering into 
covenants with humanity, thereby endowing people with 
freedom, so too, does covenant limit the exercise of the 
boundless self, not only for the common good, but also 
individual freedom. Thus, while contracts tend to create 
relations of mutual self-advantage and limited liability, the 
thrust of covenant is toward the creation of communities or 
commonwealths animated by concern for the public good” 
(p. 6). Hence, the federal theology of the reformers eventu- 
ally comes to undergird and animate what might have been 
viewed as simply a pragmatic or structural arrangement 
called federalism in a way that not only reconnects it with a
richer and more profound history but also forces us to see the
seemingly secular political reformers of our own tradition, 
such as Locke, in a vastly different light. These facts alone 
make the book worth the effort. 

In the hands of these authors, federalism becomes some- 
thing of a normative imperative and a theoretical codification 
for a well-ordered state. By pushing the reader to pick up 
what are today often neglected thinkers from the early 
modern period—such as Heinrich Bullinger, Johannes Al- 
thusius, and Johannes Cocceius—or to revisit the place of 
John Knox in the historical scheme, Elazar and company do 
a great service to the history of political thought and Refor- 
mation theology as well. By demonstrating its extension over 
time to the colonial shores and later American political 
thought, they force us to reconsider the genealogy of Amer- 
ican political discourse and allow us to explore what is at least 
a complementary if not an alternative conceptualization of
the American political “soul.” Properly used, the reorienta-
tion intended here and developed in much greater detail in
later work provides a receptive reader with new tools for 
addressing the continuing problems of the modern liberal 
state concerning pluralism, tolerance, the relationship be- 
tween liberty and equality, the relationship between parts and 
wholes, and, perhaps most important, the fundamental na- 
ture and meaning of ci itself. Despite the fact that 
much of this gets One Doren SER eee save yi 
is no mean feat. 


Globalizing Democracy: Power, Legitimacy, and the Interpretation
of Democratic Ideas. By Katherine Fierlbeck.
Manchester and New York: Manchester University Press, 
1998. 216p. $69.95. 

Global Justice: Defending Cosmopolitanism. By Charles
Jones. Oxford: Oxford University Press, 1999. 249p. $65.00
cloth, $19.95 paper. 

Nomos XLI: Global Justice. Edited by Ian Shapiro and Lea 
Brilmayer. New York: New York University Press, 1999. 
222p. $50.00. 


Charles R. Beitz, Princeton University 


The monograph by Charles Jones is both less and more than 
its title suggests. It is less because his book is devoted almost 
exclusively to global distributive justice, a topic of great 
contemporary importance but one that hardly exhausts the 
subject of global justice. (Global political justice is a distinct 
topic, about which we have, if anything, even less clarity.) It
is more in that Jones’s ambitions go beyond the defense of 
cosmopolitanism promised in the subtitle. Hand in hand with 
the presentation of his own version of cosmopolitanism he 
offers a tour d'horizon of contemporary thought about inter- 
national distributive justice—indeed, his book is probably the 
most philosophically sophisticated such account in print. This 
combination of advocacy and critical survey results in a work 
that should interest not only scholars of the subject but also 
those seeking an intelligent introduction to it. 
Global Justice is organized around a distinction between
cosmopolitan and communitarian approaches to interna-
tional distributive justice. The first several chapters consist of
critical discussions of three different cosmopolitan theories— 
those based on utilitarianism, human rights, and the form of 
global Kantianism defended by Onora O'Neill in Faces of 
Hunger (1986) and in portions of Towards Justice and Virtue 
(1996). Jones’s own view is a form of human-rights-based 
cosmopolitanism. Its central idea will be familiar to readers 
of Henry Shue’s Basic Rights (rev. ed. 1996). Like Shue, Jones 
argues that human rights are protections of important human
interests, and among these rights, none is more important
than the right to subsistence. Accepting for the moment 
Jones’s conception of a human right, his argument that there 
are subsistence rights and that they are morally basic seems 
to me persuasive. 

It is one thing, however, to say why basic rights should 
matter to those whose rights they are and another to explain 
why those who have duties to contribute to their satisfaction 
should be moved to do so. This is because the considerations 
that serve to ground claims of right are typically recipient 
oriented: They have to do with the interests of the rights 
holder. It is frequently not obvious why anyone else should 
care about these considerations—at least to the extent nec- 
essary to motivate affirmative action to satisfy their demands. 
Jones replies to this question indirectly. He describes an 
allocation of both positive and negative duties associated with 
human rights, and he identifies and refutes several reasons 
someone who is subject to these duties might resist comply- 
ing. The argument is plausible as far as it goes, but some 
readers may think it does not go far enough. Skepticism that 
human rights can generate duties to act often arises from a 
denial that the duty holder stands in the kind of relationship 
to the rights holder that can explain why the duty holder 
should be moved by an appreciation of the rights holder’s 
situation. Jones’s defense of cosmopolitanism would be more 
powerful if it included an account of the moral importance of 
subsistence duties that connects more perspicuously with the 
perspective of the holders of these duties. 

Jones does address a variety of other doubts about cosmo- 
politan theories that have been expressed in the recent 
literature. These include views emphasizing the special 
claims of compatriots, the ethical significance of nationalism, 
the constraints resulting from recognition of cultural differ- 
ences, and the (allegedly) nonderivative moral importance of 
the state. The typology of anticosmopolitan positions is 
illuminating, and within it Jones finds a place for virtually all 
the leading anticosmopolitan writers of the last decade or so. 
Throughout, the critical discussion is thoughtful and pene- 
trating. Indeed, Jones’s formulation of views with which he 
disagrees is sometimes clearer than the original. 

Nomos XLI contains eight contributions, some of which 
originated as papers or comments presented at the August 
1996 annual meeting of the American Society for Political 
and Legal Philosophy. It is a mixed bag. The contributions by 
Brian Barry, Samuel Scheffler, Charles Jones, and Hillel 
Steiner are free-standing essays devoted to one or another 
aspect of the subject of global justice. (Jones’s is essentially 
the same as chapter 5 of his book.) Three are commentaries 
(two, by John Kane and Liam Murphy, respond to the 
Scheffler article, and the third, by Debra Satz, apparently 
originated as a commentary on Barry but now ranges much 
more broadly). The eighth item—an essay on political real- 
ism by Lea Brilmayer, a coeditor of the volume—is primarily 
metatheoretical: It includes a provocative comparison of the 
domestic and international realms but does not engage the 
normative problems taken up in the other articles. 

The contributions by Barry and Scheffler are the most 
substantial of the lot. Both consider the conflict between local 
loyalties and the demands of global justice, but they approach 
the subject from different directions. Barry is interested in 
reliance on nationalism (sometimes, as he points out, mistak- 
enly identified with statism) to justify resistance to require- 
ments of global justice, particularly those involving humani- 
tarian intervention and international redistribution. He 
distinguishes two varieties of nationalism: the “blood and 
soil” nationalism found in much of the real world and the 
“academic” nationalism of such writers as Michael Walzer
and David Miller. These views are more different than many
people recognize (including some proponents of academic
nationalism), but Barry holds that neither is compatible with
cosmopolitan principles. Because we have strong reasons to
accept these principles, whereas we have little reason to
regard the nation as an object of independent value, he 
argues that both kinds of nationalism must be rejected. 

At the end of the essay, Barry turns to a different kind of 
nationalism—“cosmopolitan” (or “civic”) nationalism, which 
he says is “essential to the operation of a liberal democratic 
polity” (p. 53) because it helps to motivate social coopera- 
tion. Unlike the blood and soil and academic varieties, civic 
nationalism cannot come into conflict with cosmopolitan 
principles because it is constrained by them from the outset. 
It is not clear that Barry really has in mind a form of 
nationalism at all. It seems more like liberal patriotism: a 
shared allegiance to a particular society—but not necessarily 
a national society—based on a principled approval of the 
society’s basic political values and an appreciation of the 
distinctive features of its social life that constitute its identity. 
Whatever we call it, this form of group loyalty is plainly 
important for the success of liberal polities; equally plainly, it 
is the only form that a cosmopolitan should accept. The 
arguments, both critical and constructive, are characteristi- 
cally clear and sharp and exhibit great good sense. 

Scheffler has a different interest. He notes that the conflict 
of global and sectional obligations often presents itself as a 
conflict between “general” responsibilities (those owed to 
everyone) and “special” responsibilities (those arising out of 
membership in various kinds of groups). When special re- 
sponsibilities require us to subordinate general responsibili- 
ties, this does not seem like selfishness but, rather, a recog- 
nition of the importance of the special relationships that 
constitute the group. Against this, Scheffler considers an 
objection (“the distributive objection”) that might be pressed 
by cosmopolitans. It holds that special responsibilities confer 
unwarranted advantages on those to whom they are owed. 
These persons have already benefited in some way from 
participation in the special relationship, and it is not right for 
them to benefit again, and at the expense of nonmembers 
who are needier, when others who are parties to the relation- 
ship honor their special responsibilities. To recognize special 
responsibilities, that is, is to acquiesce in a distinctive kind of 
unfairness. 

Having framed the objection, Scheffler considers several 
rejoinders to it. He believes that these rejoinders are fre- 
quently overreaching, but he concludes that we cannot dis- 
miss altogether the idea that there are such things as special 
responsibilities and that those may take priority over global 
or general ones. He argues that this possibility of conflict 
between the global and the sectional is inherent in the 
plurality of our ethical commitments. At the same time he 
believes, although strictly speaking he does not argue, that
the sectional has too often been allowed to eclipse the 
global—that the tendency has been to honor special respon- 
sibilities at the expense of global ones, even when there is no 
credible defense for doing so. 

It is not a criticism to observe that this analysis takes us 
only so far. As Scheffler points out, we need a better 
understanding of the constraints that special and general
responsibilities can plausibly be allowed to impose on each
other, but he does not propose such an understanding here. 
Accordingly, on the strength of this analysis, it is not possible 
to say with precision how much and in what ways the special 
responsibilities that we have reason to accept should con- 
strain the global responsibilities that cosmopolitan consider-
ations require us to recognize. It remains as a challenge for
the future to see how far this task can be carried out. 

Katherine Fierlbeck states that her book is about “the
consequences of the globalization of democratic norms” (p. 
7). Each chapter addresses a different area of controversy in 
the recent literatures of democratic theory and comparative 
political development. These involve the meaning and justi- 
fication of democracy, its relationship to liberal constitution- 
alism (on the one hand) and the aspirations of ethnic and 
cultural minorities (on the other), the relationship between 
democracy and the economy, and the significance of civil 
society for democratic stability. The reference to “globaliza- 
tion” in the title might therefore mislead. This is not a book 
about either globalization as a social process or the extension 
of democratic ideas to the global level; rather, it examines 
some normative and analytical issues that arise in the context 
of the spread of ostensibly democratic political forms beyond 
the Western industrial democracies. 

With the exception of the chapter that criticizes the 
multiculturalists and the conclusion, the book is mainly 
composed of critical, synthetic reviews of the relevant litera- 
tures. As far as I can judge, these discussions are reasonably 
complete in their coverage. For the most part, Fierlbeck’s 
own positions are plausible enough, particularly her criticism 
of the political temptation to stretch the definition of democ- 
racy beyond the point where it can serve any moral purpose. 
But the critical discussions are not very detailed—perhaps 
inevitably, as they range over large and diverse literatures— 
and readers who are strongly attracted to positions that the 
author rejects may not find their minds changed. Moreover, 
these discussions are not, except inferentially, elements in the 
progressive development of a single, unifying theory. This is 
too bad, because Fierlbeck’s insistence that we should hold to 
a well-defined and ethically significant conception of democ- 
racy for purposes of foreign and international policy is timely 
and well founded. One wishes that the contours of such a 
conception had been more clearly etched and its normative 
foundations more straightforwardly defended. 


Lives of the Psychics: The Shared Worlds of Science and 
Mysticism. By Fred M. Frohock. Chicago: University of
Chicago Press, 2000. 281p. $27.50. 

Millennial Visions: Essays on Twentieth-Century Millenari- 
anism. Edited by Martha F. Lee. Westport, CT: Praeger, 
2000. 214p. $65.00. 

Jodi Dean, Hobart and William Smith Colleges 

Mainstream political science, particularly in its rationalist
and pluralist modes, has relied on a narrative prominent in
social thought at least since the work of Max Weber. This
narrative reads modernity as a process of disenchantment.
The process involves a division of the world into differing
value spheres in the wake of the retreat of transcendental
ideas as well as a demystification that sets out procedures and
probabilities that determine both what is worth knowing and
the conditions of certainty. One oddity of this narrative of
disenchantment is its radical disconnection from the practices
and beliefs that continue to enchant the world, a point Jane
Bennett persuasively argues (The Enchantments of Contem-
porary Life: Crossings, Energetics, and Ethics, 2001). Not only
do religious and magical worldviews continue to provide
many people in ostensibly disenchanted societies with orien-
tation and meaning, not only are there multiple knowledge
communities, but also discourses that claim objective, scien-
tific status themselves rely on magical, spectral, and incanta-
tory supplements. Lives of the Psychics and Millennial Visions
bring to expression these supplements and are valuable
contributions to the work of reorienting the study and 
practice of politics. 

Both books are concerned with spiritual and supernatural 
answers to the question of what it means to be human. 
Although inquiries into the place of humans in what may be 
a multitude of universes as well as into the variety of 
experiences and realities in which we are situated have broad 
currency, those involved in the study of politics will be 
especially interested in these books because of their chal- 
lenge to the suppositions of rationality that too often deter- 
mine the boundaries of the political. That is to say, they 
remind us that today, in the highly integrated, mobile, and 
virtual societies of global capitalist technoculture, more 
pressing than the problem of competing conceptions of the 
good is the challenge of competing conceptions of the real. 

Frohock presents his study as an ethnography of psychic
and spiritual experiences. Relying primarily on interviews
with “believers” and “skeptics,” he considers near-death and
out-of-body experiences, experiments on extrasensory per-
ception or psi, therapeutic touch, and mysticism. A benefit of
this ethnographic mode is that it enables experiencers to 
speak for themselves. Frohock demonstrates thereby the 
value of testimonies to the supernatural, as they accommo- 
date a richer, fuller range of the experience and meaning of 
being human. Indeed, one strength of the book stems from 
the ways he draws out the pleasures of accounts of mystical 
experiences for those who have and hear them, pleasures that 
exceed the rationalist or skeptical attempt to measure their 
specific factual content according to positivist standards.

This does not mean that Frohock abstains from evaluating 
the facticity of these testimonies. Rather, he introduces 
plausible criteria for assessing their claims to empirical truth. 
He uses the pleasing metaphor of half-way houses along the 
not-all-that-slippery slope from hard science to the fantastic.
These half-way houses are “hybrid sciences,” “inquiries that 
examine a wide range of events and assertions with broader 
and less defined understandings of truth than found in 
experimental science” (pp. 106-7). They function according 
to four rules. First, lack of evidence is not enough to dismiss 
an experience. Second, the failure of a belief to pass the 
inference test of a particular human community is not 
sufficient for rejecting that belief. Third, less probable argu- 
ments do not refute more probable arguments. Fourth, 
mathematical laws are trumps: One should be skeptical of 
claims that violate basic mathematical principles. As Frohock 
explains, these rules are weak, designed primarily to avoid a 
too quick rejection of innovative or extraordinary beliefs. 

Frohock’s rules, as well as his argument more broadly, 
stem from the basic premise of the limits of human knowl- 
edge. That is to say, Frohock moves from the indisputable 
claim that much of what we know is limited by our senses and 
intellect to the idea that one should be open to the factual 
truth of supernatural experiences. “If we accept a larger 
universe beyond our comprehension, then it must follow that 
we cannot know what our tenure here, in the space-time 
domain that we know as human experience, means in any 
sense of the larger scheme of things” (p. 228). Although this 
argument is not clearly wrong, it is unsatisfying insofar as it 
simply inverts the terms of rationalist approaches to truth. As 
Frohock himself nicely explains, contemporary approaches to 
knowledge, in particular those concerned with decisions 
made under conditions of risk and uncertainty, accept the 
limits of thought. They fully admit that human cognition is 
faulty and incomplete, that the world is complicated, and so 
on. They then respond to these limiting conditions with strict 
evidentiary criteria. They say that since human understanding
is limited, knowledge begins with what we can know. Frohock
says that since human understanding is limited, knowledge 
begins with what we cannot know. 

Perhaps the unsatisfying character of this sort of stand-off 
indicates the limits of certain kinds of rationalist thought, the 
way such thought is rent by antagonisms for which it cannot 
account under its own terms. If I am right, then more fruitful 
approaches to the multiplicity of human knowledges are 
those that consider the mechanisms and techniques for 
distinguishing truth from falsity, the disciplinary investments 
that accompany a will to truth, and the fantasies that infuse 
discourses of truth. Frohock suggests as much when he 
gestures toward the importance of narratives, allegories, and 
parables as “instruments calibrated to interpret, to decipher, 
to render the mysterious intelligible without reducing it to 
human proportions” (p. 227). But he too quickly moves away 
from an analysis of the discourses of the mysterious and 
paranormal to his argument for their possible plausibility and 
their challenge to reality. 

The best contributions to Martha Lee’s Millennial Visions 
treat millenarianism as a discursive thematic that supple- 
ments larger regimes of power and knowledge. Rather than 
view millennial thinking as deviant or marginal to political 
institutions and practices, they analyze these institutions and 
practices for the ways that they manifest millennialist longing 
for and fears of the future. The first four of the eleven 
chapters were given as public lectures at the University of 
Windsor and provide general approaches to the millennium. 
Of these, the essay by Michael Barkun stands out for its 
analysis of the millennium as a concept open to interpreta- 
tion and deployment. As he makes clear, this openness makes 
the millennium itself a field of struggle over meaning, particularly
by such dominant institutions as the church, state, and
economy as they seek to thwart the more critical and 
potentially anarchic versions of millennial fervor. The contri- 
bution by Mark Kingwell similarly resists the urge to patholo- 
gize millennial emphases on transformation, but he goes 
farther than Barkun and acknowledges that the profound 
desires at the root of millennial expectation can fuel critical 
interrogation of the market, the state, the citizen, and the 
person. 

The remaining seven chapters in Lee’s book focus on 
specific manifestations of millennialism in domains that 
range from racial hatred, to the writing of Betty Friedan, to 
UFOs, to Quebec nationalism. This latter contribution is 
particularly interesting: Susan J. Palmer reads the popularity
of apocalyptic religious movements in the context of Que- 
bec’s “separatist aspirations and deliberate insularity” (p. 
175). As she makes clear, the problem of these groups’ 
relation to the Canadian government is primarily a problem 
of their relationship to nationalism and the Quebecois strug- 
gle over the proper meaning and practices of citizenship. Her 
analysis is particularly valuable because one of these groups, 
the Raelians, has received international attention for its 
efforts to clone a human being. In fact, as Palmer points 
out, the Raelians “have recently launched a company, 
CLONAID, that offers future cloning services to its inves- 
tors” (p. 185). 

Admittedly, there is something odd in reading about the 
millennium today. Y2K seems such old news that it is hard to 
recall what the fuss was about. Nevertheless, the strongest 
chapters in Lee’s book remind us why we should care: 
Millennial thinking continues to orient political and social 
formations in ways that often conflict with equity and social 
justice. To this extent, these chapters cohere nicely with work 
by Lee Quinby (most recently in Millennial Seduction: A 
Skeptic Confronts Apocalyptic Culture, 1999). For example,
Frederick Wall’s essay examines the ways that racial reduc-
tionism not only gives similar structure to the belief systems 
of black and white racialists but also leads each group to 
anti-Semitism, as Jews are the enemy named for conspiring to
weaken the pure race. Wall makes his argument through a 
close reading of the anti-Semitic tract The Turner Diaries 
(1980). He shows how its account of a struggle between 
whites and nonwhites has influenced some black rap music. 

Not all the contributions in the Lee volume attend to the
multiplicity of meanings of millennium or to the ways that 
sedimented apparatuses of power rely on millennialist sup- 
plements. To this extent, these chapters, it seems to me, 
remain in precisely that apocalyptic logic they seek to criti- 
cize, namely, a binary of insider and outsider, chosen and 
damned, knower and believer. In their very marginalization 
of millennial thinking, these essays reinforce it. Thus, 
Thomas Flanagan, in an essay that identifies millennial 
themes in political movements from the French Revolution, 
through Marxism, to eco-terrorism and radical feminism, 
attacks efforts to transform the human condition. His attack 
relies on an account of the natural family and the natural 
market. And it fails to acknowledge the already institution- 
alized practices of the production and transformation of 
human beings deeply embedded in sciences, states, and 
educational institutions. Similarly, Philip Lamy’s discussion 
of UFO belief collapses into one group UFO religious cults,
those who attempt to study UFOs in ways that they under- 
stand as scientific, and general belief in extraterrestrials. He 
then treats this group as marginal or subcultural, as if a 
majority of Americans did not believe in the reality of UFOs 
(which they do) and as if mainstream popular culture did not 
overflow with alien-related effluvia (which it does). 

In conclusion, political scientists will disagree with numer- 
ous aspects of Lives of the Psychics and Millennial Visions. But 
the disagreements could well be productive, engaging funda- 
mental questions of what it means to be human in a world
that continues to enchant and confound us. 


Welfare in the Kantian State. By Alexander Kaufman. Oxford:
Oxford University Press, 1999. 179p. $45.00.


Elisabeth Ellis, Texas A&M University 


Kant does make it hard for us. On the scope of legitimate 
state action, for example, he seems to make two contradictory 
claims: First, the state should care for society's disadvantaged 
members; second, the state may not engage in social welfare 
policies. In his most important political work, The Metaphysics
of Morals, Kant argues at one point that the state should
be allowed to “impose taxes on the people for its own 
preservation, such as taxes to support organizations providing 
for the poor” (Prussian Academy Edition, VI: 326). A few 
pages earlier, however, Kant writes: “The well-being of a 
state must not be understood as the welfare of its citizens and
their happiness” (ibid., VI: 318, emphasis in original). Else- 
where, he argues forcefully that one cannot justify state 
paternalism in terms of the welfare and happiness of the 
people. He complains about the “common mistake, when the 
principle of right is under discussion, of substituting the 
principle of happiness for it” (Theory and Practice, ibid., VIII:
301). In a note to The Conflict of the Faculties, Kant compares
subjects who accept paternalistic welfare policies with “well- 
fed” and “docile” sheep (ibid., VII: 87). It seems reasonable 
to conclude that, despite occasional support for state activism 
in the social welfare arena, Kant generally opposes such 
policies. According to this reading, citizens and their govern- 
ments ought to preserve justice, not welfare, and individual
freedom of choice, rather than collective material happiness.
Libertarian interpreters of Kant have drawn just this conclu- 
sion, defending a minimalist state with Kantian principles. 

As Alexander Kaufman correctly demonstrates, Kant does 
not oppose the welfare state. His apparent remarks to the 
contrary notwithstanding, a comprehensive reading of his 
political theory reveals that Kant objected specifically to state 
paternalism rather than to redistributive policies as such. The 
libertarian interpreters (Kaufman cites Friedrich Hayek, 
among others) are right in their view that, for Kant, justice 
means preserving individual freedom of choice as far as 
possible in civil society. A principled judge of society must ask 
whether individual citizens are treated as ends in themselves, 
not whether their happiness is maximized. The familiar 
Kantian epistemological strategies apply here: Justice cannot 
be determined on a material basis, such as happiness, but 
must refer to formal principles independent of empirical 
circumstance. So far, so good for the minimalist reading. 

Libertarian interpreters err, however, when they apply 
Kant’s exclusion of material ends to the state’s choice of 
means. True, oppressive systems of government may not be 
defended in material terms; one cannot argue, as the cam- 
eralists did before Kant, that the people’s welfare justifies 
paternalistic despotism (pp. 50-61). Yet, the policies chosen 
by citizens in a “rightful civil condition” may themselves aim 
to improve collective welfare. In fact, they ought to do so. 
Public policies “must conform with the universal end of the 
public [happiness], and to be in accord with this... is the 
proper task of politics” (Toward Perpetual Peace, Prussian 
Academy Edition, VIII: 386). Moreover, in the name of
securing substantive freedom for the people, the Kantian 
state may intervene to correct what Kaufman calls “inherent- 
ly coercive conditions” (p. 148). In only 162 pages of text, 
Kaufman refutes the libertarian interpretation, defends a
reading of Kant’s political works in which justice demands 
equal “access to the opportunity to develop one’s capacity for 
unconditioned purposiveness” (p. 153), and briefly illustrates 
some consequences for public policy. 

Kaufman makes his case against the minimalist interpre- 
tation of the Kantian state so persuasively that the reader 
may wonder about the strength of the opposition. Kaufman 
cites Wilhelm von Humboldt and Friedrich A. Hayek as 
representatives of the libertarian interpretation, including a 
few more recent commentators, such as Bruce Aune. By the 
1990s, however, this interpretation of Kant had already been
fairly well refuted in the literature. First, anyone who argues 
that no Kantian system of justice could legitimate policies 
that benefit the disadvantaged would have to contend with 
the enormous challenge presented by John Rawls in A Theory 
of Justice (1971), whose theory (with its difference principle) 
is explicitly “Kantian constructivist.” 

Second, Allen D. Rosen (Kant’s Theory of Justice, 1993, 
cited by Kaufman, pp. 27-8) covered much the same ground 
as Kaufman, with most of the same interlocutors and many 
similar conclusions. Rosen’s arguments against the libertar- 
ian interpretation of Kant’s opposition to paternalism are 
much easier to follow than Kaufman’s, although Rosen does 
not convey the complex structure of Kant’s actual system as 
well as Kaufman does. For example, Rosen (Kant’s Theory, p. 
206) defends Kantian welfare policies mainly on abstract 
principle: “No people could rationally agree to a constitution 
that failed to contain a provision guaranteeing [at least] the 
basic needs of all citizens. Constitutionally guaranteed rights 
to a minimal level of well-being are thus, on Kantian principles,
part of the structure of any just and rational civil
society.” These principles are grounded by analogy to the 
moral duty of beneficence. 

Kaufman discusses duties, too, but the bulk of his case for
a social welfare state rests on Kant’s political rather than 
purely ethical arguments. Briefly, according to Kaufman, in 
order to exercise free choice meaningfully, Kantian citizens 
must have access to opportunities to develop their capacities 
to choose. Substantive freedom depends on the satisfaction 
of some basic needs and on the equal distribution of oppor- 
tunities for self-development. “A capabilities distribuendum 
appears to offer the best currency for Kantian distributive 
justice, since such a distribuendum grounds the principle of 
distribution in the substantive basis for claims to equal 
political standing: equality of purposive agency” (p. 154). In 
Kaufman’s rich interpretation, then, state welfare policies are 
not merely permitted but justified as the means to realize the 
Kantian ideal of self-rule. An ancillary benefit of Kaufman’s 
subtler treatment of the question of a Kantian welfare state is 
the sense it makes of Kant’s notorious distinction between 
active and passive citizenship, which can now be read in the 
context of enabling equality of opportunity rather than as the 
classist, sexist, exclusionary argument it appears to be (pp. 
31-2). 

The apparent contradiction in Kant’s theory of the state is 
thus resolved. As Kaufman ably demonstrates, a well-consti- 
tuted state may engage in redistribution in order to realize 
the promise of substantive freedom for all citizens. Perhaps 
Kant did not make it so hard for us after all. The debate over 
a Kantian welfare state may well tell us more about our field’s 
interpretive shortcomings than about inconsistencies in the 
original work. 


Retrieving Political Emotion: Thumos, Aristotle, and Gender.
By Barbara Koziak. University Park: Pennsylvania
State University Press, 2000. 203p. $29.95. 

George Klosko, University of Virginia 

Koziak believes that the subject of emotion has been unduly
neglected by political philosophers and explores aspects of
the emotional within political life. In the liberal tradition,
political actors are conceptualized as motivated by narrowly
construed self-interest, which, as Albert Hirschman argues,
was originally supported for its ability to resist the promptings
of dangerous passions. Emotion has been discussed from
different perspectives by such important thinkers as Hobbes,
Spinoza, Hume, and Rousseau. But Koziak turns back to
Aristotle “because few canonical political philosophers and
even fewer contemporary political theorists make use of the
possibilities that the Aristotelian treatment of political emotion
offered” (p. 3).

The book undertakes three main tasks. Three chapters 
(3-5) are devoted to Aristotle’s account of the emotions, 
concentrating on his treatment of thumos. This psychic 
element, frequently translated as “spirit” or “spiritedness,” is 
familiar to students of political theory as the second or 
middle part of the tripartite soul in Plato’s Republic. Koziak 
believes that our understanding of thumos has been thrown 
off by Plato’s overly narrow, harsh treatment of the concept. 
To a large extent, her discussion of Aristotle is intended to 
correct this impression. 

Koziak notes that “no general treatment of the emotional 
has been written in the history of normative political theory” 
(p. 1). To help remedy this situation, she devotes chapter 2 to 
the treatment of thumos in Homer’s Iliad and Plato’s Repub- 
lic, in addition to the chapters on Aristotle. But her concerns 
are not entirely historical. Koziak believes that Aristotle’s 
account of the emotional can make an important contribution
to contemporary debates. The last chapter explores
applications and supplements Aristotle’s account with discussion
of recent feminist theory, especially the ethic of care and
its implications. In view of traditional portrayals of women as 
excessively emotional, Koziak claims a connection between 
the oppression of women and other marginalized groups and 
predominant views of emotion (p. 4). 

Proponents of the feminist ethic of care attempt to rees- 
tablish the emotions at the heart of moral theory. Koziak 
welcomes their contributions, although not uncritically. She 
claims that care feminists neglect the subject of political
emotion and fail to provide a sustained justification of the 
ethics care supports and the practices it recommends (pp. 
172-4). In the final analysis, she favors an approach that 
combines insights of Aristotle and feminist theory. 

In view of the book’s highly ambitious agenda, it is not 
surprising that its main contributions are more suggestions 
than established claims. The chapters on Aristotle present the
most sustained arguments. Koziak contends that Aristotle 
understood thumos in different senses. Although he did at 
times identify it with the kind of anger discussed by Plato, he 
also viewed it as a general emotional capacity, capable of 
being shaped in different ways by political institutions. Artis- 
tic media, especially tragedies, have important effects, as 
discussed in the Poetics. Extrapolating from Aristotle’s account,
Koziak criticizes contemporary society for neglecting
the role of political institutions in shaping citizens’ emotions. 

The discussion of both Homer and Plato is less satisfying.
Although Koziak presents an informative account of thumos 
in The Iliad, The Odyssey is not discussed, and it is not clear 
why she concentrates on thumos as opposed to Homer’s
more general treatment of the emotional. Similarly, discus- 
sion of Plato is more or less confined to Book IV of The 
Republic, but his treatment of the emotions elsewhere in The 
Republic and in other dialogues is essentially ignored, includ- 
ing the highly intellectualistic account of the emotions in the 
Laches, Charmides, and Protagoras. 

Koziak’s critique of liberal political theory is potentially 
interesting. Her depiction of the overly rationalistic, emotion- 
ally impoverished actors in liberal theory provides a kind of 
companion piece to the “disembodied selves” put forth in 
Michael Sandel’s famous critique (Liberalism and the Limits
of Justice, 1982). But rather than examine the works of such 
leading contemporary liberal thinkers as John Rawls, Ronald 
Dworkin, and Thomas Nagel, Koziak discusses 1995 debates 
in Congress surrounding welfare reform. She criticizes as- 
pects of different speakers’ emotional appeals—and to this 
subject devotes only three pages (pp. 155-7). This discussion 
is supplemented by three more pages on Stephen Holmes’s 
account of the suppression of emotion in theoretical models 
based on rational self-interest during the rise of liberal theory 
(Passions and Constraint, 1995, pp. 157-60). But more evi- 
dence and more nuanced discussion are clearly called for. 

Throughout the book, Koziak’s reliance on Aristotle is not 
adequately defended, especially her concern with his analysis 
of thumos. Once again, why concentrate on thumos instead 
of his theory of the emotions as a whole? This is not to 
question the value of Aristotle’s insights, but the proper 
standard here is comparative: Why focus on this particular 
theme in Aristotle, instead of the many far more recent 
treatments of the emotions? In chapter 1, Koziak surveys an 
entire literature by recent scholars who provide sophisticated 
accounts of the emotions, but their findings do not figure in 
her analysis. In a work on the emotions in political life, 
moreover, it is necessary to take into account the differences 
between the Greek polis and contemporary societies, a 
subject that receives little or no attention. 

In spite of these and other obvious problems, Koziak 
believes that things cannot be set right if we continue to
accept misguided contemporary accounts of ancient thumos: 
“If we cannot see how expansive thumos becomes with 
Aristotle, and how our classical heritage, despite the fame of 
its rationalism, houses a significant role for emotion, we will 
continue to be blind to the emotional dimensions of political 
life and their need to be normatively theorized” (p. 177).


Where We Live, Work, and Play: The Environmental Justice 
Movement and the Struggle for a New Environmentalism. 
By Patrick Novotny. Westport, CT: Praeger, 2000. 115p. 
$55.00. 


William Chaloupka, University of Montana 


Case studies of four groups working in the much discussed 
arena of “environmental justice” form the core of Where We 
Live, Work, and Play. The Gulf Coast Tenants Organization
(active in Louisiana, Mississippi, and Alabama), the SouthWest
Organizing Project (New Mexico), the Oil, Chemical
and Atomic Workers (OCAW) Local 4-620 (Louisiana), and
the Labor/Community Strategy Center (Los Angeles) are 
examined, mainly on the basis of how each has framed 
environmental questions in relationship to the groups’ other 
work. Novotny primarily studies the groups’ self-descriptions 
and organizing materials, and he traces the ways in which 
each moved to incorporate environmental issues into its 
original focus on class or civil rights concerns. In each case, 
this move “took place in the context of organizations that had 
already been working [in their] communities for years” (p. 
73). 

The Gulf Coast Tenants Organization expanded on work 
with public housing project residents, adding environmental 
concerns to the more familiar issues of racial equality and 
discrimination that it had emphasized for a decade. The 
SouthWest Organizing Project was founded twenty years ago 
to address a wide array of social and racial justice issues. 
OCAW Local 4-620 came to environmental concerns as an
outgrowth of its long lockout battle at a BASF chemical 
plant. The Labor/Community Strategy Center, which works 
with low-wage workers and on community concerns, moved 
into explicitly environmental organizing after a 1992 explo- 
sion at a local chemical plant. In each case, the book 
documents the ways the group frames environmental con- 
cerns and traces the transition that brought it into what now 
can be described as the environmental justice movement. 
Novotny mostly avoids speculating on why these groups now 
find the environment an attractive issue, other than to suggest 
that such framing expresses community concerns. 

The book emphasizes the framing conducted by these 
groups, which in part is of interest as “a part of the repertoire 
of mobilization strategies that are available to a movement” 
(p. 7). Novotny also suggests that this framing implies a 
position in the ongoing debate about the interconnection 
between language and nature; “the language used to refer to 
the environment is rich with cultural connotations, laden with 
the history of social relations and struggles” (p. 85). Going 
farther, the last two chapters acknowledge that the analysis 
raises intriguing questions about environmental justice and 
the environmental movement in general. All the groups 
began to emphasize the environment after working with their 
communities for years. Some of the groups initially expressed 
suspicion of the environmental movement, which they some- 
times criticized for having a “narrow” or “single-issue” focus 
(p. 77). “That these groups repeatedly used the term environment
in quotation marks reflected [their work] to create a
new definition of environment as distinct from that of
the ... environmental movement.”

This is a short book (94 pages including notes), but it raises
important questions. Environmental justice arises at a time 
when the nation’s largest environmental groups have experi- 
enced a decade of challenges, including community opposi- 
tion to the logging of public lands, for example, at the behest 
of antienvironmentalist groups in the “Wise Use” movement. 
Although Novotny does not directly address the NIMBY (not 
in my backyard) argument that has long been a part of the 
discussion of environmental politics, his focus on the phrase 
that provides the book’s title suggests that race and class
concerns may alter the way NIMBY is understood in discus- 
sions of environmental politics. 

Similarly, although the book does not directly suggest it,
the implication arises that mainstream environmental groups 
may enhance their legitimacy by explicitly altering their scope 
to embrace “issues of racism, poverty, and the legacy of 
injustice” (p. 81). Mainstream environmentalism “has been 
beset by an assumption of the environment as a concern for 
more well-to-do groups,” whereas environmental justice of- 
fers a corrective, “showing that groups from many different 
backgrounds have a stake in [protecting] the environment” 
(p. 93). 

Novotny’s work joins a growing genre of environmental 
justice books. It expands on case treatment in other volumes, 
including The Struggle for Ecological Democracy (1998), ed- 
ited by Daniel Faber, and David Schlosberg’s Environmental 
Justice and the New Pluralism (1999). Other recent studies,
such as Sylvia Noble Tesh’s Uncertain Hazards: Environmental
Activists and Scientific Proof (2000), Christopher H. Foreman’s
The Promise and Peril of Environmental Justice (1998),
and Schlosberg’s book take as their project the critical or 
theoretical consideration of the movement or its context. 
Novotny adds to that by gathering historical information that 
should assist in the ongoing investigation of a number of 
issues, including several mentioned or implied in this book. 


Power versus Liberty: Madison, Hamilton, Wilson, and 
Jefferson. By James H. Read. Charlottesville: University
Press of Virginia, 2000. 201p. $47.50 cloth, $16.50 paper.


Michael T. Gibbons, University of South Florida, Tampa 


The political thinking of four leading figures of the American 
founding can, James Read argues, best be understood in 
terms of “a four-way comparison structured by a set of 
interlocking themes and problems, all of which branch off 
from the basic question of how to reconcile the power of 
government with the liberty of citizens in a republican 
political order” (p. 4). Such an approach has several virtues, 
Read argues. First, it avoids reducing the debate among 
Madison et al. to what Read describes as ideological divisions 
(e.g., the republican-liberalism debate). Second, it empha- 
sizes the extent to which the participants in the debate shared 
a common conception of liberty that provided a common 
reference point for the disagreements that did exist among 
them. Third, it emphasizes the extent to which Madison, 
Hamilton, and Wilson, at least, broke with the traditional
Whig view that portrayed power and liberty as inherently at 
odds with each other (pp. 6, 8-10). Fourth, it has the 
practical value of reminding us that the issue of how much 
power ought to be vested in the national government is more 
complex than is sometimes commonly understood. This is not 
to say that Madison and his contemporaries can resolve our 
differences about power and liberty. But they do remind us 
that there is no single, simple answer to the question of
whether the relationship between power and liberty is a
zero-sum game (pp. 157-74). 

Read presents the respective views of Madison, Hamilton, 
Wilson, and Jefferson on power and liberty and relates these 
to their political thinking per se and to policy positions that 
each took during the decade or so immediately following the 
Constitutional Convention of 1787. Although the text is 
framed as a four-way conversation, the comparisons between 
Madison and Hamilton, on the one hand, and Wilson and 
Jefferson, on the other, are thrown into relief. Read critically 
addresses a number of conventional interpretations of each 
thinker and provides a corrective to those he believes either 
oversimplify or misconstrue the thinker in question as incon- 
sistent. 

For example, Read argues that Madison’s opposition to 
the increase in national power in 1791 in the form of 
Hamilton’s plan to establish a national bank was not incon- 
sistent with his earlier support for an increase in national 
power in the Convention of 1787, in which Madison himself 
favored such authority. The difference can be explained by the
fact that although Madison had favored the policy earlier, 
that authority was explicitly rejected by the convention as a 
whole. More important for Madison, the Constitution was 
ratified with the understanding that the power to charter a
bank was not among the powers being granted to the new 
federal authority. In effect, Madison changed his position 
about the bank because it was not what was agreed to by 
citizens participating in the ratification process (pp. 35-51). 
Therefore, to establish a bank would be a paradigm case of a 
violation of the liberty of citizens, whatever one’s personal 
preferences. Madison’s change of position actually reflects, 
Read argues, a principled position regarding liberty. 

Similarly, interpretations of Hamilton that portray him as 
a proponent of aristocracy and power and an unreconstructed
enemy of liberty and republicanism fail to take into
account his commitment to his own particular brand of 
republicanism and liberty. Hamilton, Read argues, tried to 
strike a balance between power and liberty “that sometimes 
entailed correction in the direction of power, sometimes in 
the direction of liberty.” Fear of the power of the states was
the primary driving force behind Hamilton’s efforts to 
strengthen national government. He insisted that the liberty 
of the citizen is not a function of the relative power of the 
state of which he is a member (p. 85). For Hamilton, as for 
Madison, the increase in the authority of the national gov- 
ernment might itself be an extension of the liberty of the 
people as a whole. 

Hamilton did not accept the idea that political initiative 
could spring from what he considered the myth of the 
sovereign people. Rather, the augmentation of power could 
only be ratified or rejected once the new form of power took 
shape and form (as in the case of the bank). From this 
perspective, when an increase in national power represented 
the embodiment of greater liberty, and if the power of the 
states hindered that (as was likely to be the case, according to 
Hamilton), then it follows that greater power of the states 
would also be a hindrance or constraint on public liberty and 
not, as some Jeffersonians would have it, a defense of liberty. 

Read concludes in the case of Madison and Hamilton that 
the problem posed by the relation between power and liberty 
is more complicated than simply seeing them as opposites. 
For Madison, “because threats to liberty are so numerous 
and proceed from so many different sources, preserving 
liberty sometimes will require more governmental power, 
sometimes less” (p. 51). Just as important, for Madison the 
question was not simply one of quantity but of boundaries 
and the genius or “sense of the community” (p. 51). Similarly, 
Read shows that Hamilton did not prefer power over liberty
but that ultimately his “nationalism was consistent with his 
commitment to public and private liberty” (p. 85). 

The views of Wilson and Jefferson are interpreted as 
seeing the relationship between power and liberty in much 
more simple terms. Read argues that Wilson believed there 
was a single, national citizenry that was capable of exercising 
its sovereign authority. The Constitution and the national 
government it authorized were expressions of that sovereignty.
The exercise of that sovereignty is at the same time
the embodiment of the liberty of citizens. Hence, Wilson 
“seems to have believed that the principle of the sovereignty 
of the people removed all serious conflict between the power 
of government and the liberty of citizens. This is merely the 
obverse of the equally simple and straightforward belief that 
the power of government and the liberty of citizens are 
eternally antagonistic; Wilson in effect substituted one simple 
proposition for another” (p. 115). In effect, Wilson ignored 
the possibility of a majoritarian threat to liberty. 

Jefferson’s simplicity is the obverse of what Read points to 
in Wilson, the clearest expression of which is perhaps found 
in the Kentucky Resolutions. Jefferson tended “to reduce all 
political divisions to contests between power and liberty” (p. 
121). He identified strong national government with power. 
He identified state governments with liberty. Power and 
liberty are inherently antithetical. “By radically simplifying 
the political alternatives at stake—strong national govern- 
ment equals antimajoritarian government equals unfree gov- 
ernment—Jefferson was able to avoid ever having to choose 
between the state and national versions of popular sover- 
eignty” (p. 123). 

One shortcoming of Read’s analysis is his treatment of the 
concept of public liberty, particularly as it relates to Wilson 
and Jefferson. Early on Read notes that in the eighteenth 
century the term liberty included both private and public 
liberty. The latter was defined as the right to participate in 
the public and political life of one’s community. Read points 
out that although there was a difference between private and 
public liberty, it did not reflect the modern tendency to 
construct a dichotomy between the two, as has been done 
with the notions of negative and positive liberty. Read is, of 
course, absolutely right in emphasizing the inclusion of public 
liberty under the rubric of liberty per se. Nonetheless, the 
importance of public liberty as a distinctive element of 
republicanism does not seem to have the presence one might 
expect. This is particularly important for Read’s treatment of 
Wilson and Jefferson. Both thinkers emphasize the centrality 
of public liberty and the public participation it required. If 
Read had paid more attention to that element in their 
arguments, he might not have interpreted their thinking as 
being as simple as he does. 

A related point is that Read’s treatment of Madison and 
Hamilton reflects their complexity and subtlety, whereas 
Wilson and Jefferson do not fare nearly as well. They are 
interpreted as offering relatively one-dimensional accounts of 
the political problems facing the new nation and politics and, 
not surprisingly, offering one-dimensional answers. This does 
not do justice to either man. Wilson may very well have been 
overly optimistic about the possibility of a national, demo- 
cratic political identity. Nonetheless, missing from Read’s 
account is any discussion of the idea in Wilson that the
primary purpose of a political order is the intellectual and 
moral development of citizens. Moreover, because such 
development presumes political participation, and participa- 
tion presumes freedom from want and need, it follows that a 
rough equality goes hand in hand with democracy and vice 
versa. 

Similarly, although Read makes occasional reference to
the different views of political economy between Hamilton 
and Jefferson, he does not take the opportunity to examine 
the fact that these differences entail radically different views 
of the good society. Hamilton seems to believe that a political 
economy built around what we would today call a national 
economic policy can provide increasing levels of economic 
growth. Hence, he seems to pin the idea of the good society 
on the idea of material well-being. Jefferson’s ideal, by
contrast, is that of political and community self-determina- 
tion through active, independent citizenship. By situating the 
views of Wilson and Jefferson on power and liberty within 
their larger political vision and by paying more attention to 
the nuances of public liberty, Read might have avoided 
portraying them as having overly simple solutions to one of 
the most important questions of political life. 

Read’s detailed reconstruction and analysis of the compet- 
ing accounts of the relationship between power and liberty in 
these four thinkers is a valuable contribution to the literature
on the founding period. He reminds us of the quality of 
thought on profound questions by those at the very center of 
political life of the time. Moreover, he brings to our attention 
the fact that for Madison, Hamilton, Wilson, and Jefferson 
everyday political life was defined in large part by the answers 
we provide to the fundamental questions of politics. 


Environmental Justice and the New Pluralism. By David
Schlosberg. New York: Oxford University Press, 1999. 
223p. $45.00. 


Wade Sikorski, Willard, MT 


According to David Schlosberg, the first thing to do is forget 
that there is any such thing as environmentalism, a single
philosophy shared by everyone who might be described as an 
environmentalist. Instead of a single theory, which might 
inform the environmental movement the way that Marxist 
theory informed the socialist movement, there is an almost 
unlimited variety of ways environmental advocates identify 
themselves. Schlosberg lists but a few of them: “Not only 
romantic preservationists, efficient conservationists, public- 
health advocates, and environmental illness victims, but also 
deep ecologists, greens, bioregionalists, animal liberationists, 
advocates of permaculture and organic agriculture, ecofemi- 
nists, religious evangelists, social ecologists, steady-state 
economists, neo-Malthusians, neo-Luddites, neo-Hobbesians,
ecological technology promoters, nature consumers,
indigenous rights activists, spiritualists, planners, conservation
biologists, environmental health professionals, environmental
justice advocates, environmental lawyers, Gaians,
ecosocialists, nature writers, worker-health advocates, eco- 
anarchist youth, and more” (p. 3). 

As anyone who has ever tried to organize a coalition of 
environmentalists knows, getting anything done is a lot like 
herding cats. The more you try to point everyone in the same 
direction, the more everyone heads off in different directions. 
According to Schlosberg, this diversity of environmental 
philosophies is not a cause for despair; it may suggest a
failure of leadership and lack of theoretical development, but 
it is an inescapable necessity based on different experiences 
of an industrialized environment. Following the thinking of 
Donna Haraway (Simians, Cyborgs, and Women: The Rein- 
vention of Nature, 1991) Schlosberg argues that knowledge 
about the environment is necessarily situated, limited by
place and time. As a result, no coalition of environmentalists 
can come to a single interpretation of the environment and
the harm being done to it because their experience is 
necessarily dispersed over a wide array of occupations, toxic 
exposures, disease experiences, personal histories, philosoph- 
ical and religious orientations, identity structures, and 
power relationships. However much some may aspire to an 
objective knowledge about the environment, appeals that 
transcend any single ideology or perspective are doomed to 
failure as a purely practical political matter because they 
simply will not reflect the different things people know about 
their lives. 

The task that Schlosberg sets for himself is to find a way for 
different kinds of people with different experiences and 
different knowledges to work together without giving up the 
energy that comes from being different—to make difference 
an advantage rather than a weakness. Classical pluralism, as 
put forward by Edward Banfield, Robert Dahl, Charles 
Lindblom, Nelson Polsby, and David Truman, although it 
ostensibly deals with difference, is miserably inadequate for 
this task. This model of pluralism is what the large environ- 
mental groups, such as the Natural Resources Defense 
Council, National Audubon Society, and the Environmental 
Defense Fund, have followed. These groups built huge 
organizations, centralized control in the staff, went to Wash- 
ington, lobbied Congress, and negotiated with their oppo- 
nents, the major polluting corporations, in an attempt to 
countervail their influence. For all their efforts and all the 
attention they have received, the Big Ten environmental 
groups have accomplished little. 

Schlosberg contends that a new kind of critical pluralism is 
more likely to be effective in protecting the environment and 
seeking environmental justice. Something like William Con- 
nolly’s (The Ethos of Pluralization, 1995) ethic of agonistic 
respect for difference will create the respect needed for 
environmentalists of all stripes to work together in rhizomatic 
pluralities, networks of mutual aid. Instead of approaching 
difference in others as an obstacle to be overcome, an
agonistic respect for difference cultivates an appreciation for 
it, recognizing that identities are necessarily formed by the 
way we distinguish ourselves from others. Because those 
“others” are necessarily lacking in something we appreciate 
or endorse, we should encounter difference within an under- 
standing that there can be no identity without others who 
differ from it, and difference becomes an opportunity to learn 
about the inevitable cruelties and injustices lodged in one’s
own identity and presumptions. As people reflect on how 
their differences from others constitute their identities, even 
the most worthy ones, they gain an opportunity to understand 
themselves better, perhaps becoming more just and compas- 
sionate. Instead of feeling threatened by differences with 
their allies, environmentalists would be more likely to work 
effectively with them toward common ends. That is the hope 
that Schlosberg pursues. 

Although environmental justice organizations such as the 
Center for Health, Environment, and Justice (CHEJ) may 
not have read Connolly, or writers in a similar vein, such as 
Michel Foucault, Gilles Deleuze, and Felix Guattari, that is
what they have been doing, Schlosberg contends. They have 
been organizing what these writers might call rhizomatic 
pluralities, heterogeneous networks sustained by difference 
and connection, by multiplication and dispersion, not by 
hierarchy and singularity. Instead of focusing on policy at a 
national level, going from the top down (as the Big Ten 
environmental groups have done), environmental justice 
groups such as CHEJ have focused on what is happening at 
the local level, on a particular incinerator, hazardous waste 
dump, nuclear power plant, confined feedlot operation, or 
whatever. Instead of trying to make itself larger, CHEJ helps
local organizations get better organized, advises them on 
what has worked for others, provides technical information 
on environmental toxins, and then gets out of the way. By 
weaving together rhizome-like networks of local organiza- 
tions like this, each dealing with the problems in their own 
community, instead of building unitary tree-like organiza- 
tions as the Big Ten have done, organizations such as CHEJ 
are more likely to cause the changes needed to protect the 
environment, according to Schlosberg. 

Unlike many books in political science, Environmental 
Justice and the New Pluralism will be as helpful to political 
activists as it will be to academics. It delineates the differ- 
ences among different generations of pluralist thought and 
relates those differences to efforts to organize the environ- 
mental movement, suggesting ways that would be both more 
effective and more just. This book is important because it is 
one of the first by a political theorist to examine the 
theoretical implications of the environmental justice move- 
ment carefully. Let us hope it opens doors that other political 
scientists will explore. 


Citizen-Soldiers and Manly Warriors: Military Service and 
Gender in the Civic Republican Tradition. By R. Claire 
Snyder. Lanham, MD: Rowman & Littlefield, 1999. 183p. 
$60.00 cloth, $22.95 paper. 


Judith Hicks Stiehm, Florida International University 


This is a clearly written and interesting exploration of the 
citizen-soldier tradition from a postmodern feminist perspec- 
tive. The author asks: “What happens in a tradition that links 
citizenship with soldiering when women become citizens” (p. 
2)? In the republican tradition, Snyder argues, citizens are 
expected to participate in both civic and martial practices. 
When women engage in martial practices, however, they 
subvert a long and close association among men, the mascu- 
line, and the military. One might suppose that desexing 
and/or degendering the military would be Snyder’s goal, but 
her argument is more radical. 

Snyder believes the ideal of “civic masculinity” should be 
retained but reconfigured as a set of practices in which both 
men and women engage. Similarly, she proposes that both 
men and women engage in a set of practices she calls “robust 
femininity” (a concept that emerges only in the last pages of the
book and is not developed). To me, it seems easier to dissolve 
the association between men and the military and the asso- 
ciation of masculinity and the military than the link between 
men and masculinity. Even if one does not accept Snyder’s 
final argument, there is much of value in this book.

Snyder is clear about her many assumptions. First, she 
argues that the military is a key political institution, and only 
the citizen-soldier tradition places it at the center of political 
thinking. (I agree it should be; thus, the study of U.S. gov- 
ernmental institutions would encompass the Congress, the 
presidency, the judiciary, and the military.) Second, for 
Snyder a citizen is one who engages in civic practices. She 
contrasts this with citizenship conferred by blood or by place 
of birth. Although she discusses Rousseau’s concept of the 
citizen-soldier at length, she does not use his better known 
definition of the citizen as one who is simultaneously sover- 
eign and subject; this would make governing and liability the 
core elements of citizenship. Also, Snyder does not address 
the long-standing link between citizenship and property. 

In the civic republican tradition “soldiering is central to the 
process of becoming a citizen, because martial practices
instill in citizens the virtues required for participation in 
self-government aimed at the common good” (p. 100). It is 
true that those who have soldiered have often used their 
service to claim citizenship (African Americans after the Civil 
War and again after World War II, women after World War 
I, 18-year-olds after Vietnam). Some would argue, however, 
that the citizen-soldier tradition does not require all citizens 
to soldier so much as it argues that militaries should be 
composed of citizens as opposed to mercenaries, and citizens 
with civilian lives as opposed to military professionals. Snyder 
rejects both positions. 

Snyder also assumes that the republican tradition “neces- 
sarily entails” (p. 1) a commitment to the principles of liberty, 
equality, camaraderie, the rule of law, the common good,
civic virtue, and participatory citizenship. She suggests that
universal, democratic principles are part of the republican 
tradition. (Some would almost certainly feel it important to 
distinguish the democratic from the republican.) Snyder does 
acknowledge, however, that the citizen-soldier tradition she 
admires has a potential for creating a monster: a military that 
is exclusive, conforming, and chauvinistic, a military that 
moves from patriotic defense to imperial conquest, a military 
composed not of autonomous citizens but of a totalized mass. 

Indeed, when Snyder examines particular instances of 
organizations that invoke the citizen-soldier tradition, she 
finds little virtue. One example is the militias that emerged in 
this country in the 1990s (and faded from the landscape after
the 1995 bombing in Oklahoma City). These, Snyder sug- 
gests, were rooted in an antidemocratic exclusion of others, 
even though they sprang from the belief of members (largely 
white males) that it was they who were being excluded from 
civic/political participation, whether because government was 
responding to a too heterogeneous citizenry or because it had 
become too large, too remote, too controlled by the wealthy. 

A second example Snyder considers is the culture of two 
state military colleges, Virginia Military Institute and the 
Citadel. She states these claim to celebrate the citizen-soldier 
ideal, but instead they use exclusionary, misogynist, and 
homophobic training practices that are incompatible with the 
tradition. Again, Snyder asserts that the tradition is demo- 
cratic, but she acknowledges that the thought of both Ma- 
chiavelli and Rousseau (which she explores in some depth) 
can lead to deformed and vicious military institutions. Snyder 
does not provide any sustained, “virtuous,” democratic, in- 
clusive examples of a republican state based on the citizen- 
soldier tradition. Is it possible that getting people (men or 
women) to kill and/or to sacrifice their life requires either 
coercion or the conviction that one is special, different, 
extraordinary? Is it possible that exclusion is necessary to the 
functioning of the martial although antithetical to a commit- 
ment to the common good? (Current Swiss practices that 
require male citizens to participate in the military might have 
provided Snyder with a virtuous example; Swiss women not 
only do not participate in that military but also only recently 
have obtained full voting rights.) 

Snyder provides a useful review of the U.S. civic republican 
tradition, beginning with the Second Amendment to the 
Constitution and the Militia Act of 1792, which required 
enrollment of all free, white, able-bodied men between the 
ages of 18 and 45. Not long after that legislation, “voluntary”
(exclusionary) militias were formed (pp. 87, 93), which were 
believed to be “better” than the inclusive militia dictated by 
law. A little more than a century later the Dick Act of 1903 
gave the federal government some control over these “vol- 
untary” groups, which had become known as National Guard 
units. By 1908 the federal government had obtained the right 
to use those units outside the United States. In 1916 the
federal government further expanded its control over the 
National Guard and was given the right to draft citizens for a 
national military. Snyder sees this as a transition from a 
nineteenth-century America grounded in a civic republican 
tradition to one based on a liberal individualism linked to the 
Progressive Era. There are certain to be some who would 
disagree. 

There is room for discussion about what is the common 
good and what creates commitment to it; there is room for 
discussion about the effect of scale on civic participation; and 
there is a lot of room to discuss the civil-military relationship
if “civic” soldiers become politicized. For example, Snyder
seems to see military performances at nineteenth-century
civic and political events as positive, but what is the effect of
military display associated with one party’s presidential convention,
but not that of the other party, as occurred only
last year? 

Snyder’s contribution is to normative theory, and it may 
be wrong to think immediately about applications of her 
theory to existing institutions. The fact that one does begin
thinking about applications may reflect the power of her 
argument. 


Gandhi’s Dilemma: Nonviolent Principles and Nationalist
Power. By Manfred Steger. New York: St. Martin’s, 2000. 
232p. $39.95. 

Dennis Dalton, Barnard College 

This is an exceptionally challenging critique of Gandhi’s
thought and leadership, distinguished by a cogent argument
that is expressed in a lucid and concise style. The main thesis
cannot be missed because it is stated throughout the book, as
suggested by its title. Steger asserts in the introduction that
“Gandhi’s dilemma represents a particularly intriguing example
of the more general political predicament involving the
reconciliation of moral principles with political power ... my
book attempts to close a gap in the existing literature on
Gandhi by examining the origins, meaning and unfolding of
this dilemma as it applies to the tension between his principles
of nonviolence and his nationalist project, which aimed
at wresting political power from the hands of the British
colonizers” (pp. 4-5).

Does Steger close a gap or make a fresh contribution to the 
vast literature on Gandhi? The book is most effective when it 
holds Gandhi to his own standards, by asking whether and 
when his self-professed claims of forging a uniquely nonvio- 
lent form of inclusive nationalism were not undercut by a 
pronounced and pervasive “conceptual violence.” In the 
context of a careful analysis of Gandhi’s writing, Steger 
asserts that “Gandhi’s construction of nationhood appears to
be incompatible with his professed moral universalism,” because
“he failed to see that his essentialist vision of a
superior Indian spirituality ... was itself born in conceptual
violence. In other words, the project of the moral regeneration
of the nation was marked by violence at its origin ... a
sort of mirror image of the European nationalist discourse 
with its conceptually violent assumptions of moral superior- 
ity, culturally specific claims to a universal truth, and invidi- 
ous comparisons” (pp. 104, 106). Steger concludes that 
“nonviolent nationalism—Gandhi’s proposed solution to the 
dilemma of balancing nonviolent principles and nationalist 
power—never escaped the status of an unfulfilled ideal” (p. 
139). 
Other critics of Gandhi have suggested some of this before,
but the force of this book lies, first, in its employment of 
Gandhi’s own writings and, second, in the development of a 
theoretical framework that uses Gandhi’s example to address 
conceptual antinomies of violence/nonviolence, exclusivity/ 
inclusivity, nationalism/cosmopolitanism, dominance/submission
and realism/idealism in politics. These are neatly reiterated
in the epilogue. Although the book’s critique is sweeping
and fundamental, it can conclude on a perceptive and 
appreciative note by relating Gandhi’s significance to “the 
representatives of the twentieth-century ‘Gandhian tradi- 
tion,’ ” from King to the Dalai Lama (p. 194). This severely 
critical yet not unsympathetic analysis of its subject shows a 
deep understanding of the dilemma that Gandhi himself 


Does the idea of conceptual violence fit the Gandhi 
example? Steger cites Gandhi’s language toward the British 
and modern Western civilization and finds “a clear-cut case 
of conceptual violence” in Gandhi’s “incessantly labeling the 
British raj and its underlying civilization ‘satanic’ ” (p. 159). 
This evidently contradicts his own standards of nonviolent 
expression, and Steger connects this to exclusivist elements in 
Gandhi’s nationalism. Yet, the book does not mention Gandhi’s
vehement indictments of his own culture, which contained,
for him, the Greater Satan. In a long speech at a
conference on untouchability in 1921, Gandhi asserted: “I
don’t think it is so difficult to end the Satanism of the [British] 
Empire. That Satanism is of a worldly nature. The Satanism 
of untouchability has taken the colour of religion.” He then 
launches into a scathing assault on this Hindu institution, 
threatening his own individual non-cooperation “against this 
sin of society—against Hindu society” unless it ends (Collect- 
ed Works, 1994, vol. 19, 152). This remark indicates the high 
priority Gandhi consistently gave to social reforms. Most of 
his voluminous writings are explicitly addressed to such 
reforms rather than to his political resistance. Since this 
priority is not recognized in Steger’s analysis, certain questions
arise. How does Gandhi’s prioritization of social over
political reforms square with the sort of nationalism Steger 
depicts? Is the language Gandhi directs at untouchability 
regarded as conceptual violence? 

Another question concerns Gandhi’s argument that non- 
violence must resist institutional forms of violence rather 
than persons. He observed that “the words ‘Satanic’ and 
‘devilish’... describe a system, not persons. We are bound to 
hate evil. But by means of non-co-operation we are able to 
distinguish between the evil and the evil-doer” (Collected 
Works, 1994, vol. 19, 114). Steger mentions this distinction (p.
160) but attributes no significance to it. Yet, Gandhi and the 
nonviolent activists who followed him emphasize this point. 
Gandhi made much of it during the salt satyagraha (e.g.,
Collected Works, 1994, vol. 43, 2). Martin Luther King, Jr. 
stressed it as a central tenet in his “philosophy of nonvio- 
lence,” contending that “the attack is directed against forces 
of evil rather than against persons who happen to be doing 
the evil. It is the evil that the nonviolent resister seeks to 
defeat, not the persons victimized by the evil.” King follows 
Gandhi fully by finding this key principle consonant with a 
nonviolent resistance that “avoids not only external violence 
but also internal violence of spirit” (Stride Toward Freedom, 
1958, pp. 102-3). Both Gandhi and King are sensitive to the 
charge of conceptual violence, and they both try to meet it 
through this distinction between sin and sinner. 

The significance of this distinction lies in its implications, 
or what it means in practice. It goes far to explain how 
Gandhi or King could find it easy to establish personal 
rapport with their respective adversaries or enlist white 
support for their causes. Both also used it effectively as a
means of disarming the opposition or creating ambivalence in 
official policy or attitudes. In Gandhi’s case, this feature of 
nonviolent practice is superbly analyzed by D. A. Low 
(Britain and Indian Nationalism, 1997). The personal rela- 
tionship Gandhi formed with the Viceroy, Lord Irwin, during 
their negotiations in 1931 has been noted elsewhere for its 
civility of language, but Low adds to this discussion of 
Gandhi’s noteworthy verbal nonviolence by analyzing his 
negotiations with Herbert Emerson, Irwin’s Secretary of the 
Home Department. Of course, Gandhi’s “Dear Friend” 
approach could exasperate or antagonize adversaries. As the 
private correspondence and official documents of the Raj
show, however, it was precisely the lack of verbal violence 
that prompted British ambivalence if not always outright 
respect or sympathy. At any rate, there is no indication in this 
book that the important records of the India Office archives 
have been examined. 

Finally, if Gandhi’s conceptual violence seemed not to 
bother his adversaries, it appears to trouble even less those 
advocates of nonviolence who sustain his legacy. The impres- 
sive names of nonviolent nationalists noted at the end of the 
book are evidently drawn to Gandhi despite any conceptual 
violence or dilemmas of nationalism. On the contrary, they 
all offer ample testimony, in their lives and their abundant 
writing about Gandhi, to his authentic and consistent practice 
of nonviolent nationalism. 

It is unfortunate that this book, which deserves serious 
scrutiny, is overpriced and lacks a bibliography or a complete 
index. 


Transitional Justice. By Ruti G. Teitel. New York: Oxford
University Press, 2000. 292p. $35.00. 


Robert I. Rotberg, Harvard University


The political science literature on regime transitions says 
little about the various roles that the law can play to 
strengthen or otherwise assist shifts from old to new govern- 
mental dispensations. Teitel limits her analysis to democratic 
transitions—to transitions in a “positive normative” direc- 
tion—and constructively examines rule-of-law systems in 
times of political flux. She makes the important point, and 
her thorough and dense book elaborates it fully, that the law 
is more than the product of a transition; it structures the 
transition. The jurisprudence of regime transitions responds 
to past repressions and, for the incoming or victorious 
government, reclaims liberal norms in ways which are trans- 
formative and legitimizing. 

Teitel successfully argues that the law contributes far more 
to a liberalizing transition than the usual discussions of the 
virtues of retributive versus restorative justice would imply. 
She examines the varieties of criminal justice in transition; 
explores the several uses and kinds of historical justice (trials, 
truth commissions, and other inquiries); analyzes reparations 
as a way of righting wrongs and reestablishing the rule of law 
(and thus protecting individual rights); indicates how admin- 
istrative justice and public law reshape, sometimes radically, 
the new political order; and critiques the uses of constitutionalism
in serving the needs of transformed societies.

The truth commission is but one of the ways in which the 
causes of historical justice may be furthered. It is the one best 
known and most studied, and it has, thanks to the example of 
the formidable South African Truth and Reconciliation 
Commission (TRC), in this century seized the imagination of
incoming regimes, their supporters and advisors, and aroused
the suspicions of those who prefer retributive to restorative
justice both to settle scores and to delegitimize an egregious
prior regime. Transitions may require these and other
forms of historical justice to strengthen the incoming and 
destroy the credibility of the outgoing regime. 

The existence and operations of the TRC permitted South 
Africa to detail and come to terms with the sins of apartheid. 
The TRC grew out of an elaborate political compromise that 
rejected the outgoing regime’s demand for a blanket amnesty 
(the South American model), and for no retribution, in 
exchange for the TRC. It could grant individuals amnesties 
for political acts in order to extract full information about the 
dark events of the past. 

Whereas the first commissions in Africa and Latin America
dared not hear testimony in public for fear that it would
be too inflammatory or that it would arouse retaliation from 
ousted military personnel or their patrons, the TRC insisted 
on public testimony. The open interrogation of accused
perpetrators by victims as well as by prosecutorial figures 
from the TRC staff and by the commissioners themselves was 
salutary in achieving historical justice as well as restorative 
justice. But the TRC’s main procedural breakthrough was its 
transparency: The “victims hearings” were widely broadcast, 
televised, and reported at considerable length in the press. 
Hardly any South African could escape exposure to what 
Teitel rightly calls the critical conclusions of historical justice. 
The doleful deeds of the previous regime could be affirmed in 
the accumulation of individual testimonies. What had hap- 
pened was therefore not just an after-the-fact single story, as 
might have been revealed by a commission of historical 
inquiry, but more than a million perceptions of what had 
been revealed before the TRC in multiple iterations by 
multiple victims and perpetrators. 

The collective result amounted to both more and less than 
any result of criminal trials that could bring retributive 
justice. Although Teitel argues that trials and commissions 
equally shape collective memory, and that the traditional 
ritual nature of a trial allows publics to contextualize and 
share past experiences of wrongdoing, the new kinds of truth 
commissions do the same, and do it more expeditiously, less 
expensively, and, arguably, more efficaciously. Teitel largely 
acknowledges that point when she suggests that “rituals
involving contested histories of the individual case often 
break down in the face of massive systemic atrocities” (p. 75). 

Truth commissions obviously flow from the problems of 
proof. Commissions can cope much more conclusively than 
even Nuremberg-like trials with the scale and quantity of 
violence perpetrated by the modern oppressive state. They do 
“elide” appropriate evidentiary standards, and they simplify 
or abbreviate developed notions of discovery and due pro- 
cess. But that is the simultaneous strength and weakness of 
restorative justice. Truth commissions, to use Teitel’s evoca- 
tive phrase, are “‘impunity’s’ antidote and amnesty’s ana- 
logue” (p. 79). 

It is important in historical justice to ascertain truth. The 
elaboration of that truth is essential to an effective transition 
and to the provision of transitional justice. But Teitel has 
recognized, as have others, that truth is contextual, and that 
the new successors’ truth, as believable as it is, must be 
developed so that it can seem an accurate and plausible 
substitution for the accounts and rationalizations of the prior
regime.

Truth is not synonymous with justice or independent of 
justice. Teitel suggests wisely that it is a “virtue of justice” (p.
89).

This book is also virtuous. It provides a solid basis for all
future explorations of regime transitions and the varieties of
justice during transitions. 


Freedom, Efficiency, and Equality. By T. M. Wilkinson. New 
York: St. Martin’s, 2000. 199p. $65.00. 


Joseph H. Carens, University of Toronto 


Today capitalism enjoys global hegemony, ideologically, economically,
and politically, but the economic benefits of
capitalism are distributed in a radically inegalitarian manner. 
To many, even many who might wish it otherwise, this seems 
inevitable. This vast and increasing inequality is simply the 
price human beings must pay for freedom and prosperity. 
Martin Wilkinson’s goal is to challenge that view. In this 
excellent book, he argues that it is possible, at least in
principle, for an economic system to combine freedom,
efficiency, and equality. 

The primary goal of Wilkinson’s book is not to convince 
people hostile to equality that they should change their views 
but, rather, to help people sympathetic to equality to think 
more carefully about what they should pursue and how they 
should pursue it. The first few chapters are devoted to a
discussion of freedom, efficiency, and equality. Wilkinson 
wants to explain both why these values matter and how they 
might come into conflict. He shows that it is relatively easy to 
imagine arrangements that combine any two of these values 
but more difficult to see how to combine all three, a difficulty 
that he calls the trilemma. 

The incentives problem is at the heart of the trilemma. 
Occupational freedom and economic efficiency require some 
sort of market arrangement, but an egalitarian distribution of 
income would interfere with the incentives that make ordi- 
nary markets work. Wilkinson argues that the trilemma 
cannot be avoided so long as people make their occupational 
choices purely on the basis of their own ends and preferences. 
What is needed instead is the introduction of moral motiva- 
tions into economic affairs and, more specifically, a (legally 
unenforceable) social duty to make occupational choices with 
an eye to the effect of such choices on efficiency and equality. 
If people are motivated by this sort of social duty, it will be 
possible to cut the link between production and distribution 
and to distribute income on an egalitarian basis. (This 
egalitarian distribution may not simply be equal money 
incomes because it may be appropriate to provide some 
financial compensation for burdensome work.) 

The specific moral duty that Wilkinson proposes (which he 
calls the counterfactual duty) is the following: “ ‘respond to 
market prices in the egalitarian system as though you were 
getting the money for your own personal consumption’ ” (p. 
136). It is counterfactual because incomes will be taxed at 
100% and redistributed as equality requires. Wilkinson argues
that this counterfactual duty is compatible with efficiency
and also with all the underlying values that make
occupational freedom so important, such as the capacity to
say “no” to bosses, personal autonomy, and opportunities for
self-development. He also contends that it is not an excessively
demanding moral requirement. It still leaves considerable
scope for reasonable self-interest and the pursuit of
important personal projects because these sorts of concerns 
are reflected in the kinds of occupational choices people 
make in responding to market prices, and so they will be 
reflected in his egalitarian system as well. 

This brief summary cannot do justice to the richness and 
variety of the arguments that Wilkinson develops in the 
course of his work. For example, his discussion of occupa- 
tional choice in chapters 2 and 3 is the only sustained
normative analysis I have seen of this important topic. The 
book is filled with brief, lucid critiques of such authors as 
Joseph Raz, Brian Barry, Thomas Nagel, and John Rawls on 
key points, and these critiques are always presented in ways 
that advance the overall argument rather than distract from 
it. I should perhaps mention that Wilkinson’s egalitarian 
system explicitly builds upon an egalitarian model I proposed 
some years ago, although it differs from my model in several 
important ways. Most of his book, however, is concerned with 
normative questions about egalitarianism that I did not 
explicitly pursue. 

The kind of egalitarian system that Wilkinson sketches is 
obviously not on the horizon, and he makes no pretense of 
offering a program for political action. What he offers instead 
is an exploration of fundamental principles. Our sense of 
inevitability about the way things are today may stem in part 
from the poverty of our imaginations. It is refreshing to have 
an inquiry that asks us to think again about unexamined 
presuppositions and to consider whether there might be 
alternative modes of organizing our social and economic 
world that would make it possible for all to share much more 
equally in the benefits of what we produce together. 

One can disagree even with a book that one admires. I 
worry that Wilkinson is neglecting some important problems 
of institutional design in the way he constructs his model. 
Even in an egalitarian system that relies on social duties, one 
should try to minimize direct conflicts between actors’ interests
and their duties or at least to make the potential conflicts
as evident as possible both to the actor and to others. In other 
words, we want to minimize moral hazard problems. In 
existing markets, people often do not know how they will 
respond to an offer until they actually receive it and have to 
make a real decision. In asking people to respond to market 
prices as though they could keep the money (even though 
they cannot), Wilkinson may be asking something that is not 
psychologically feasible. At the least, where money is compensating
for other factors (e.g., job stress, collegiality,
location) for which there might be no compensation in an 
egalitarian system, people would have a powerful incentive to 
persuade themselves that the monetary compensation would 
not have been enough. 

Wilkinson does suggest that the egalitarian system may 
provide compensation for some of the burdens of work, but 
he underestimates the difficulties in doing so. He assumes, for 
example, that setting the right level of compensation for the 
burdens of work is merely a matter of establishing the right 
metric for equality, an issue that is much disputed among 
contemporary egalitarian philosophers (including Joshua Co- 
hen, Joseph Raz, Philippe Van Parijs, and others) and that 
Wilkinson himself does not attempt to resolve. But even if we 
had the right metric, it might not be easy to apply it, and the 
attempt to do so could have its own perverse effects on 
incentives, especially given the variability of what people find 
burdensome. 

Regardless of what one thinks about these details, the work 
as a whole is remarkably clear, coherent, and accessible. 
Wilkinson is absolutely right to draw our attention to the 
fundamental difference between egalitarian projects that rely 
on moral motivation and those that do not (p. 90). I think he
is also right to argue that only the former have any chance of 
real success in achieving equality. Anyone interested in 
thinking more deeply about egalitarianism will benefit from 
reading this book. 


American Politics 


A Caring Jurisprudence: Listening to Patients at the Su- 
preme Court. By Susan M. Behuniak. Lanham, MD: 
Rowman & Littlefield, 1999. 224p. $55.00 cloth, $17.95 
paper. 

Donna R. Kemp, California State University, Chico 


This is a work of advocacy; the author is concerned with 
constructing and encouraging the use of a model for a caring 
jurisprudence. Behuniak uses her model to revisit the Su- 
preme Court’s decisions on abortion and assisted suicide. She 
develops her view by extending arguments of feminist justice 
and care. 

A Caring Jurisprudence is an exceptional study of how the 
medical, legal, and personal versions of cases are heard in the 
courts, and it demonstrates the predominance of medical and 
legal views over the personal views of patients in the judicial 
process. The author fulfills her goal of exploring “how 
patients’ knowledge and medical knowledge interact within 
the legal culture, and how this interaction affects legal 
decision making” (p. 2). 

Behuniak begins by discussing these three types of knowl- 
edge—legal, medical, and patient—and the limitations of 
each. She points out that law and medicine share a common 
epistemology that develops knowledge through impartial and 
reasoned processes, and justices see such knowledge as 
expert and reliable and a justifiable basis for their decisions. 
Patient knowledge is considered emotional, involved, and 
particular, and legal norms tend to assume that justice 
requires less attention to be paid to such knowledge or that it 
be discarded. Behuniak argues that the exclusion of patients’ 
knowledge handicaps the Court’s deliberations and decision making. 


Two detailed chapters use abortion and physician-assisted 
suicide cases to show how the Court frames the issues as 
predominantly medical matters and excludes most patients’ 
knowledge. Behuniak demonstrates how these cases become 
primarily about physicians rather than patients. She then 
shows how feminist scholarship led her to develop a model of 
judicial decision making that integrates the ethics of care and 
the ethics of justice. Finally, Behuniak revisits the abortion 
and physician-assisted suicide cases, applies a caring jurispru- 
dence, and reveals how it can broaden the knowledge con- 
sidered. She brings forward extensive patient knowledge and 
uses this to reach prochoice and prophysician-assisted 
resolutions of the cases. Because her model is not result oriented, 
however, she stresses that the Court might not reach the 
same conclusions with the same knowledge. 

Behuniak builds on the work of feminist scholars to explain 
why patients’ knowledge receives so little weight before the 
Supreme Court. Using Carol Gilligan’s (In a Different Voice: 
Psychological Theory and Women’s Development, 1982) “different 
voice,” which reintroduced the theory that women’s 
moral reasoning is different from men’s, Behuniak draws on 
Gilligan’s comparison of the ethics of justice and care to 
construct a theory as to why the legal system does not usually 
accept subjective, involved, and particular knowledge. Jus- 
tices use an ethic of justice with a “masculine” voice, that is, 
reasoned, impartial, and universal. The author also uses 
Gilligan’s theory to suggest that because patients’ knowledge 
is the “feminine” form of knowledge derived from the ethic 
of care, the court devalues it. This distinction between the 
two ethics is also supported by other feminist scholars (e.g., 
Marilyn Friedman, “Beyond Caring: The De-Moralization of 
Gender,” in Virginia Held, ed., Justice and Care: Essential 
Readings in Feminist Ethics, 1995; Alison M. Jaggar, Feminist 
Politics and Human Nature, 1983; Joan Tronto, Moral Boundaries: 
A Political Argument for an Ethic of Care, 1993). 

Building on two distinctive elements of feminist jurispru- 
dence—law is not neutral, and theoretical challenges to 
mainstream law will eventually lead to practical change— 
Behuniak applies and develops feminist methods to frame a 
model for a caring jurisprudence. It has four principles: “(1) 
Begin with personal experiences and use these to question 
abstract knowledge; (2) challenge the validity of dichotomies; 
(3) analyze the politics that support an ethic of justice over an 
ethic of care, i.e., who and what is served by the law’s focus on 
competing rights, universal rules, impartiality, and reasoning; 
and (4) explore the possibilities for change by shifting the axis 
of law away from justice and toward care in order to integrate 
them” (p. 119). 

Behuniak does not argue for the superiority of either ethic. 
She supports an integrationist approach, whereby both ethics 
are applied by justices. She demonstrates that this model is 
practical by showing that the concurring opinions of Justice 
Stevens in Washington v. Glucksberg (1997) and Vacco v. 
Quill (1997) used patients’ knowledge as well as legal and 
medical knowledge, even though he did not vote in the 
patients’ favor. 

This book not only is methodologically sound but also is 
well written and has a practical application. Behuniak’s 
model is adaptable to many other types of cases involving 
patients and medicine, and it is as applicable to cases that 
involve mental health issues as it is to those that involve 
physical health issues. A Caring Jurisprudence should be read 
not only by academics but also by practitioners, including 
judges and justices of the Court. 


Elections to Open Seats in the U.S. House: Where the Action 
Is. By Ronald Keith Gaddie and Charles S. Bullock III. 
Lanham, MD: Rowman & Littlefield, 2000. 256p. $65.00 
cloth, $26.95 paper. 


Stephen M. Nichols, California State University, San Marcos 


Compared to the voluminous literature on the electoral 
advantages of congressional incumbents, little attention has 
been paid to races for open seats. These contests lack the 
monolith of incumbency and thus may differ substantially 
from those in which an incumbent’s shadow looms large. In 
this valuable book, Gaddie and Bullock make a compelling 
case for the importance of a topic that has been largely 
overlooked. Given the high reelection rate of incumbents, the 
most promising avenue for a candidate seeking election to 
the House—and, more important, the greatest opportunity 
for effecting meaningful partisan and policy change in the 
institution as a whole—is through open seats. 

This district-level study of open seat races between 1982 
and 1998 sheds new light on important dimensions of these 
contests. It amends the conventional wisdom that such races 
are much more competitive than incumbent-dominated elec- 
tions. Although open seat contests are generally closer than 
incumbent-challenger races, only about one-third of all open 
seat contests since 1982 were marginal victories (won with 
less than 55% of the vote). Gaddie and Bullock also note that 
many of the factors that constitute a strong challenge to an 
incumbent—prior political experience and ample campaign 
finances, in particular—likewise predict success in open seat 
contests. In addition, the absence of an incumbent allows for 
greater effects of district and national partisan tides in such 
elections. 

The analysis of the pivotal 1994 House elections is partic- 
ularly good. In a departure from the past, Republican 
candidates for open seats ran substantially ahead of the 
normal GOP vote in most districts, especially in the South. 
This was so even when GOP candidates faced better funded 
and more experienced Democratic rivals. Gaddie and Bul- 
lock argue convincingly that the GOP’s performance in open 
seat contests was a crucial component of the party’s majority 
in Congress, helped make possible the Republican realignment 
in the South, and in turn produced the profound policy 
reverberations still felt today. 

Gaddie and Bullock also explore patterns in financing 
campaigns for open seats. It is relatively easy for donors to 
determine recipients in incumbent races, but the choice is 
murkier without that cue. The authors find, interestingly, that 
past political experience has little bearing on the size of the 
campaign war chest, but the amount of money raised by an 
opponent is a significant predictor of one’s own campaign 
resources. A very intriguing finding in this regard is the 
authors’ inability to explain open seat funding in 1994: For 
other years, the goodness-of-fit of their models is quite 
respectable, but the model for 1994 accounts for almost no 
funding variation whatsoever. They attribute this to substan- 
tive changes in variables that had been strong predictors of 
fundraising levels for open-seat candidates in past years. For 
example, party spending on open seats increased in 1994 and 
was more evenly distributed across candidates for open seats. 
By not supporting specific races more heavily, the parties sent 
weaker signals to other donors as to whom to support; as a 
result, the level of party funding was not a significant 
determinant of fundraising by candidates for open seats that 
year. 

Those who hope for greater gender balance in Congress 
can take heart from this analysis. Gaddie and Bullock note 
that the imbalance will most likely change through the same 
means as the partisan balance shifted, via open seats. Fur- 
thermore, they find that women with sufficient political 
experience and campaign financing fare just as well in open 
seat elections as comparably experienced and funded males. 
This is a noteworthy addition to the growing literature that 
documents the decline of barriers confronting women who 
seek high elective office. 

The analysis is methodologically sound and, in places, 
highly sophisticated, yet the conclusions are not beyond the 
grasp of an educated lay person. The discussion of the PRE 
interpretation of the logit analyses, for example, is especially 
clear. Overall (see below for one caveat), the models seem 
well specified; as noted, in most instances the hypothesized 
relationships are statistically significant, and the models 
account for an impressive proportion of variation in the 
dependent variables. 

The strength of the statistical models is best demonstrated 
in the final chapter, wherein Gaddie and Bullock venture 
beyond the safe ground of explaining past events and predict 
future outcomes. Writing in 1999, they estimated the vote in 
open seat races in the 2000 elections. They deserve credit for 
their bravery and their accuracy. In Table 7.6 (p. 182), the 
authors predict the GOP vote percentage in 25 expected 
open seat House contests. In checking how well they fared, I 
eliminated three contests—one in which the incumbent did 
not retire as anticipated, and two with circumstances (a 
three-way race in Missouri, a four-way battle in Rhode 
Island) the authors could not have foreseen. For the remain- 
ing 22 contests, their model of the GOP vote share performs 
remarkably well: The average error is only about three 
percentage points. Gaddie and Bullock’s predictions about 
the partisan outcome in these races are presented alongside 
those of political commentators Rowland Evans and Robert 
Novak. Gaddie-Bullock and Evans-Novak agreed on the 
outcome of thirteen of the contests; among the nine in which 
their predictions differed, Gaddie and Bullock were right 
about five, Evans and Novak about four. Score one for the 
political scientists over the pundits. 

The book is not flawless. The authors point out that open 
seats are the means through which partisan change in Con- 
gress can be effected. But the House is only half the 
institution, and no consideration is given to whether the 
findings can be generalized to the U.S. Senate. Understand- 
ably, such difficulties as amassing a sufficient number of cases 
and treating states as homogeneous political jurisdictions 
would make extending the model to Senate contests tricky. 
Nonetheless, some speculation as to whether the authors’ 
conclusions might hold in the other chamber seems war- 
ranted. 

Also, a factor that some scholars have identified as an 
important determinant of open seat contests—media treat- 
ment of the candidates—is missing here. Paul Herrnson’s 
(Congressional Elections, 1998) analysis of open House seats 
in 1992 found that candidates who received more favorable 
media coverage than their opponents accrued an advantage 
of nearly four percentage points. The Gaddie and Bullock 
models include factors that may lead to such an advantage in 
media treatment (a candidate’s past experience, campaign 
war chest, and so on), but the authors might have addressed 
this difference between their models and those used in prior 
work. 

Of less substantive importance are the errors not normally 
found in a work from a reputable publisher (and responsibil- 
ity for which lies more with the editors than with the authors). 
For example, the maps depicting gerrymandering in Indiana 
in 1982 (pp. 26-7) are fuzzy, such that considerable effort is 
required to discern the substantive point discussed in the text. 
In the section on future predictions, the authors state: “At 
the time of this writing (winter 2000), not all open seats are 
known” (p. 178). By winter 2000, of course, not only were the 
seats known, but also the outcomes had been decided; surely, 
Gaddie and Bullock were writing in winter 1999. Finally, the 
presentation of both R-squared and adjusted R-squared in 
many (but not all) regression tables is curious. In most 
instances, the figures reported differ very little, and when they 
do, the text interpretation invariably relies on the more 
conservative adjusted goodness-of-fit measure. One wonders 
why the R-squared is presented at all. 

Overall, this work is a genuine contribution to the litera- 
ture on congressional elections. The book makes a powerful 
case for studying open seat races and lays the groundwork for 
future exploration in the field. It deserves a spot in the library 
of all students of Congress and elections. 


Urban Exodus: Why the Jews Left Boston and the Catholics 
Stayed. By Gerald Gamm. Cambridge, MA: Harvard Uni- 
versity Press, 1999. 384p. $39.95 cloth, $19.95 paper. 

Robert Huckfeldt, Indiana University 

Gerald Gamm seeks to explain the different rates at which 
Jews and Catholics left the Roxbury and Dorchester areas of 
Boston for nearby suburbs during the middle decades of the 
twentieth century. As Sam Bass Warner shows (Streetcar 
Suburbs, 1978), Roxbury and Dorchester were the first ring of 
streetcar suburbs around the city and became thriving 
neighborhoods during the late nineteenth and early twentieth 
century. The subsequent development of the automobile and 
more distant suburbs, coupled with continued migration into 
the Boston area, increased centrifugal pressures on the city’s 
spatial development. As African Americans and other minor- 
ities began moving into Roxbury and Dorchester, many white 
residents left for new and more distant suburban locales. 

As in other American cities, Jewish residents of Boston 
were among the first to leave their old neighborhood, and 
Catholics were among the last. In an effort to explain these 
migration patterns, Gamm accomplishes a number of feats. 
He provides a more sophisticated understanding of individ- 
ual involvement in urban institutions as well as the impor- 
tance of institutional rules for the vitality of urban groups. He 
encourages us to think more deeply about the institutional 
basis of urban social contexts. And he forces us to reconsider 
explanations for urban decay, the vitality of urban communi- 
ties, and the potential for diverse urban neighborhoods. 

Why did Jews leave more rapidly than Catholics? Some 
controversy surrounds this issue, but the core of Gamm’s 
argument revolves around distinctive institutional features in 
the organization of Catholic parishes and Jewish synagogues. 
The life of the parish is defined in terms of geography and 
hierarchy. Catholics cannot belong to the parish church 
unless they reside within the boundaries of the parish. 
Church property does not belong to parishioners but to the 
diocese. Finally, priests are assigned and reassigned to par- 
ticular parishes at the discretion of the diocese. In contrast, 
the location of a Jewish synagogue is determined by the 
congregational members who own it, and membership is an 
individual choice that is unaffected by location of residence. 
Moreover, the rabbi is responsible to the synagogue that 
provides his employment contract and salary. Hence, when 
Catholics move from the city to the suburb, they leave 
behind the parish and the church as well as a parish-specific 
network of social contacts. When Jews leave the city for the 
suburb, they can maintain their membership in the urban 
synagogue. 

According to Gamm, the geographic definition of the 
Catholic parish creates a credible commitment to the people 
who reside in that area. Catholic residents know that the 
church will not leave. Moreover, they know that other resi- 
dents know the church will not leave. One might say that 
Catholic residents are discouraged from leaving by the necessary 
loss of ties to their parish and church, and they are 
encouraged to stay by the assurance that others are also 
unlikely to leave. In contrast, the freedom of departing 
Jewish residents to continue their involvement within the 
synagogue makes leaving easier. This individual freedom of 
mobility, along with the ability of Jewish congregations to 
close down operations at one site and relocate to another, 
undermines the group’s commitment to a particular locale. 
Hence, an individual Jewish resident might anticipate the 
departure of both the synagogue and its membership, which 
makes it more likely that the individual will leave. 

In many ways, Gamm provides a story of unforeseen 
consequences and short time horizons. When Jewish resi- 
dents of Dorchester moved to Brookline, they may have fully 
expected to maintain their involvement at the old synagogue. 
Similarly, when the urban synagogue provided services to its 
suburban members through satellite facilities, its leadership 
based these actions on the future survival of the synagogue. 
In fact, both sets of actions led to the inevitable demise of one 
institution and the formation of another. Freedom of association 
and location inevitably led to distinctive patterns of 
migration, decline, and institutional rebirth. 

What are the consequences of these patterns for urban 
politics and the vitality of urban communities? Who are the 
winners and who are the losers? Here again, short-term 
dynamics often run at cross purposes to long-term dynamics. 
In the short run, locally based Jewish institutions lose support 
and disappear, a fact that is particularly unfortunate for the 
typically less affluent Jewish residents who are left behind. In 
contrast, Catholic parishes endure, and many of the churches 
once populated by Irish Americans ultimately become the 
religious homes of other ethnic groups, thereby providing 
institutional continuity in the context of sometimes rapidly 
changing population composition. 

At the same time, the geographical basis of the Catholic 
parish does not provide a uniformly happy tale. The same 
factors that make Catholic residents less willing to leave also 
create and exacerbate higher levels of hostility toward new- 
comers to the neighborhood, and this hostility is a major 
theme of Boston politics throughout much of the twentieth 
century. In short, we should not forget that race and class and 
ethnicity lie at the heart of this story. Ironically, in their rapid 
desertion of Roxbury and Dorchester, the departing Jewish 
residents made possible a somewhat easier in-migration of 
African Americans and others who were in dire need of their 
own homes, neighborhoods, and communities. 

Finally, and perhaps most important, Gamm contributes 
important new insight regarding the institutional basis of 
urban communities. There is an important component of 
individual choice and preference in the patterns of associa- 
tion and networks of interaction that underlie these commu- 
nities, but the incidence and likelihood of particular interac- 
tions is enhanced by some institutional arrangements and 
diminished by others. Hence, the form and function of urban 
communities, and the vitality of urban groups, are not 
accidental, even if they are frequently unintended. Gamm’s 
rich and compelling account of the institutional basis for 
these groups and communities provides an indispensable 
element to our understanding of the roles played by space 
and location in urban communities and urban politics. 


The Legacies of the 1964 Civil Rights Act. Edited by Bernard 
Grofman. Charlottesville: University Press of Virginia, 
2000. 320p. $55.00 cloth, $16.50 paper. 


Mary Coleman, Jackson State University 


Under what conditions, if any, do policies designed to 
enhance black Americans’ citizenship succeed? At what pace 
and with what apparent and real benefits and costs to the 
public interest and public good do they do so? These are 
questions that this important volume seeks to evoke in 
readers. The contributors offer approaches and answers that 
are often robust and often at variance one with the other. 
Most provide insights that could lead to demonstrably better 
research programs for social policy than exist presently. 
David B. Filvaroff and Raymond E. Wolfinger examine the 
intersection of policy interests at the executive and legislative 
levels and the protest and leadership mobilization in commu- 
nities of good will throughout the nation. The nonviolent 
movement’s message of citizenship and justice for blacks 
resonated throughout the world and gave rise to policy 
mobilization of unprecedented magnitude. This chapter 
alone should be of interest to students of Congress, to 
scholars of mass movements and public policy, and to students 
of the presidency. What Filvaroff and Wolfinger do not 
address is the extent, if any, to which America’s status as a 
contending superpower and its reach as a media power 
compelled a level of conscience and duty that resulted in the 
cooperation of the national government—courts, legislature, 
executive, and bureaucracy—in the most extensive expansion 
of citizenship in modern times. 

Hugh Davis Graham offers penetrating analysis about the 
changing regulatory state in America and the ways in which 
cross-cutting compliance mechanisms developed by Civil 
Rights Act provisions challenged the traditional ideas of 
federalism. Graham shows how the sleeper provision, Title 
VI, provided chief leverage for national enforcement of the 
disparate impact standard in minority rights. This model of 
compliance with federal regulations reached beyond the area 
of race relations. The denial of funds to programs and 
institutions that discriminated on the basis of race, religion, 
or national origin effectively prevented tax dollars from 
buttressing segregation. Furthermore, the effective use of 
Title VI, in combination with executive orders, encouraged 
desegregation in employment opportunity throughout the 
South. 

Graham’s work confirms that of Filvaroff and Wolfinger; in 
combination, their essays demonstrate that mobilization of 
rights requires “outsider” groups to exert systematic pressure 
in the concrete arena of politics and efficacious regulatory 
and compliance mechanisms to squeeze out racist institutions 
and institutional practices and policies from the national, 
state, and local body politic. Only a weakening of the national 
will to its forced commitment to equality of opportunity in 
America could halt this movement. 

What is the nexus between efforts to desegregate public 
schools and the Civil Rights Act of 1964? Gary Orfield 
examines the slow pace at which efforts to desegregate public 
schools proceeded. The Department of Health, Education 
and Welfare (HEW) eventually moved ahead of or alongside 
the courts but was very much spurred by the judicial milieu of 
the Supreme Court and the need for specificity of language. 
Actual desegregation of both teachers and students was 
called for in HEW guidelines. In this move HEW in 1966 
answered critics who had asked of the Brown decision in 1954 
what the Supreme Court had meant by desegregation. A 
handful of federal cases decided between 1966 and 1969 
jettisoned the old segregated public education system in the 
South and signaled important expectations for change in the 
world north of the Mason-Dixon line. In 1969 the Supreme 
Court decided Alexander v. Holmes, a case arising from 
Holmes County, Mississippi. Alexander ended all deliberate 
speed; in rural areas and towns, schools throughout the South 
desegregated. One full generation after Brown, four years 
after the Civil Rights Act of 1964, three years after the 
Voting Rights Act of 1965, and three months after Alexander 
v. Holmes, public schools in the South meaningfully desegregated. 


The essays of Orfield and Barbara Phillips Sullivan raise 
several questions for me. How, if at all, did the Civil Rights 
Act create greater opportunity for a quality education? Of 
what use is the Civil Rights Act of 1991 to resegregated inner 
city schools dominated by children who reside in neighbor- 
hoods close to or in poverty? What are the noblest ends 
served by public school desegregation? Sullivan’s examina- 
tion of higher education policy takes on this issue. The 
agendas of Orfield and Martin appear to be at odds. Case law 
in higher education desegregation warrants greater examina- 
tion, as does the Department of Justice’s various responses to 
equity and desegregation in higher education. The failure of 
the courts to define “educationally sound policy” and “free 
choice” severely hampers the search for justice. 

Ambiguous language often creates ambiguous policy and 
ambiguous policy effects. Burstein encourages studies that 
examine the context of enactment and enforcement; this 
includes public opinion, protest activity, lobbying, electoral 
politics, media coverage, and political leadership. Burstein 
provides a cursory examination of the effect of equal 
employment opportunity on gender roles as well as law and 
labor market outcomes, and the conclusion is that these 
movements have provoked a countermovement that has 
reasserted “traditional” values into the center of political 
debate. What does such a view explain or obscure? Interest- 
ingly, Burstein wrestles with definitions for discrimination 
and affirmative action, but he uses uncritically the term 
“traditional values” without attempting to frame the context 
within which the countermovement has gained leverage and 
“legitimacy.” 

Why have some items of the civil rights agenda failed while 
others have succeeded? For example, what explains Title II’s 
effectiveness? Randall Kennedy answers: (1) Discrimination 
in public accommodations was a regional not a national 
problem; (2) discrimination in areas other than public accom- 
modation was and is more subtle and could and can be 
justified or equated with deficits in blacks’ social capital, due 
to their relative lack of access to education and other 
resources; and (3) perceived scarcity is more associated with 
opportunities to acquire economic well-being than public 
accommodation. If Kennedy’s explanations are correct, what 
accounts for the stubbornness of personal, individual, and 
institutional discrimination against blacks in employment and 
in higher education? 

Katherine Tate and Gloria J. Hampton offer partial but 
important answers. Their essay underscores the challenges 
that a black rights policy agenda driven by public opinion 
would face in America. In principle white Americans of all 
ages and partisan identification believe in equal opportunities 
for jobs and education. Most continue to reject federal efforts 
to reduce the scale of racial inequality; most believe that 
opportunities abound. White responses to a government role 
vary across the age spectrum; overall, support for government 
intervention has declined for all age groups over the last two 
decades. The content of opportunity and methods of enforce- 
ment continue to challenge the minds, hearts, and interests of 
whites—men, women, young, old, the well educated—no 
matter where they live in America. 

Just what lessons are learned from the last fifty years of 
civil rights effort? According to Fraga and Ruiz-de-Velasco, 
the nation needs to think about the content and character of 
the public interest. Their view is that the political arena lacks 
a unifying conception of personal citizenship. These authors 
and Wasby would have one think about the inherently limited 
strategy based on legal rights inherent in the Civil Rights Act 
of 1964. Fraga and Ruiz-de-Velasco believe that the act 
obscures more than it makes clear. Thus, these questions are 
asked: Do rights extend to individuals, groups, or both? Are 
blacks victims of preferential treatment under the act? Does 
the act with its inherent strategy lead to a perception that 
some gain everything and others gain nothing? This book, 
like most thoughtful ones, is long on questions and short on 
answers. 

The lasting legacy of the 1964 legislation is its reach for a 
more robust conception of citizenship rather than balkaniza- 
tion or the creation of a hierarchy of victimization. That the 
regulatory state was transformed once to ensure the integrity 
of citizenship as a matter of everyday practice means that, as 
necessary, it can again be transformed. Robin Williams, Jr., is 
correct when he argues that claims must be examined and 
renegotiated over time and place. Grofman’s edited volume 
demonstrates that race discrimination persists. My worry is 
that the public interest will likely continue to privilege liberty 
at the expense of the public good that values substantive 
justice. 


To Serve God and Mammon: Church-State Relations in 
American Politics. By Ted G. Jelen. Boulder, CO: West- 
view, 2000. 176p. $35.00 cloth, $14.00 paper. 

Paul Apostolidis, Whitman College 

The return of the Republicans to national executive leader- 
ship would appear to bring renewed prospects for an adjust- 
ment of the contentious boundary between church and state. 
Ted G. Jelen’s book thus arrives at an opportune moment in 
American political history. Teachers and scholars of religion 
and politics in the United States will find a great deal to 
recommend about this compact, accessible, and provocative 
assessment of recent tendencies in the ongoing battle to 
define the proper reach of government vis-a-vis the realm of 
the sacred. 

Jelen’s book is pitched at a level appropriate to undergrad- 
uates who are confronting issues regarding church-state 
separation for the first time. Its central achievement lies in 
rendering an impressive range of existing research available 
in an organized and stimulating way to such readers, rather 
than in supplying new research. To Serve God and Mammon 
does not offer the breadth of other texts on religion and 
politics in the United States that are targeted toward a similar 
audience, especially in regard to coverage of Left-liberal 
religious activism (e.g., Robert Booth Fowler and Allen D. 
Hertzke, Religion and Politics in America, 1995; Kenneth D. 
Wald, Religion and Politics in the United States, 1997). Nev- 
ertheless, Jelen’s extensive and systematic analysis of church- 
state relations, particularly the role of court decisions and 
Christian Right activism in shaping this sphere of political 
contention, may well make the book more useful than others 
for courses on religion and politics. 

The abiding tension between the establishment and free 
exercise clauses of the First Amendment forms the concep- 
tual foundation for most of the book. Jelen makes a distinc- 
tive contribution by insisting that conflict over church-state 
relations cannot be adequately understood according to any 
simplistic liberal-conservative dichotomy or, alternatively, a 
facile binary between those who emphasize free exercise and 
those who stress the disestablishment principle. He shows 
instead how political struggles over these issues can be based 
on at least four different ways of combining interpretations of 
the establishment and free exercise clauses. Jelen employs his 
four-part typology to analyze both rhetorical and institutional 
features of church-state controversy. 

One of the three main chapters discusses historical inter- 
pretation as an important arena for dispute over church-state 
relations. Jelen argues that both accommodationists and 
separatists (on the establishment clause) have found ample 
historical grounds to make their positions seem reasonable in 
terms of maintaining fidelity to the founders’ intentions (pp. 
32-42). In another chapter, Jelen offers the nuanced argu- 
ment that federalism, the separation of powers, and other key 
institutional features generate variation in the receptiveness 
of public officials in different arenas of government to 
majoritarian demands for accommodationist and communal- 
ist (on free exercise) policies (pp. 53-75). Jelen then shifts his 
focus back to the rhetoric of conflict over church-state 
relations, contending that recent Christian Right leaders 
have broken with their predecessors by emphasizing free
exercise values over the establishment clause, and by favoring 
libertarian rather than communalist readings of the free 
exercise clause (pp. 86-101). The author concludes that 
church-state controversy not only is unlikely to abate but also 
is desirable to ensure governmental responsiveness to the 
changing religious composition of the American public and 
the continuing vitality of religious freedom (pp. 110-29). 

Jelen anchors his discussion of the current debate in an 
excellent contemplation of the deliberative requirements of 
democracy and the value of religious institutions and faith- 
based truth-claims with respect to those democratic demands 
(pp. 80-94). But contestation over the church-state boundary 
is not simply a matter of reasoned deliberation; it is also a 
strategic, discursive struggle over the construction of national 
identity. Jelen does not explicitly thematize this discursive 
aspect of the controversy, but doing so would have enabled 
him to address three problems I see with the book. First, it 
would have allowed him to draw tighter and more felicitous
connections between his discussions of rhetorical and insti- 
tutional aspects of these issues. Jelen might have probed, for 
example, how historical accounts that idealize the founders as 
pious men who sought to strengthen the people’s piety may 
combine with some school boards’ hospitable responses to 
“accommodationist public opinion” to generate a historically 
distinctive mode of citizenship inflected by the majority faith 
(pp. 32-5, 62-4). 

Second, attention to the discursive aspect would have 
helped Jelen avoid oversimplifying the current politics of the 
Christian Right by taking leaders’ rhetoric simply at face 
value, as rational propositions in a deliberative context. It is 
true that recently “the Christian Coalition . . . has specifically 
eschewed any notion of a ‘Christian America’ in favor of a 
‘place at the table’ in public dialogue,” but this is only one 
current in the wider discursive environment of the Christian 
Right (p. 95). In other major aspects of the complex political 
culture of this movement, from Focus on the Family’s radio 
broadcasts to the Promise Keepers’ mass spectacles, Ameri- 
can national identity remains intimately bound up with 
evangelical Christianity (see Paul Apostolidis, Stations of the 
Cross: Adorno and Christian Right Radio, 2000; Linda Kintz,
Between Jesus and the Market: The Emotions that Matter in
Right-Wing America, 1997).

Third, exploring the rhetoric deployed in church-state 
contestation as identity-constituting discourse might have 
induced Jelen to qualify his enthusiasm for further conflict 
over church-state relations. To the extent that such contro-
versy takes the form of discursive struggle rather than
critical-rational debate, a robust public discourse on church- 
state relations may have effects of power quite other than 
those that Jelen foresees, disempowering those whose exclu- 
sion proves necessary for the coherent construction of na- 
tional identity. 

On the whole, Jelen admirably acquits himself in the core 
task of To Serve God and Mammon. He furnishes a basic text 
in American church-state relations that infrequently sacri-
fices conceptual sophistication for the sake of accessibility,
that exposes the multiple valences of church-state conflicts 
with the main institutional dynamics of American govern- 
ment, and that balances historical sensibilities with a contem- 
porary focus. Complemented by readings on religious groups’ 
involvements in the union, civil rights, and other progressive 
movements, this book would serve very well as one among 
several main texts in a course on religion and politics in the 
United States. The book would also work quite nicely in 
courses on social movements, law and American society, or 
American conservatism. 


Budgeting Entitlements: The Politics of Food Stamps. By
Ronald F. King. Washington, DC: Georgetown University
Press, 2000. 256p. $65.00.

Eric M. Patashnik, University of California, Los Angeles

This is one of the best works so far in integrating budgeting 
into the general study of the American welfare state. The
events of the past twenty years attest to the importance of 
making connections between budgeting and social policy- 
making. Since the presidency of Ronald Reagan, the budget 
has been at the center of ideological struggles over the future 
of activist government. We have witnessed the “fiscalization” 
of public social provision. Conflicts over Social Security, 
Medicare, and the tax revenues available to finance future 
program expansions are played out in budgetary debates. If 
one wishes to understand the current prospects for U.S. 
social policymaking, it is crucial to examine the politics of the 
budgetary process.

By the same token, contemporary budget scholars must
pay close attention to the dynamics of social policymaking.
When Aaron Wildavsky wrote his classic 1964 book, The 
Politics of the Budgetary Process, most domestic spending 
supported the internal operations of the bureaucracy. Since 
the 1970s, however, a growing share of the domestic budget 
has been spent on transfer payments for individuals. Many 
transfer programs have been designed as mandatory entitle- 
ments. Under ordinary agency budgeting, the appropriations 
committees have the license to authorize funds as they see fit. 
Under entitlement budgeting, expenditures are on automatic
pilot unless budget guardians manage somehow to reclaim 
control. In sum, the expansion of welfare entitlements has 
fundamentally changed the political economy of American 
national budgeting. 

In this stimulating and thoroughly researched book, Ro- 
nald King explores the clash between entitlement protections 
and budgetary concerns. His analysis uses the food stamp 
program as its primary empirical vehicle, but it sets forth a 
conceptual framework that is helpful in understanding more 
generally how policy design influences the politics of resource 
allocation decisions. King’s main empirical claim is that the 
institutional rules of the budget game fundamentally shape 
both the strategies of individual players and the outcomes of 
collective decisions, although not always in intended ways. 
For King, entitlement budgeting also serves as a setting for 
exploring the normative tension between procedural and 
substantive justice. Budgeting implies a discretionary choice 
among competing purposes, yet entitlements stand as prior 
claims that are insulated from ordinary fiscal control. The 
tension between entitlement and spending restraint seems to 
be irresolvable. King provocatively insists that confrontation 
with this basic moral dilemma is a continuing obligation of an 
engaged citizenry. 

The book’s most important contribution is methodological. 
King uses rational choice models both to generate hypotheses 
about the effect of institutional rules on actors’ behavior and 
to provide an interpretive framework for ordering his empir- 
ical findings. This is a potentially fruitful yet still uncommon 
approach in policy studies. Many political scientists with 
substantive policy concerns shun rational choice concepts and 
analytic models, viewing them as a barrier to sensitive 
understanding of case materials. Formal theorists either 
avoid confrontation with the messy details of policy develop- 
ment or else use these details merely to establish the plausi- 
bility of some off-the-shelf model. King demonstrates that it 
is possible to make good use of analytic models in policy 
research. One can attempt to strike a balance between 
descriptive realism and analytic power. Some historical-
institutionalists may criticize King for being too abstract.
Game theorists may attack him for not being analytically 
rigorous enough. Given King’s own purposes, I think he has 
the balance between empirical richness and analytic power 
about right. 

King’s models have three distinguishing characteristics. 
First, they are elementary; they involve no high-powered
math. Second, they were deliberately formulated with a 
specific empirical application in mind—the evolution of the 
food stamp program since the 1960s. The models clearly have 
broader implications, but King makes no attempt to develop 
universal theories of politics. Third, the models do not make 
heroic assumptions about the effectiveness of the policy 
process. They allow for policy outcomes that are the product 
of neither consensus nor rational deliberation. 

At the core of the models is the notion of a “reversion 
point,” the budget outcome that will prevail by default if no 
bargaining agreement is reached. The location of the rever- 
sion point at any moment depends crucially on the specific 
budget rule then in effect. Over the food stamp program’s 
history, three distinct budget rules have been employed. King 
skillfully exploits this institutional variance to explore how 
changes in the reversion point alter the logic of bargaining 
strategies of the players. The first rule is ordinary discretion- 
ary budgeting, which governed food stamps during the pro- 
gram’s formative years. Spending cannot take place absent 
the passage of an appropriations measure, so the reversion 
point is zero. Next, King analyzes the entitlement rule, under 
which the reversion point is determined by the number of 
eligible persons who seek assistance and the level of their 
determined need. The third budget rule, which applied to 
food stamps during much of the Reagan-Bush era, is a 
spending cap that allows outlays to increase but only up to a 
predetermined level. The reversion point exists at the cap 
threshold. 

According to King, changes in the reversion point do not 
eliminate conflict over food stamp spending; rather, they 
influence the way budget actors play the game. His most 
interesting insight is that the incentives for aggressive behav- 
ior in budget negotiations vary with the form of budget rule. 
Under discretionary budgeting, the incentive for moderation 
is relatively strong because the failure to reach agreement 
will cause programs to receive zero funding, an outcome so 
extreme that even conservatives who decry excessive govern- 
ment spending are likely to run from it. Under pure entitle- 
ment rules, in contrast, budgetary posturing and position 
taking are less politically risky because everyone recognizes 
that obligatory costs eventually will be met. 

The many strengths of the book notwithstanding, some 
constructive criticisms can be offered. The analysis makes no 
attempt to test statistically the effect of changes in budget 
rules on either bargaining dynamics or policy outcomes,
although trend data on the food stamp budget are presented 
in an appendix. King rightly emphasizes that all entitlements 
are not alike, but he misses an opportunity to say how his 
models might be modified if applied, say, to a contributory 
entitlement such as Social Security, for which expenditures 
are permanently appropriated yet constrained by a dedicated 
revenue base. In the concluding chapter, King suggests that 
the budgetary precommitment established through the enti- 
tlement mechanism can be seen as an instance of “self- 
binding,” whereby citizens prevent themselves from reneging 
on promises they wish to keep. Another possible interpreta- 
tion—one that seems to fit better with King’s facts—is that 
the entitlement form (as well as the cap mechanism) embodies
attempts by particular factions to lock in their political 
agendas by binding their current and future opponents. In
government budgeting, structural design is used not only to
solve the “time-inconsistency” problem but also to reduce 
political uncertainty. 

These caveats aside, Budgeting Entitlements makes an 
important contribution. The book is particularly appropriate
for advanced undergraduate and graduate courses that stress 
social policy, political economy, and public administration. It 
is essential reading for budget specialists, welfare state 
researchers, and policy scholars interested in drawing on 
rational choice concepts in their work. 


Divided Arsenal: Race and the American State during World 
War II. By Daniel Kryder. New York: Cambridge Univer-
sity Press, 2000. 301p. $29.95.

Philip A. Klinkner, Hamilton College 

Scholars of comparative politics and international relations
have long been aware of the importance of war to state
development, but only recently have scholars of American
politics come to the same conclusion. In particular, they have
begun to examine the importance of wars and their aftermath
to the development of the American state. Daniel Kryder
contributes significantly to this trend by examining how the
American state sought to manage the racial friction created
by World War II. He takes on Gunnar Myrdal’s view that this
conflict, like the American Revolution, the Civil War, and
World War I, would lead to significant advances for African
Americans. According to Myrdal, the ideological nature of
the war against fascism inspired white Americans to redouble
their efforts to make the democratic and egalitarian ideals of
the American creed a reality for all.

Kryder shows that state actors took a less idealist view of 
the war; they focused less on pursuing ideological reform 
agendas and more on the pragmatic tasks of maintaining an 
orderly and efficient military mobilization as well as assuring 
continued electoral success. The analysis focuses on federal 
manpower policies governing black factory workers, farm 
laborers, and army troops. In each case, Kryder provides a 
thorough and sophisticated description that draws heavily 
upon government records and archives. In particular, he 
offers an excellent description of the waves of unrest among 
black soldiers, most of whom were stationed in the South. 

Kryder shows that executive branch bureaucrats were 
concerned less with advancing a particular racial agenda, 
egalitarian or otherwise, than with minimizing racial conflicts 
that might harm the war effort or endanger the Democratic 
Party’s increasingly fragile coalition of conservative southern 
whites, urban blacks, and northern white liberals. At times, 
this led to policies that were somewhat racially progressive, 
such as the Fair Employment Practices Commission in 1941 
or the reduction in segregation on military bases. At other 
times, these goals led in the opposite direction, such as the 
initiation of FBI surveillance of urban blacks. Indeed, “the 
Roosevelt administration implemented policies that may 
have appeared progressive, but other purposes—the full 
mobilization of industrial production and the maintenance of 
the party coalition—outweighed in importance the principle
and goal of egalitarian social reform” (p. 4). Kryder is 
certainly correct in stressing the limits of the executive 
branch’s desire to alter the racial status quo. From FDR on 
down, most saw winning the war as their primary and perhaps 
only goal. If the cause of racial equality were to be advanced, 
it would only be as a side effect or instrumental in the pursuit 
of victory. 

The only weakness of the book is that Kryder seeks to 
extrapolate from the case studies to make a larger statement 
about the effect of World War II on American race relations.
He concludes that the war had little lasting positive influence 
and, if anything, actually constrained the movement toward 
civil rights. Kryder seems to be using the civil rights advances 
of the 1960s as his benchmark for assessing the achievements 
of the 1940s, which clearly do not measure up. It seems that 
the more appropriate benchmark is the decades before the
1940s. Compared to the 1910s, 1920s, and 1930s, changes in 
federal policy regarding race relations were truly staggering. 
The federal government moved from at best ignoring racial
inequality to at worst actively engaging in discriminatory 
practices, to working to undo some aspects of that inequality, 
albeit in the circumscribed ways that Kryder discusses. In 
some respects, the federal government’s response to civil 
rights in the 1940s is like the dog that could dance—that he
did so poorly is far less significant than the fact that he did it 
at all. 

By focusing exclusively on federal manpower policies, 
Kryder overlooks other areas of government action. For 
example, during World War II the Department of Justice 
acted for the first time since Reconstruction to investigate 
lynchings as a violation of federal civil rights laws. Changes of 
equal or greater importance came in other branches of 
government, particularly the Supreme Court, which in 1944 
undid a major component of black disenfranchisement by 
banning white primaries. There were also changes in racial
attitudes and policies outside the federal government. State 
and local governments began to pass antidiscrimination 
legislation. Private actors began to jump on the civil rights 
bandwagon, as evidenced by the remarkable spread of local 
interracial councils in the mid-1940s and the integration of 
baseball immediately after the war. A fuller look at American 
race relations in World War II shows a somewhat more 
positive trend than Kryder suggests. 

But these are differences of degree rather than of kind. 
Divided Arsenal is a compelling and important analysis of 
American racial manpower policy in World War II. It
advances significantly our understanding of American race 
relations in this critical period and, more broadly, of the 
interrelationship among war, the state, and social move- 
ments. 


Diversity and Distrust: Civic Education in a Multicultural 
Democracy. By Stephen Macedo. Cambridge, MA: Har- 
vard University Press, 2000. 343p. $45.00. 

Renford Reese, Cal Poly Pomona University 

Stephen Macedo’s book is an episodic historical analysis of
the role of civic education in the United States. It has three
broad themes: “Public Schooling and American Citizenship,”
“Liberal Civic Education and Religious Fundamentalism,”
and “School Reform and Civic Education.” It is motivated by
the author’s conviction that much of the thinking surrounding
diversity and difference is misconceived. According to
Macedo, “diversity is not always of value, and it should not,
any more than other ideals, be accepted uncritically” (p. 3).

Macedo suggests that celebrating diversity should be sec-
ondary to a civic liberalism that advocates the legitimacy of
reasonable efforts to inculcate shared political virtues but
leaves deeper philosophical moral questions to private com-
munities. He supports a public philosophy of liberalism that
embraces civic ideals that are broad in their protection of
freedoms but not too deep. A main purpose of this book is
“to argue that we should not allow liberalism’s most alluring
features—broad freedoms, limited government, and ‘the
great pageant of diversity’—to obscure other dimensions of a
healthy, free, self-governing society” (p. 275).

In the first section, the author explains the purpose of the 
common school philosophy in the context of American 
democracy. That purpose is to acknowledge the diversity of 
students while homogenizing them into respectful American 
citizens. He highlights the significant contributions of educa- 
tion reformer John Dewey to the common school cause. 
Dewey's adamant embrace of the separation of church and 
state doctrine led him to be highly critical of dogmatic 
religious communities. 

The author critiques the historical problems public schools 
have confronted in accommodating the concerns of the 
Catholic Church. For example, Catholics protested the man- 
datory reading of the King James version of the Bible in 
schools. Instead, they wanted Catholic students to read from 
the Douay Bible. In the author’s judgment, the demands of 
Catholics (and other religious groups) fall outside his con- 
ception of public reasonableness. According to Macedo, 
“Catholics were at the forefront of opposition to common 
schools not simply because some school materials and prac- 
tices were anti-Catholic, but because some in the Catholic 
hierarchy rejected legitimate civic ends” (p. 7). Over time, 
Catholics stopped seeking concessions from the common 
schools and established a separate system of parochial edu-
cation.

Much of the weight of this book is put on the challenge of 
accommodating religious diversity in public schools in the 
United States. The analysis centers on the issue of religious 
sectarianism and the establishment and free exercise clauses 
of the First Amendment. In fact, the 1983 Mozert v. Hawkins 
case guides most of the discussion on the limits of tolerance 
and accommodation. Fundamentalist Christian families in 
Hawkins County, Tennessee, charged that a reading series 
required by the school system denigrated their religious 
views, and they asked school officials to allow their children 
to opt out of the reading program. The school refused, and 
ultimately the courts upheld the school’s decision. Macedo 
states that the Mozert families had no right to be accommo- 
dated on principled ground. His deconstruction of this case 
gives a good example of his framework and position. He 
refers to this section as “The Intolerance for Educating for 
Tolerance” (p. 157). 

The author’s conception of liberal public reasonableness 
guards against the excessive zeal of religious fundamentalists. 
Macedo states that his model espouses a “tolerance for 
reasonable forms of diversity and respect for a wide array of 
freedoms” (p. 179). Given the author’s emphasis on religion, 
perhaps this book should have been entitled “Religious 
Diversity and Distrust: Civic Education in a Multicultural 
Democracy.” 

Much of the discourse on multiculturalism has become 
hackneyed, but Macedo offers a bold and refreshing philo- 
sophical discussion of the debate. This work represents sound 
scholarship. Unlike many in the fields of education, ethnic 
studies, and American studies who have written on multicul- 
turalism, Macedo grounds his work in political philosophy 
and constitutional principles. Indeed, he draws on John 
Locke, Thomas Hobbes, J.S. Mill, and John Rawls to guide 
his analysis. In his defense of a civic liberalism he outlines a 
range of public principles that move beyond the political 
liberalism of Rawls.

Rawls developed a template for a public morality that is 
not grounded in religion. The task for Macedo is more
specific: Promote moral education in public schools without 
infusing religion. Is this possible? Macedo agrees with Rawls 
that a public morality can be created from public ideals and 
principles espoused by liberal democratic institutions that are
not entangled in religious or deep philosophical beliefs. 

The book is insightful and engaging, but there are limita- 
tions. The author fails to specify what a moral education 
divorced of religious principles would entail: the teaching of 
mutual respect, cross-cultural understanding, sexual absti- 
nence, anger management, respect for the environment? 
How does a moral education avoid discussing the Golden 
Rule and its genesis? 

Although Macedo explores the on-going debate about 
school choice and market competition in school reform, he 
does not critically examine other important issues in the 
context of his tough liberalism and prudential accommoda- 
tion framework. For example, how would the author decon- 
struct affirmative action, cultural bias in standardized testing, 
gay rights, English only, and a moment of silence as they 
relate to public schools? By using these issues as examples, 
Macedo could have elucidated and strengthened his argu- 
ments. In fact, a critical examination of these topics could 
have put this well-written book at the center of various policy 
debates.

Irrespective of its limitations, this book is timely and 
relevant. Genuine education reforms cannot take place un-
less stakeholders have a conceptual understanding of the
intellectual overview that Macedo outlines. I will use it in my 
“Politics of the Public Policy Process” course to enhance
students’ philosophical understanding of the role of civic 
education in a multicultural society. 


Realignment and Party Revival: Understanding American 
Electoral Politics at the Turn of the Twenty-First Century.
By Arthur Paulson. Westport, CT: Praeger, 2000. 376p. 
$69.95. 

Pat Dunham, Duquesne University 

The topic of realignment has produced much discussion and
disagreement. Positions include: There was a realignment in
the late 1960s; there was a realignment only at the presiden-
tial level; there was a realignment only in the South; there has
been no realignment; there has been dealignment; and more.
Each succeeding election has been scrutinized by scholars to
determine how it supports or refutes these positions. In a
meticulously documented work, Arthur Paulson argues that
the United States has experienced both realignment and
dealignment, and something more as well.

Paulson believes the United States experienced a realign-
ment at the presidential level in the late 1960s and early
1970s. Congressional elections took a long time to catch up, 
however, and for many years the system exhibited signs of 
dealignment. With the Republican takeover of Congress in 
1994, we now have a system in which both the “top” and the 
“bottom” have completed a conservative realignment. “Ideo- 
logical polarization seems to be a much more linear trend in 
the Senate. ... But by 1996, the parties are clearly polarized 
along ideological lines in both the House and Senate” (p. 
196). No longer do the parties contain mavericks. Paulson 
also argues that political parties in the United States have 
been revitalized. They are important for raising money and 
supporting candidates, and they also reflect a new, ideologi- 
cally polarized party system. According to Paulson, this 
transformation opens the door to the potential for party 
government. Thus, he rejects the theories of dealignment and 
party decay. 

The historical examination in chapter 1 compares the 
system of 1896 to the system of 1996. In chapter 2 Paulson 
describes how the Democratic Party is a multifactional 
organization; power at its national conventions shifted from
conservative states of the South and West in 1896 to the more 
liberal states in the East by the middle of the twentieth 
century. Chapter 3 describes the Republican Party as a 
bifactional organization with Wall Street and Main Street 
wings. Paulson shows in chapter 4 how these factions re- 
aligned in the 1964-72 period. “The election of 1964 sud- 
denly correlates positively with 1996, and negatively with 
1896, for both parties” (p. 14). Chapter 5 examines nomina- 
tion reforms and their effect on the parties. Chapter 6 deals 
with the ideological homogenization that occurred and the 
revival that ultimately came about. In chapter 7 Paulson 
describes split-ticket voting and divided government as func- 
tions of the realignment of the 1960s. Chapter 8 discusses the 
roles played in voting by race and class. In chapter 9 Paulson 
offers his definition of realignment, and chapter 10 closes the 
book with a discussion of what this means for the future. 

Many of the author’s conclusions are not new (e.g., that 
race is a more important factor in voting than class in the 
United States, and that southern conservatives are now 
Republicans instead of Democrats), but Paulson does an 
excellent job of supporting his argument. Each chapter 
contains numerous tables with data from various levels: 
national voting statistics, voting by congressional district, 
voting by delegates to national conventions, and so on. This 
is an impressive effort. Table 6.2, for example, categorizes 
Republican presidential primaries by ideological alignment 
of states, which illustrates the homogenization of the vote in 
those primaries since 1988 (p. 165). 

Paulson writes that “dealignment theorists who have been 
‘waiting for Godot’ have been waiting for something that 
even realignment theory, properly understood, would not 
predict. They have been waiting for a realignment that would 
fit a rigid ahistorical model” (p. 18). Like Everett Carll Ladd, 
Paulson holds that previous realignments do not resemble 
one another, so scholars should not settle on one example of 
realignment and say this is how it should look. Paulson argues 
that, before 1896, there was no majority party; therefore, 
realignment need not involve the displacement of one ma- 
jority party with another. 

Paulson’s analysis rings quite true for those who participate 
in elections, but for years that group has been about half the 
eligible electorate. Some will question whether realignment 
can make sense when there is no majority party. According to 
Paulson, “this process of elite realignment is the product of 
electoral realignment” (p. 295), but what kind of electoral 
realignment is it when so many citizens refuse to participate? 

Some will take issue with Paulson’s conclusions about party
revival. The parties indeed are more ideologically consistent 
internally than in the past, but many potential voters and 
even many voters dislike the parties. As Paulson notes, 
parties no longer serve the rank-and-file through patronage 
and constituent services; instead, they serve candidates. 
Steven E. Schier (By Invitation Only, 2000) argues that the 
parties themselves discourage the public at large from par- 
ticipating in elections. Parties are stronger in some ways, but 
they also are less relevant to the public. Candidates know 
this. Party nominees for national office do not invoke party 
affiliation in their campaigns or even in their nomination 
acceptance speeches. Victors cannot say that election results 
constitute a mandate in favor of their party’s platform. 
Paulson recognizes that parties are different today—this is 
one of his main points—but does not believe that these 
differences impede the development of a party system. He
sees potential for party renewal through third parties, but the 
2000 election offers little support for this. In 2000 the Reform 
Party imploded, its nomination process keelhauled by sup-
porters of Pat Buchanan. The Green Party’s future is nebu-
lous, with many citizens annoyed about the role it played.
Paulson may well be correct in predicting the birth of a 
new, ideologically polarized party system. Given that few 
citizens seem to care, one doubts whether this realignment 
can have the kind of effect demonstrated by previous ones. 
Paulson contributes to the arguments about realignment, but 
he will not convince proponents of dealignment that they are
wrong.


Beyond Machiavelli: Policy Analysis Comes of Age. By Beryl
A. Radin. Washington, DC: Georgetown University Press,
2000. 200p. $45.00 cloth, $17.95 paper. 


Amy K. Donahue, University of Connecticut 


Many of us who teach in programs that award graduate 
degrees in public affairs, administration, or policy regularly 
confront the question: What do policy analysts do? Our 
students raise it as they seek to understand the policy analysis 
field, evaluate their potential role in it, and prepare for the 
job market. As faculties we wrestle with it as we configure 
and reconfigure curricula to meet the demands of public 
service in a complex policy environment. We contend with it 
as professors when we design course syllabi to teach the tools 
of the trade appropriately and adequately and as researchers 
when we pursue scholarship that draws on the entangled 
disciplines of policy analysis, implementation, and public 
management. 

Beryl Radin delivers a multidimensional response to this 
elemental and omnipresent query, set in the context of an 
historical retrospective on the field and profession of policy 
analysis. She draws on a broad range of literature and 
exemplar cases to provide an insightful analysis of the field’s 
evolution. Her work is rooted in pragmatism and experience, 
consciously focused on the implications of important changes 
in the field for the practitioners who populate it as much as 
for those who study and teach it. She thus provides a 
reflective tour that can both support debate about the field’s 
intellectual direction and serve as a useful guide to those 
pursuing careers in it. 

Radin artfully employs four devices to illuminate the realm 
of policy analysis as an intellectual pursuit and as a field of 
practice. The first and most central of these is comparison: 
She describes and contrasts the practice of policy analysis in 
the 1960s and the 1990s. She begins by painting a “portrait of 
the past,” describing the origins and nature of policy analysis 
as it emerged as a self-conscious discipline, initially tied to 
the demands of the Planning Programming and Budgeting 
System in the Department of Defense after World War II, 
and then in various key offices throughout the federal gov- 
ernment under President Johnson. In the next chapter, she 
details the expansion and maturation of the field, showing us 
its role throughout the branches and levels of government 
and also beyond the public sector, until we reach the present, 
where policy analysis responds to a diverse set of demands 
and decision makers across society. These chapters point to 
striking shifts in how analysis is viewed and used by stake- 
holders in the policy process, how the relationship between 
analysts and decision makers has been transformed, how the 
tools of the analytic trade have advanced, and especially how 
the context of policy analysis has changed. A modest flaw is 
that many of the salient lessons of history are implied, not 
consolidated and enhanced by critical examination, which 
leaves the reader to identify, interpret, and evaluate them. 

The second device Radin employs is detailed profiles and 
short case examples. These appear throughout the text to 
enhance and clarify the discussion, and chapter three is
devoted entirely to case presentation. She describes six very 
different organizations to highlight the diverse nature of 
policy analysis in terms of its locus, practitioners, clients, 
approach, tools, and contribution to policy design and imple- 
mentation. These “profiles of practice” bring home the overt 
and subtle tensions between ostensibly objective analysis and 
the political policymaking process by depicting actual insti- 
tutions and their unique circumstances. By the author’s 
design, these examples are not representative, but they raise 
the question of what lessons, embedded in these particular 
cases, are generalizable. 

Subsequent chapters take on this question, to some extent, 
as Radin applies her third approach to analysis: direct 
attention to key contextual factors that affect the nature of 
policy analysis both as a field and as a profession. This 
discussion is a particularly salient contribution of this work. 
At several points, Radin shows us ways in which wide-ranging 
changes in the institutions of government and in society more 
broadly have influenced the type of analysis in demand, the 
purposes for which it is solicited, and the very nature and 
quality of analysis itself. She points out how these shifts in the 
field have had ramifications for the norms of the profession, 
and in turn, for those who practice analysis. She raises a 
pertinent concern for practitioners as their linkage to the 
policy process evolves, decoupling somewhat from questions 
of policy design and intervention and more frequently engag- 
ing issues surrounding the evaluation of outcomes: that of 
how they should gauge the value and quality of their work. 
The question of the influences of the social and political 
environment on policy analysis is addressed at a general level 
that successfully orients the reader to prominent issues but 
belies the complexity of the analytic system and its interde- 
pendence with policymaking and implementation. 

As a fourth device, Radin reiterates the implications of 
substantive and contextual changes in the field with hypothet- 
ical composites. Each chapter is framed by a depiction of one 
of the two fictional analysts the author has created, one with 
the characteristics of an archetypal analyst of the 1960s, and 
one who typifies an analyst of the 1990s. These characters 
reflect on (and, in the final chapter, discuss) their jobs and 
roles to highlight the distinct nature of policy analysis in their 
respective decades. They also illustrate the personal and 
professional demands and dilemmas a practitioner in each 
circumstance would likely experience. This technique is gim- 
micky and does not contribute to the substance of the 
discussion provided in the body of each chapter, but it makes 
the discussion accessible, particularly to students, for whom 
these vignettes can draw attention to the more subtle dimen- 
sions of each chapter’s presentation. 

The strength of this work is a direct approach that makes
it accessible and useful to scholars, students, and practitio- 
ners alike. To students, it offers a valuable counterpoint to 
the range of “how to” policy analysis texts that can help 
students envision analytic techniques in practice and them- 
selves in the field. To teachers it offers a perspective that 
focuses attention on what graduate schools provide and how 
well we prepare students for the exigencies of the policy
analysis world. To researchers, it offers an intellectual history 
that can prompt reevaluation of the role, contributions, and 
responsibilities of policy analysis and of the relationship 
between policy analysis and other disciplines and fields. An 
important attribute of this book—its broad appeal—also is a 
disadvantage: The broad brush approach forgoes depth on 
some of the knottier issues of how high-quality policy analysis 
can be accomplished within and contribute meaningfully to 
today’s complex, fast-paced, and interdependent governance 
processes. Many of these issues are raised, which is helpful,
but few are explored in detail. Nonetheless, Radin does what 
she commits to do: She provides fertile ground for serious 
discussion of the field’s foundations and future. 


Republicans in the South: Voting for the State House, Voting 
for the White House. By Terrel L. Rhodes. Westport, CT: 
Praeger, 2000. 160p. $49.95. 


Aubrey Jewett, University of Central Florida


As the title indicates, this concise yet thorough book seeks to 
document and explain the extent of Republican voting and 
electoral success in the South. It examines in great detail the 
percentage of Republican vote received by presidential, 
senatorial, congressional, and gubernatorial candidates. It 
also looks at the number of GOP wins for these offices and 
for both houses of state legislatures. The South is defined as 
the eleven states of the former Confederacy, and numerous 
tables and graphs are presented for individual states, for the 
peripheral and deep South states as a group, and for the 
South overall. Trends are tracked from the 1960s to the 
present, with the exception of presidential voting, which is 
traced from several decades earlier. 

The book establishes in the first chapter several main 
themes that add to and/or reinforce the literature on south- 
ern politics and partisan change. The first and most impor- 
tant is that to understand adequately Republican growth in 
the South, one must examine both national and subnational 
gains. The second is that GOP gains have occurred unevenly 
over time, across the southern states and for different offices. 
The third is that despite the increased number of elephants in 
the cotton field, the Democratic Party remains very compet- 
itive. The book also makes clear that increasing Republican 
strength has had and will have important political conse- 
quences for the South and for the nation. An important 
secondary theme that emerges is that the growth of GOP 
strength in the South has many causes. 

Chapters 2-4 cover southern Republican electoral pat- 
terns for president, Congress, and state offices, respectively. 
The evidence is presented very clearly with tables and figures, 
and a rich narrative helps bring the data to life. The 
penetration of Republicanism at the presidential level is 
covered thoroughly, as is the propensity of southern voters to 
turn to third parties. The section detailing the critical nature 
of the South to GOP electoral vote strategies is particularly 
timely, given George W. Bush’s razor-thin margin in the 
electoral college and in Florida, as well as his ability to win 
the rest of the South, including Al Gore’s home state of 
Tennessee. Yet, the chapter also clearly points out that the 
South may be vital to Republican presidential success, but it 
is not a sure thing, as demonstrated by Democrat Bill 
Clinton’s assorted southern state victories over two elections.

Southern GOP senators, beginning with John Tower of 
Texas in 1961, emerged in the 1960s. Except for the election 
of Strom Thurmond in South Carolina, these gains were 
largely limited to the peripheral South until the 1980s. The 
early strategy for Senate GOP victories set the tone for what 
has now become commonplace: Republicans run as
conservative candidates who paint their Democratic opponents as
liberals who follow “policies set by Northerners and minority 
groups” (p. 44). The GOP was clearly competitive by the late 
1990s, but Democrats continue to show vitality at the
senatorial level. The 2000 Senate elections clearly illuminate both
these trends: GOP candidate George Allen beat incumbent 
Democrat Chuck Robb in Virginia, but Democrats Bill 
Nelson and Zell Miller gained formerly Republican seats in
Florida and Georgia. 

In the House, Republicans from the South, often repre- 
senting Tennessee, have been elected to Congress since the 
end of Reconstruction. Since the 1960s, GOP congressional 
increases have been particularly affected by changes in the 
racial climate and in redistricting rules. Republicans are 
consistently dominant in the majority of House elections in 
the South today and exercise considerable influence over 
national policy through leadership positions in both the 
House and Senate. 

GOP governors had taken office in Florida and Arkansas 
by 1966, and GOP gubernatorial candidates had become 
competitive in all the southern states by the 1990s. Yet, 
Republican success varies tremendously (Georgia has not 
elected a Republican governor since Reconstruction), and 
fiscally conservative, socially moderate Democrats are still 
tough to beat in southern gubernatorial races. Democratic 
gains in Alabama, South Carolina, and Mississippi in the late 
1990s attest to this fact. 

Significant GOP peripheral South legislative gains began 
as early as the 1960s. By the 1990s, Florida, Virginia, and 
North Carolina Republicans held majority status in one or
both branches of the state legislature, although North Caro- 
lina has since reverted to Democratic control. In the deep 
South, GOP gains lagged until the 1980s. With the exception 
of South Carolina and Georgia, they remain at relatively low 
levels today. It is clear that the strongest vestiges of the 
one-party South can be found in the continued viability of 
Democratic governors and state legislators. 

Chapter 5 creates indices of party competition by multi- 
plying the total Republican percentage of votes for an office 
by two. The state legislative index is created as a lag score 
based on how many seats Republicans gained compared to 
what they might have been expected to pick up based on 
previous GOP gubernatorial vote percentages. These indices 
provide easily understood visual summaries of the trends in 
party competition at various levels over forty years. 

Almost every major explanation for southern Republican 
growth is addressed at some point in the book. Nevertheless, 
explanatory power could have been improved in three ways. 
First, more data-driven evidence could have been used to 
explain GOP growth. The book offers a plethora of such 
evidence to describe Republican gains, but it relies more on 
reviews of existing research when it comes to explanation. 
Second, the last chapter, which provides the most concise 
explanation for the partisan changes documented in the 
previous five, is organized around race, gender, and religion. 
These are clearly important, but other independent variables 
are given relatively less weight than they deserve. Factors that 
Rhodes discusses earlier but that probably deserve equal 
billing include ideology, wealth, one-person, one-vote
redistricting, coattails, midterm effect, white northern migration,
urbanization, Watergate, and personal candidate character- 
istics, such as the Reagan and/or Clinton effect. On a related 
point, chapter 6 uses V. O. Key’s realignment criteria and 
theory of critical elections very effectively to evaluate and 
declare the existence of partisan change in the South, but 
Key’s equally important views concerning secular
realignment are not applied.

Overall, Rhodes makes a valuable contribution to the 
literature on southern politics and partisan change. He does 
a superior job of documenting the extent of Republican 
growth in the South at all levels of officeholding, from the 
White House to the state house. He also does a very good job 
of explaining why those changes have occurred and why they 
are important for state and national politics and policy. The 
book’s rich description, empirical data, and important themes
make it a must read for professors, pundits, politicians, and 
graduate students. Its accessible style makes it highly recom- 
mended for undergraduates and lay people as well. 


Roots of Reform: Farmers, Workers, and the American 
State, 1877-1917. By Elizabeth Sanders. Chicago: Univer- 
sity of Chicago Press, 1999. 528p. $48.00 cloth, $16.00 
paper. 

Michael Goldfield, Wayne State University 


Roots of Reform is a richly researched book, filled with 
important insights about U.S. politics. In a striking rebuke to 
state autonomy proponents, Sanders convincingly argues for 
the importance of radical agrarian movements (and their 
often reluctant labor allies and junior partners) in shaping 
the national political agenda and developing the modern 
state from the end of Reconstruction to the beginning of 
World War I. Radical agrarians and their allies, though
forcing the issues and bringing them to the fore, had their 
agendas constrained, channeled, reshaped, and modified by 
state actors and most often by the capitalist class, who
nevertheless would have preferred another agenda. The list 
of state policies that farmers affected is impressive: “the 
redefinition of trade policy; the creation of an income tax, a 
new, publicly controlled banking and currency system; anti- 
trust policy; the regulation of agricultural marketing net- 
works; a nationally financed road system; federal control of 
railroads, ocean shipping, and early telecommunications; and 
agricultural and vocational education. These constituted the 
core of the agrarian political agenda” (pp. 7-8). 

The agrarians were highly politicized, represented little 
more than one-third of the labor force, acted both inside and 
outside the Democratic Party, and at times reached out to 
African Americans and more consistently to workers. Sand- 
ers explains one of the anomalies of the Progressive Era—the 
spate of prolabor legislation at a time when organized labor 
was weak, relatively inattentive, and undermobilized politi- 
cally. It was labor’s agrarian allies who forced through 
legislation that included the eight-hour laws, railroad and 
seamen’s labor rights, federal workmen’s compensation, and 
much more, including the path-breaking 1932 Norris-La- 
Guardia Act, a legacy of the farmer-labor alliance. Sanders 
argues that it was the weakness of this alliance, specifically 
the weakness of labor and the unwillingness of organized 
labor to hold up its share of the bargain, that doomed 
radicalism, led to the failure of an independent farmer-labor 
party, and is the root cause of American exceptionalism. This 
is, as I will suggest, a bit too simple. 

Such a broadly based account, of course, is bound to have
lots of minor problems. One of the author’s main intellectual 
targets is the liberal elite theorist Gabriel Kolko, whom 
Sanders characterizes as a Marxist despite the somewhat 
emotive anti-Marxist diatribes in his work, which suggests a
lack of awareness of one of the most important schools of 
social and political analysis. Similarly, Sanders is murky on 
the split within the Socialist Party that led to the formation of 
the Communist Party; the latter, despite its weaknesses, 
would have to be considered superior (from her perspective) 
on the issues of race, immigration, and national chauvinism. 

Two problems cry out for a deeper analysis. Sanders, like 
Richard Bensel, divides the country into economic zones, the 
most important being the core (manufacturing and capitalist 
controlled) and periphery (highly agricultural). The conflict 
between them is allegedly the moving force of U.S. politics. 
Yet, she never reconciles the class differences among
agrarians. Somehow the Dixiecrats (and plantation owners) of the
early twentieth century represent a continuity of interests 
with the black and white Populists, whom the southern ruling 
class brutally crushed, lynched, and disenfranchised. Here 
and there are inklings of these class differences and what they 
might have meant for the national, radical farmers’
movement as a whole, but they are never analytically confronted.

The most striking omission is with respect to race. There 
are hundreds of examples of the racist actions of agrarians, as 
well as the American Federation of Labor and other trade 
unionists, but these never figure into the analysis. William 
Jennings Bryan is the unsung hero of the book, but his 
virulent racism (Sanders fails to mention he was a supporter 
of the Ku Klux Klan) is seen as a wart rather than a problem 
for the movement. It can be plausibly argued that workers 
and farmers, at their most radical and effective, were antira- 
cist, and that acceptance of racism destroyed the vitality of 
farmers, not just trade unions. Dozens of instances can be 
gleaned from reading the small print in the book. Of special 
importance is the pivotal 1924 farmer-labor presidential 
campaign of Wisconsin’s Robert LaFollette, who had wide 
support in the Midwest and West but little in the supposedly 
agrarian radical South, in good part due to LaFollette’s 
advocacy of civil rights for blacks. In the South, of course,
that was a direct threat to the white supremacist control of 
black labor by southern cotton planters but not much of a 
threat to the interests of poor white farmers. It was not 
merely the failures of labor but the weaknesses of agrarian 
radicalism, particularly in the South around issues of race,
that foreshadowed the ultimate failure of the movement. A 
deeper probing of these issues would have dramatically 
affected the underlying analysis. 

This all said, the book is rich, provocative, and informative, 
and it is a must read for students of American politics. 


American Business and Political Power: Public Opinion, 
Elections, and Democracy. By Mark A. Smith. Chicago: 
University of Chicago Press, 2000. 245p. $39.00 cloth, 
$16.00 paper. 


John F. Camobreco, Christopher Newport University 


The question of who holds meaningful political power in 
democratic societies motivated some of the most important 
works in American political science in the 1950s, 1960s, and 
early 1970s. Scholars such as Robert Dahl and Charles 
Lindblom explored the power structure of America and 
noted the constraints imposed upon representative democ- 
racy by a capitalist system. These types of examination seem 
to have fallen out of favor over the last several decades, 
probably because of problems surrounding the measurement 
of a concept such as power and the general feeling that such 
questions can never be answered with any high degree of 
certainty. This has been a regrettable development for the 
discipline. Despite its difficulty, the question of who holds 
political power and how it is exercised remains among the 
most important to be asked about politics and government. 

The exploration of political power in America—specifi- 
cally, the ability of business to control political outcomes— 
has made an ambitious and triumphant return in Mark A. 
Smith’s American Business and Political Power. Smith starts 
by reviewing the literature on the power of business in the 
United States, and he wisely casts his view beyond main- 
stream political science literature to take in the work of C. 
Wright Mills, G. William Domhoff, and Ralph Miliband,
among others. Smith concludes that the pluralists, elitists, 
and class theorists may differ in their perspectives on the 
extent of business power, but they all share the assumption
that a unified business front in the political arena will result 
in business being able to control political outcomes. Yet, 
Smith’s findings refute this assumption: “Unity does not
increase the direct influence of business and reduce demo- 
cratic control by the citizenry. Instead, unity coincides with 
the opposite results. ... Policies match the collective desires 
of business only when citizens, through their policy prefer- 
ences and voting choices, embrace ideas and candidates 
supportive of what business wants” (p. 8). 

Key to Smith’s contention is the idea that not all political 
issues will result in a unified business front. Specifically, 
issues of interest to business are characterized as being 
“particularistic,” “conflictual,” or “unifying.” Essentially, uni- 
fying issues are those in which the entire business community 
has an interest; they involve governmental decisions that have 
the potential to affect all businesses, not just particular 
corporations or industries. Smith uncovers these unifying 
issues in an exhaustive search of the positions on national 
issues taken by the U.S. Chamber of Commerce from 1953 to 
1996. He finds that although these issues are fairly rare, they 
are also the most ideological, partisan, and salient. 

Smith creates a scale to measure the degree of business 
success on these issues and shows that this scale is highly 
associated with certain measures of citizen preferences, such 
as public opinion, public attitudes toward corporations, and 
the partisan composition of Congress. This relationship holds 
in the face of what Smith terms “overt” sources of business 
power and political mobilization, such as spending by
corporate political action committees and business lobbying
capacity. Smith also refutes the implication that business holds
more subtle, “structural” power that forces elected officials to 
enact policies favored by business, particularly during times 
of economic hardship. He determines that the real power of 
business lies in its ability, particularly over the last several 
decades, to influence public opinion through the support of
policy think tanks that hold favorable views on business. The 
mechanism through which this occurs is the news media 
visibility of researchers from conservative think tanks. 

Smith’s analysis is ambitious, and it is developed in a 
logical, thoughtful, and sophisticated manner that remains 
accessible throughout. It is also a reminder that many of the 
most important questions about politics are not answered 
with readily available data. Some may quibble with Smith’s 
choice of U.S. Chamber of Commerce positions to construct 
the dependent variable. Smith notes that this organization 
represents both small businesses and large corporations and 
is therefore a good proxy for the views of the entire business 
community. The potential problem is that when it comes to 
exerting political influence, small businesses are quite likely 
to be in a different league from large corporations. In fact, 
the real question of business power may be the extent to
which large corporations can influence governmental poli- 
cies. Smith demonstrates that the positions of the U.S. 
Chamber are nearly identical to those adopted by the Busi- 
ness Roundtable, which represents mostly large corporations, 
but this still does not address the question of the independent 
influence that might be exerted by large corporations. Such 
influence may come as a result of the increasing use of soft 
money contributions to the political parties, an issue Smith 
does not examine. 

It is a very significant finding that disagreements between 
the business community and the public on unifying issues are 
typically resolved in favor of the latter. Yet, Smith admits that 
on particularistic issues that do not receive much attention 
and affect only a few firms or industries, business interests are 
much more successful, the increased openness of the
legislative process notwithstanding. These types of issues are much
more prevalent than the unifying ones Smith examines, but 
despite the public’s seeming apathy toward them, the fact 
that business tends to be very successful in this arena erodes 
the democratic ideal perhaps more than Smith would like to 
admit, as not all particularistic interests are created equal. If 
political power is about who gets what and how, it has long 
been clear that business interests are more successful at 
securing benefits from the government than, for example, 
welfare mothers. 

Despite these criticisms, Smith’s study will certainly lead 
scholars to question the ability of business to exert political 
power at the national level in the absence of public support. 
And Smith raises some very important questions. His findings 
indicate little evidence of structural business power at the 
national level, but he notes that state and local governments 
are probably much more susceptible to the pressures of 
enacting favorable business policies. This is an especially 
important point in view of the drive over the last two decades 
to allow states more policymaking power. If Smith’s logic is 
correct, then the devolution of policy responsibilities to the
states is something that business as a whole should welcome, 
given the potential benefits. It also should be noted that the 
end of the Cold War has allowed for a much more globalized 
system of capitalism, one in which increasing competition 
among nations could lead to greater structural power for 
business. 

Some may argue that public opinion favors more govern- 
mental power for the states, but as Smith shows, the greatest 
power of the business community may lie in its ability to 
influence public opinion. Smith’s findings indicate that busi- 
ness seems to have made a concerted effort at politically 
socializing the public in favor of probusiness positions since 
the business failures of the 1960s and early 1970s. Because a 
dim view of government is likely to work in favor of business, 
this effort has probably benefited from declining public 
confidence in governmental institutions in the wake of the 
Vietnam War and the Watergate scandal. But Smith shows 
that it is important to explore whether declining public 
confidence has been exacerbated by unflattering media por- 
trayals of governmental behavior, not only in news broadcasts 
but also in television shows, movies, and so on. 

Like all significant and able scholarship, Smith’s book 
raises a number of important questions. It is gratifying to see 
an examination of such a fundamental and enduring political 
question addressed again by a mainstream political scientist. 
The book would be an excellent addition to graduate courses 
on political behavior and even upper level undergraduate 
courses on public opinion, elections, and democratic theory. 


The Regulatory Craft: Controlling Risks, Solving Problems, 
and Managing Compliance. By Malcolm K. Sparrow. 
Washington, DC: Brookings Institution Press, 2000. 346p. 
$20.95 paper. 


Thomas W. Church, State University of New York at Albany 


Regulation has never been popular in the public mind; like 
death and taxes, it is regarded—at best—as an unpleasant 
inevitability. Calls for deregulation hearken back at least to 
the Nixon administration, and the theme of reducing the 
alleged burdens of regulation has become a kind of
bipartisan mantra, advanced by every American president in the
latter third of the twentieth century. Scholars have not been 
appreciably kinder. Criticism oscillates between the charge of 
regulatory capture and the accompanying danger of pliant 
regulators falling into the clutches of their clientele, and the 
more recent, and virtually opposite, concern with
regulatory unreasonableness and the tendency of governmental
agencies to enforce regulations in a mechanistic and irration- 
ally aggressive manner. 

In this pervasive environment of what my students would 
term “dissing” regulation, Regulatory Craft seems an extraor- 
dinary anomaly. Malcolm Sparrow presents a positive, almost 
affectionate, portrayal of regulation and the beleaguered 
administrators charged with its implementation. He
entertains no doubts about the continuing need for command and
control forms of social policy, and he does not apologize for 
its coercive elements or its assumption that bad guys will 
always be with us. The raging debate over the need for less 
coercive, market-based alternatives to regulation is conspic- 
uous by its absence in these pages. 

The author directs his attention to the administrators who 
implement regulatory programs. Scholars are notably absent 
from Sparrow’s description of his intended audience. Yet, 
academic students of regulation and public policy will find 
much to like in this book. It is a refreshing antidote to recent 
“pop” public administration, especially the more extreme 
manifestations of the reinventing government movement, 
which applied customer satisfaction as a standard of govern- 
mental excellence, willy-nilly, to a range of activities for 
which it was inappropriate at best. (My favorite example of 
this pathology is Sparrow’s discussion of the effort to “rein- 
vent” the customs service, which included a wrenching and 
debilitating debate over whether smugglers should be consid- 
ered among the agency’s “customers.”) 

The central argument is deceptively simple: Sparrow urges 
regulators to “pick important problems and fix them” (p. xvi). 
He first discusses why this approach is both novel and 
necessary, setting the argument in the context of a highly 
useful analysis of the basic theoretical and pragmatic prob- 
lems of regulatory practice. Regulators, he argues, are always 
short of resources and unable to enforce all their mandates. 
Thus, the exercise of administrative discretion in allocating 
scarce resources is inevitable. Although these determinations 
necessarily result in an implicit ranking of the various risks 
the agency is responsible for addressing, political (and, 
increasingly, legal) reality presses administrators to deny the 
existence of such discretion. As a result, hazards are ad- 
dressed more or less randomly, based on the cases that come 
in the door as well as the organizational structure and 
professional orientation of the agency’s staff. The simple 
admonition to pick important problems, then, is not politi- 
cally, legally, or bureaucratically easy. 

Once important problems have been selected, they need to 
be “fixed.” Sparrow provides an extended discussion of 
problem-solving techniques, risk assessment and control, and 
administrative and organizational structures. The central 
sections of the book are a virtual manual for practitioners. 
According to Sparrow, the essence of craft, and the missing 
ingredient in most operating regulatory programs, is choosing 
the right tool for the job. His diagnosis of this most basic 
problem of regulation is not new: Robert Kagan, Eugene 
Bardach, John Braithwaite, and others have written exten- 
sively on the propensity of regulators to fall victim to what my 
colleague, Robert Nakamura, calls the “hammer and nail 
pathology”: When the only tool you know how to use is a 
hammer, every problem looks like a nail. But Sparrow lays 
out the problem in an intelligent and comprehensive way, and 
he goes where few academic observers have ventured: He 
proposes concrete policies and administrative structures and 
procedures that have been shown to overcome it. I found that 
the sections dealing with problem-solving infrastructure and 
“finding resources and making space” presented a bit more 
practical advice than the academic in me cared to dwell upon,
but the main audience for this book—administrative practi- 
tioners—probably will find these crucial. 

The book is written with an almost crusading zeal, and 
Sparrow argues persuasively for a mission for regulators that 
vests them with the power to select the most important risks 
in their inevitably overcrowded menu of social dangers, and 
then to design comprehensive and innovative strategies to 
address them. I am not troubled by the frequent absence of 
academically defensible statistics to demonstrate the effec- 
tiveness of the various programs he describes, and I do not 
disagree with his message of what might be termed simple 
regulatory rationality. I do have doubts regarding the practi- 
cality of his overall project, particularly in the context of 
regulatory agencies in the United States. 

It is perhaps not incidental that before his appointment at 
Harvard’s Kennedy School Sparrow served as a senior police 
officer in Great Britain, for elements of his prescription seem 
more suited to the world of British policing than the more 
acrimonious and politically charged world of American reg- 
ulatory agencies. A growing literature in comparative regu- 
lation shows European countries to have political cultures 
that evince popular trust of public servants and the regulatory 
structures they maintain, that vest administrators with con- 
siderable discretion, and that allow them to make and 
enforce regulatory policies with substantial input and coop- 
eration from the regulated community. Regulatory policy in 
the United States, in contrast, is characterized by adversarial 
legalism and “going by the book”; distrust of government 
officials is a hallmark of the constitutional system, and
regulators frequently find political refuge only by enforcing 
policy in a rigid and confrontational manner. 

Sparrow cites a variety of American examples in which 
state and federal officials have persevered and succeeded in 
overcoming the obstacles posed by politics, bureaucracy, and 
the law, but he is not sanguine about the difficulty of the 
mission he is urging on regulators. Indeed, he frequently 
speaks of the myriad problems involved in regulatory craft, as 
he defines it. Yet, after reading his stories of brave adminis- 
trators slaying the dragons of inertia, resistance, and timidity, 
I was left with the question that has plagued past efforts at
reform in courts and criminal justice systems: Why should we 
expect rational administrators to buck the system and under- 
take the personal risks and undeniable aggravation involved 
in his prescriptions, when the path of least resistance is so 
safe and well trod? Where are the incentives for such difficult 
and dangerous undertakings? 

Sparrow suggests that success in picking important prob- 
lems and solving them will provide administrators a respite 
from competing and conflicting demands and may give them 
a means of publicly justifying their activities through concrete 
accomplishments in reducing societal risks. But he also 
admits that such successes can only be demonstrated through 
a fundamental reconceptualization of the performance mea- 
sures typically applied to regulatory programs. At bottom, I 
wonder whether Sparrow's notion of regulatory craft can 
exist in a political and legal environment that seems so 
fundamentally unsupportive. 

It may be in the legal arena that Sparrow’s prescription 
ultimately will founder. The fundamental element of regula- 
tory craft is the rational and purposive use of administrative 
discretion. Sparrow tells public officials to use the law as a 
starting point but not to be bound by “the strict limits of 
existing law” (p. 254). In what might be termed the regulatory 
craftsman’s creed, he explicitly tells administrators to “cor- 
rect, by using your judgment, deficiencies of law” (p. 28). 
Unfortunately, as Sparrow on occasion admits, federal courts 
have become increasingly intolerant of such extralegal behavior
by federal regulators. Indeed, at this writing, the Supreme
Court is considering a case involving the Environmental 
Protection Agency that threatens to invalidate the more 
limited but pervasive exercise of administrative discretion 
that has characterized federal regulatory legislation since the 
New Deal. 

These practical concerns aside, scholars and practitioners 
alike should find this book provocative, insightful, and per- 
haps even inspiring. The optimist in me hopes Sparrow’s 
advice is followed and celebrated by regulatory agencies 
throughout the government; the pessimistic student of failed 
reforms harbors doubts. 


Class and Party in American Politics. By Jeffrey M. Stone- 
cash. Boulder, CO: Westview, 2000. 192p. $29.00. 


Patrick Fisher, Monmouth University 


Jeffrey Stonecash examines the role that class political divi- 
sions play in the United States. The conventional wisdom is 
that even though inequality has increased over the past 30 
years, the Democratic Party has failed to address equality of 
opportunity issues. Consequently, the Democrats have alien- 
ated the white working class, and traditional New Deal class 
political divisions have altered. Stonecash finds this conclu- 
sion erroneous. Contrary to previous studies (e.g., Paul 
Abramson, Generational Change in American Politics, 1976; 
Everett Carll Ladd, “Liberalism Turned Upside Down: The 
Inversion of the New Deal Order,” Political Science Quarterly
91 [Winter 1976-7]: 577-600; and William J. Keefe, Parties,
Politics, and Public Policy in America, 1994), Stonecash argues 
persuasively that the less affluent now support the Democrats 
more than they did in the 1950s and 1960s. Thus, class 
divisions in American politics have been increasing, not 
decreasing as is commonly believed. 

A number of factors have been working against the Dem- 
ocratic Party in recent years that seem to indicate an unfa- 
vorable environment for the Democrats. According to opin- 
ion polls, the public favors a less active role for the national 
government. At the same time, traditionally Republican 
areas—the Sunbelt and the suburbs—have grown much 
more rapidly than Democratic strongholds in the Rust Belt 
and the cities. Furthermore, a number of social issues, such as 
welfare and crime, are widely believed to have hurt the 
Democrats, but they have still been able to remain compet- 
itive in national elections. Analyzing class-related data from 
the National Election Studies Cumulative File (1948-98), 
Stonecash finds that the Democrats have developed a sub- 
stantial electoral base in recent years due to growing inequal- 
ities in American society. Not only has the distribution of 
income become more unequal, but also inequalities among 
communities, in tax burdens, in access to higher education, in 
health care, and in pensions have grown considerably. The 
country as a whole may be becoming more prosperous, but 
the less affluent are not necessarily sharing in the prosperity. 
The result, Stonecash finds, is that lower income Americans 
are becoming more Democratic in their partisan preferences. 

Less affluent Americans became a pillar of the Democratic 
Party beginning with the New Deal. As Stonecash notes, 
ideological conflicts in the 1930s over the role of government 
resulted in significant class political divisions. The Democrats 
were seen as sympathetic to the poor, and lower income 
Americans were stereotyped as strong supporters of the 
Democratic Party. That may have been the case during the 
Great Depression, according to Stonecash, but by the 1950s 
the parties were not nearly as polarized on ideological and 
class grounds, and the major differences in their electoral
bases became “more regional than class-based” (p. 45). 
Considerable internal diversity in both parties contributed to 
confusion as to how they differed from each other. In the 
1950s and 1960s, as income in many U.S. House districts 
declined, the likelihood of a Democrat winning the seat 
increased, which suggests the party was more successful 
among the lower classes than the GOP. But this relationship 
was largely due to the fact that most low-income districts 
were in the South, which was still overwhelmingly Demo- 
cratic. When controlling for region, Stonecash finds “there 
was little relationship between district income levels and the 
success of Democrats” in the 1950s and 1960s (p. 47). 

Stonecash argues that the Democrats have increased their
appeal to the less affluent since the 1960s because both the 
constituencies and concerns of the parties have changed 
considerably. This argument is similar to that of E. J. Dionne 
(They Only Look Dead, 1997). As the Republican Party 
became more conservative, its electoral base moved from the 
North to the South and West, and its House membership 
became much more heavily based in more affluent districts. 
The Democrats became much more ideologically coherent as 
southern conservatives moved to the GOP, and they offset 
the losses in the South by gaining seats elsewhere, many of 
which were in lower income districts. By 1992, the Democratic
Party “had a clear base in lower-income House districts”
(p. 78).

As a result, class divisions “have not declined in American 
politics. Instead, the trend is a steady increase in class 
differences in support for Democrats” (p. 112). Evidence 
from the National Election Studies shows that in the 1990s 
the less affluent voted for Democratic candidates at much 
higher levels than in the 1950s and 1960s. Even so, Stonecash 
finds, the party does not appear to have alienated middle- 
class voters, despite concerns to the contrary. In fact, overall 
impressions of the Democratic Party are relatively positive 
compared to the GOP. 

Class and Party in American Politics is an important con- 
tribution to the literature and is a solid choice for undergrad- 
uate or graduate classes on political parties or voting behav- 
ior. The Democrats and Republicans today are much more 
likely to differ in what should be done regarding various 
issues than was the case four decades ago. The growing 
partisan difference in congressional votes is a product of the 
new constituencies. The parties today are much more ideo- 
logically consistent and homogeneous and more Americans 
see a difference between them today than was the case four 
decades ago. As inequality in the United States continues to 
increase, Stonecash concludes, party polarization is likely to 
intensify as the stakes involved in public policy disputes 
increase. 


Political Activists: The NDP in Convention. By Keith Archer
and Alan Whitehorn. Toronto: Oxford University Press,
1997. 299p. $29.95 paper.


Terence Morley, University of Victoria 


The New Democratic Party (NDP) of Canada was formed in 
the early 1930s with the European mass party model in mind 
and, more specifically, with hopes of emulating the British 
Labour Party as an instrument for providing significant and 
far-reaching social and economic reforms through govern- 
ment intervention and state ownership of the “commanding 
heights” of industry. The party, originally named the Co- 
operative Commonwealth Federation (CCF), never was able 
to climb to those heights, as it has never managed to form a 
national government in its sixty-nine years as a political force. 

Yet, the CCF/NDP has had considerable electoral success 
in many of those years. As early as 1944 the CCF formed a 
majority government in the Province of Saskatchewan that 
continued in office for twenty years. It was this government 
that inspired the doctoral dissertation of Seymour Martin
Lipset, then a young activist in New York City’s Young People’s
Socialist League, later published in 1950 as Agrarian Socialism.
Over the last quarter century, as the NDP, the party has
held office in Saskatchewan for lengthy periods. 

The NDP also won elections in other provinces: four in 
Manitoba, three in British Columbia, and one in Ontario, 
Canada’s largest province. In addition to a breadth of expe- 
rience running provincial governments, and despite the hand- 
icaps for third parties with a nationwide pattern of support 
imposed by the first-past-the-post, single-member plurality 
electoral system, the CCF/NDP has elected sufficient MPs 
since 1935 to make its national parliamentary caucus a highly 
legitimate political movement that has been accommodated
by other national parties and by the national media out of
political necessity.

Archer and Whitehorn have written extensively about the
CCF/NDP, jointly and separately, and this volume is primar- 
ily a slightly revised collection of several previous articles and 
book chapters, with some new material mostly collected at 
the last two NDP leadership conventions, held in 1989 and 
1995. As a mass party without a mass membership the 
CCF/NDP has been obedient in its strict observance
of Robert Michels’s “iron law of oligarchy.” As a
consequence the ideological longings and policy preferences 
of the party as a whole are primarily shaped by the opinions 
of the activists who attend national conventions. 

Archer and Whitehorn base most of their observations and 
conclusions on responses to five delegate questionnaires from
the 1971, 1979, 1983, 1987, and 1989 conventions. In addi- 
tion, they use aggregate data from these conventions and 
from the 1995 leadership convention in making a number of 
observations and drawing conclusions. The six most impor- 
tant are noted below. 

First, most delegates to these conventions have been party 
activists for many years, and many have served at the local
level; a majority have played a role in election campaigns for the party.

Second, although the party has formal ties to the labor 
movement and champions the poor and dispossessed in 
society, the delegates are overwhelmingly middle class. 

Third, in examining attitudinal differences between and 
within Canadian parties on such issues as trade with the 
United States, privatization and deregulation, social security, 
moral conservatism, bilingualism, and civil liberties, there are 
significant differences among the three Canadian parties 
studied in convention (the NDP, the Liberals, and the 
Progressive Conservatives). The NDP activists exhibit what 
the authors call a higher level of “ideological clarity” than 
delegates to the other party conventions. 

Fourth, the class consciousness of delegates seems to cut
across regional differences, with the result that even though 
delegates identify more with provincial governments, partic- 
ularly NDP provincial governments, they do not wish to 
transfer more powers to those governments, which continues 
a long-standing CCF/NDP preference for a strong central 
government. 

Fifth, in contrast to delegate attitudes from surveys of 
Liberal and Progressive Conservative conventions, the vast 
majority of NDP delegate respondents continue to favor 
government intervention in the economy and oversight of 
social programs.

Sixth, postmaterialist values, particularly environmentalist 
values, have been embraced by most party activists regardless 
of age or union affiliation. This contrasts with other survey 
evidence from NDP voters that suggests significant differ- 
ences of opinion between union and nonunion supporters on 
environmental issues. 

Two national (federal) elections have been held in Canada 
since this volume was compiled in 1997. Although the NDP 
regained party status in both 1997 and 2000, its popular vote 
across the country is now reduced to the same single-digit 
level with which it began in the 1930s. The NDP lost badly in 
the 1995 Ontario election, lost the government in the Yukon 
Territory, was recently reduced to minority government 
status in Saskatchewan, is likely to lose the next election in 
that province, and is certain to lose the 2001 election in 
British Columbia. The recent victory in the Province of 
Manitoba provides only tepid comfort. 

Although Archer and Whitehorn do not directly address
the NDP’s precipitous decline, their findings suggest an
explanation. The reported attitudes and beliefs of the party
activists seem insulated from those of the vast majority of 
Canadians. Archer and Whitehorn report that in the 1987 
convention survey more than 70% of the delegate respon- 
dents expressed a negative attitude toward the United States, 
and less than 30% wanted Canada to stay in NATO and 
NORAD. (The Berlin Wall was still intact that year.) These 
views are very different from those revealed in multiple
surveys among activists in other parties and among Canadi- 
ans generally. 

In the November 2000 election one prestigious nationwide 
survey found that the typical NDP voter had a family income 
of $100,000 plus, a level well in excess of that reported for the 
voters who intended to vote for the other major parties. It 
must be said that an avowedly democratic socialist and 
labor-affiliated party that attracts mostly high-income voters
seems to have lost touch with its potential base. The Archer 
and Whitehorn volume shows that this disconnection be- 
tween the working class and NDP adherents has been 
reflected for almost twenty years in the attitudes that can be 
attributed to NDP political activists. 


States, Banks, and Markets: Mexico’s Path to Financial
Liberalization in Comparative Perspective. By Nancy
Neiman Auerbach. Boulder, CO: Westview, 2001. 185p. 
$25.00 paper. 

Big Business, the State, and Free Trade: Constructing Coalitions
in Mexico. By Strom C. Thacker. New York:
Cambridge University Press, 2000. 239p. $54.95. 


Jonathan Hiskey, University of California, Riverside 

With many developing countries entering their third decade
of neoliberal reforms, research on the political economy of 
this transition continues apace. As is the model itself, in view 
of the shift from an apparent policy “consensus” to a complex 
maze of conditions and caveats faced by countries that
attempt to reap the benefits of neoliberalism (see Moises 
Naim, “Washington Consensus or Washington Confusion,” 
Foreign Policy 118 [Spring 2000]: 86-103), scholarship in this 
area is becoming increasingly complex and wide-ranging. 
Initially, research on the politics of economic reform focused 
on institutional, economic, and international factors to ex- 
plain variations in structural adjustment processes and out- 
comes across developing countries. Although critical in pro- 
viding a basic blueprint for understanding the dynamics of 
the economic transition that swept through the developing 
world in the 1980s and 1990s, the early research was limited 
in part because much of what it was trying to explain was still 
largely unfinished. Just as early work on democratization was 
limited in its foci to “first-generation” institutional reforms, 
so inquiry into the politics of economic transition in the 
developing world left unanswered a significant set of “sec- 
ond-generation” questions that only now can be adequately 
addressed. 


As the experiences and outcomes of the initial round of 
neoliberal reforms become clearer with time, scholars are 
examining the complex set of interactions between domestic 
and international actors that produced the reforms, as well as 
the consequences of these various coalitions for the scope 
and quality of a country’s reform package. The books by 
Strom C. Thacker and Nancy Neiman Auerbach are two 
examples of this recent trend. 

Thacker’s work posits a central role for domestic business 
interests, and the coalitions they formed with state actors, in 
explaining the almost revolutionary shift of Mexican trade 
policy during the past twenty years from highly protectionist 
to aggressively internationalist. Similarly, Auerbach moves 
beyond international, state, or society-based explanations of 
financial liberalization policies in Mexico and focuses instead 
on the policy consequences of the “strategic interaction 
between entities whose behavior, or potential behavior, affect 
the very preferences and behavior of other influential actors” 
(p. 14). In short, both works examine significant economic 
policy trends of the 1990s, trade and financial liberalization, 
with an emphasis on the interactions between, rather than the 
actions of, a set of domestic and international actors. Con- 
sequently, their principal contribution emerges from their 
focus on and identification of the interactive, rather than the 
independent, effects of political factors on the processes and 
outcomes of neoliberal reforms. 

Thacker’s point of departure is the issue left unexplained 
by previous accounts of trade policy in Mexico. This literature 
generally focuses on three factors: the policymaking role of 
international forces, the rise of a technocratic elite in Mexico
during the 1980s, and the unique features of Mexican polit- 
ical institutions that allow an unchecked role for a limited 
number of policymakers in formulating and implementing 
neoliberal reforms. For Thacker, the unresolved question is 
why trade liberalization in Mexico was so extensive, given the 
weakness of the country’s principal policymaking institutions 
(namely, the presidency) during this period and the still 
powerful “entrenched political and economic interests” that 
were diametrically opposed to neoliberal reforms (p. 4). 

Thacker sets out to provide evidence for the answer to that 
puzzle. The internationally oriented business sector, often 
viewed as a relatively apathetic proreform constituency, and 
its coalition with the reform-minded political elite are the 
basis for Thacker’s account. In addressing the role of the 
business sector and how the technocratic elite used it to push 
through its policy agenda, Thacker challenges a central 
assumption of previous research. The question of how gov- 
ernments with little political capital can implement difficult 
economic reforms is all the more vexing if one assumes that
the short-term losers will be the most vocal, whereas the 
long-term winners, because benefits are not immediately 
apparent, will not actively support reforms. What Thacker 
masterfully demonstrates is that the long-term beneficiaries 
of free trade in Mexico were instrumental in the govern- 
ment’s success in implementing those policies. 

Auerbach’s work is a more explicitly comparative frame- 
work but presents a similar argument to explain Mexico’s 
financial liberalization process. Auerbach highlights the role 
played by the beneficiaries in pushing through liberalization 
policies. In contrast to Thacker, however, who views the 
protrade business sector and technocratic political elite as 
willing coalition partners, Auerbach sees a much more pow- 
erful independent influence of the private banking sector and 
a much more conflictual relationship between this sector and 
the state financial policymakers. In fact, Auerbach argues 
that the unchallenged power of private banking interests 
explains the short-sightedness of Mexico’s financial reforms, 
which benefited private banking conglomerates but under- 
mined the long-term objectives of the government’s eco- 
nomic program. Curiously, for Auerbach “the lack of a 
dynamic domestic exporting elite” (p. 48), the same sector 
Thacker argues helped push through the country’s free trade 
agenda, enabled the private financiers to control the scope 
and direction of financial liberalization. For all their similar- 
ities in approach and subject matter, the two authors attach 
significantly different levels of policy influence to the various 
domestic interests with a stake in Mexico’s neoliberal re- 
forms. 

Auerbach not only offers an account of the nature of 
financial liberalization in Mexico based on the “strategic 
interaction between market actors and state actors” but also 
links the outcome of liberalization to the political and 
economic leverage held by those actors during the course of 
their interactions (p. 22). Consequently, the relative failure of 
Mexico’s reforms in the financial sector owes much to the 
dominance of a select group of bankers (grupos) in the 
policymaking process. Theoretically, a more worthy state 
counterpart, or a more competitive banking sector, would 
have produced a more successful set of financial reforms in 
Mexico. Auerbach provides evidence from South Korea and 
Hong Kong to support her contention that a more equitable 
balance of policymaking roles between the state and the 
private sector, and a clearly defined leadership role for the 
former, will produce more positive financial reforms. The 
1997 Asian financial crisis, however, seems to have forced 
some reworking of this argument and a shift in focus toward 
the level of competition within the financial sector, whether 
under state or private control, as critical in explaining finan- 
cial liberalization processes and outcomes. 

Both works make important advances in understanding the 
causes and consequences of the economic reform process. 
Both, however, also suffer in varying degrees from similar 
problems that invariably trace back to the difficult empirical 
task of identifying and substantiating the interaction effects 
of different sets of actors on policies and their outcomes 
without raising a host of equally plausible alternative expla- 
nations as well. This difficulty is particularly evident in the 
two studies’ conflicting views on the policymaking influence 
of the export-oriented business sector in Mexico. It plays a 
central role for Thacker in explaining the scope and successful
implementation of trade liberalization in Mexico. For
Auerbach, however, the political weakness of the export 
sector allowed private banking interests to control the pro- 
cess for their own short-term gain at the expense of domestic 
exporters. Thacker’s account is more convincing, but the 
different roles attributed to Mexico’s export sector in the
reform process suggest the inherently murky nature of determining
specific levels of influence among different sets of
private and state actors.

Notwithstanding this empirical difficulty, these studies con- 
tribute much to our understanding of the politics of economic 
reform. First, both offer important correctives regarding the 
role played by the beneficiaries of economic reforms. The 
neoliberal winners did not sit idly by and allow opponents to 
monopolize the debate. Rather, it appears they played an 
active role in bringing about these policies. Second, the 
theoretical advances made by both studies provide valuable 
stepping stones for future research. They tell us where to go 
next by focusing on the interactions among the various 
international, state, and private sector interests. 

Finally, both works identify what may prove to be an 
enduring state-society ruling coalition in those developing 
countries that are caught in the seemingly unending cycle of 
neoliberal reform. Many of the losers in the reform process 
are still struggling to give political voice to a viable alternative 
to the neoliberal model, but policymakers appear to have 
joined with the neoliberal winners to form coalitions that 
allow for the continued pursuit of market-based policy re- 
forms. Vicente Fox’s election as president of Mexico may be 
due in part to the strength of these coalitions. Whether such 
alliances will be sufficient to sustain the reform process 
through the end of its third decade, or through another round 
of protracted economic crises, remains in doubt. That these 
neoliberal policymaking coalitions represent one of the most 
fruitful avenues of political economy research for the fore- 
seeable future is beyond doubt. 


The Political Mobilization of the European Left, 1860-1980: 
The Class Cleavage. By Stefano Bartolini. Cambridge: 
Cambridge University Press, 2000. 637p. $69.95. 

The End of Class Politics? Class Voting in Comparative 
Context. Edited by Geoffrey Evans. Oxford: Oxford Uni- 
versity Press, 1999. 364p. $70.00 cloth, $35.00 paper. 

Diane Sainsbury, University of Stockholm 

The decline of leftist parties and the end of class politics are
perennial topics of debate, and these two books no doubt will
generate renewed controversy and discussion. Although class
and politics are central to both, their projects are vastly 
different but in some ways complementary. 

The End of Class Politics? calls into question the wide- 
spread assumption that there is a uniform weakening in the 
relationship between class and partisanship across advanced 
industrial societies. In challenging this view, a major part of 
the book analyzes patterns of voting behavior over time. It 
features a broad comparison of class voting in twenty coun- 
tries during the postwar period and in-depth studies of the 
United States, Britain, France, Germany, Norway, Sweden, 
the Czech Republic, and Russia. 

As distinct from much of the earlier research that primarily 
used the Alford index of class voting, the Evans volume 
employs a variety of refined techniques and analytical tools. 
The Alford and Thomsen indexes, constructed as dichoto- 
mies of class (manual-nonmanual) and party (Left-Right) 
(chaps. 2, 5, 7, and 10), are complemented by a more detailed 
and theoretically informed class schema developed by John 
Goldthorpe and Robert Erikson (most chapters), indices of 
relative class voting based on odds ratios (most chapters), 
logmultiplicative or associational models (chaps. 5 and 11), 
typological models (chap. 3), and more precise party classi- 
fications (chaps. 3, 6, 7, 9, and 10). Accordingly, the book 
represents “a state-of-the-art account of the changing nature 
of class voting” (p. 2). 

Overall, The End of Class Politics? points to cross-national 
variations rather than a secular trend in the decline in class 
voting. The reported trends range from a decline in Norway, 
to a realignment in the United States, to a strengthening in 
the Czech Republic and Russia. The assumption of uniform 
decline is further undermined by an analysis of class voting in 
Britain, France, and the United States from the mid-1930s 
through the early 1990s (chap. 5); it documents striking 
dissimilarities in levels and trends, and the conclusion is that 
in some cases national differences increased over the period. 
The book also contains useful discussions on hypotheses 
about changes in class voting (esp. pp. 98-104). Among the 
methodological insights worth mentioning is that several 
chapters apply different measures and techniques but tend to 
arrive at quite similar results. 

A shortcoming of the Evans book is its emphasis on 
description and the paucity of explanation. The editor states 
two good reasons for careful description. First, “it is neces- 
sary to know what it is we are trying to explain before we try 
to explain it”; second, “description is itself a test of theory” 
(p. 323). Nevertheless, as studies of voting behavior, the 
contributions share a typical flaw of this sort of analysis—an 
exclusive focus on voters and little attention to context. 
Equally problematic, the analyses include relatively few in- 
dependent variables, and in some instances only class posi- 
tion. As a result, they are ill equipped to offer explanations. 
Furthermore, the type of explanation tested is generally 
limited to social structural factors; political explanations— 
such as party ideological orientations, strategies, patterns of 
party competition, and differing institutional arrangements— 
are seldom part of the research design. In many cases the 
authors can only speculate about the determinants of the 
patterns they describe and propose an agenda for future 
research. As pointed out by one of the commentators (chap. 
12), this is a model study of class voting; but because of its 
failure to examine political factors, the book is not about class 
politics (p. 312). In short, analytical techniques and empirical 
findings often gain the upper hand at the expense of explan- 
atory issues other than the authors’ cogent criticisms of the 
theories they question. 

By contrast, Stefano Bartolini’s project is definitely about 
class politics. It entails systematic cross-national comparisons 
of the European Left in thirteen countries, focusing on 
historical processes that span more than a century. Among 
the theoretical contributions is Bartolini’s scheme of analysis 
and his discussion of mobilization and the concept of cleav- 
age, which is the dependent variable. Building on Stein 
Rokkan (e.g., “Cleavage Structures and Party Systems,” in 
Peter Flora, ed., State Formation, Nation-Building and Mass 
Politics in Europe: The Theory of Stein Rokkan, 1999), Barto- 
lini devises a framework to analyze the development of the 
Left in terms of sociocultural conditions, organi- 
zational mobilization, and the structure of opportunities 
embedded in the processes of state formation and nation- 
building. He characterizes his model as one that emphasizes 
the constraints and opportunities associated with national 
settings rather than the choices or strategies of elites. Upon 
closer examination, however, environmental constraints seem 
to converge with choices made by elites in his analysis. 

Rokkan is also a source of inspiration for Bartolini’s 
ambition to develop causal sequential generalizations. To 
achieve this, Bartolini combines a diachronic or historical 
approach with cross-national analyses of variance at particu- 
lar times. More specifically, his chapter analyses operate at 
four levels: a general analytical level to establish patterns for 
the entire European Left (all countries and all elections), 
country-specific patterns, developments over time for the 
entire European Left, and developments over time for indi- 
vidual countries. 

In clarifying the nature of the dependent variable, Barto- 
lini defines a cleavage “as a conflict line translated into 
politics, the translation is what historically constitutes the 
linkage between social condition, consciousness, and action” 
(p. 19). Accordingly, he conceptualizes class cleavage as a 
variable comprised of three dimensions: sociostructural, nor- 
mative, and organizational. In other words, compared to the 
Evans volume, which mainly deals with variations in class 
voting, Bartolini examines variations in electoral mobilization 
(the size of the Left vote), social homogeneity, organizational 
density and unity, and ideological orientation. 

The analytical framework structures Bartolini’s book. The 
initial chapter describes the variations in the European Left, 
and subsequent chapters assess the effect of socioeconomic 
transformations, cultural diversity, enfranchisement, organi- 
zational mobilization in party and corporatist channels, state 
responses and political integration, opportunities for social 
and political alliances, and the conditions for the rise of 
strong communist movements. Parenthetically, the examina- 
tion of political coalitions forms one of the richest compar- 
ative analyses in the book (chap. 8). In the concluding 
chapter Bartolini presents three explanatory models, one for 
each dimension of the class cleavage: electoral mobilization, 
ideological orientation, and organizational division. 

In contrast to several chapters in the Evans volume, a 
recurrent theme in Bartolini’s book is the importance of 
political variables in relation to socioeconomic conditions. 
According to Bartolini, the strength of the Left does not stem 
from industrialization and urbanization but from the pattern 
of franchise extension and closure of the political system. The 
emergence of strong leftist movements was related to the late 
timing of suffrage, sudden enlargements of the electorate, 
and organizational consolidation that was led by parties 
rather than unions. Likewise, Bartolini argues that noneco- 
nomic power relationships and resources are crucial in 
determining the response of the dominant economic elite to 
the emergent working class, that is, whether the response 
involved the state apparatus, institutional obstacles and priv- 
ileges, or political domination. In fact, he reverses the line of 
argument, making a repressive response dependent upon the 
availability of state resources. In a similar vein, he attaches 
major weight to the logic of political coalition rather than to 
the logic of socioeconomic alliances, as found in Barrington 
Moore’s Social Origins of Dictatorship and Democracy (1966). 
Briefly, Bartolini underlines the significance of political rep- 
resentation of groups established during the phase of party 
system structuring as a precondition for coalitions. 

An enterprise of such scale is not without hazards. An 
obvious difficulty is mastering the experiences of the Left in 
thirteen countries. Country experts may question Bartolini’s 
interpretations and point to inadequacies and even factual 
errors. Judging from the Scandinavian cases, there are short- 
comings of this sort; although annoying, they seldom seri- 
ously affect the author’s major conclusions. A second prob- 
lem is the collection of comparable data for the thirteen 
countries. In some instances, Bartolini is forced to rely on 
extrapolations and the definitions used in national sources, 
which contrasts with the meticulous approach of several 
authors in the Evans volume. On a theoretical plane, there is 
an interesting tension between a reasoning about multiple 
causation and the search for single explanatory models. A 
limitation of Bartolini’s book is its focus on the “golden age” 
of the Left, although the author suggests that his approach 
could be applied to the current transformation of the class 
cleavage by identifying a new macroconstellation and its 
constraints (p. 572). This suggestion, however, exposes a 
weakness of the approach. It concentrates on the translation 
of social oppositions into cleavages through mobilization, 
which makes emergent social forces the object of analysis. In 
this framework, contemporary leftist parties are largely con- 
signed to a passive or reactive role. 

In summation, the current position of the Left in govern- 
ment, usually as the party of the prime minister, in many 
Western European countries adds to the timeliness of the 
two books. They, contrary to many studies that are preoccu- 
pied with the Left’s decline, seek to understand the resilience 
of the Left as a political force and the continued importance 
of class as an ingredient of politics. 


The U.S. Presidency in Crisis: A Comparative Perspective. 
By Colin Campbell. New York: Oxford University Press, 
1998. 266p. $18.95 paper. 

Martin J. Smith, University of Sheffield 

Colin Campbell has written an interesting and wide-ranging 
book that illustrates what he sees as a crisis of the U.S. 
presidency through the lens of comparative government. His 
focus is on executive leadership in the United States, United 
Kingdom, Australia, and Canada. His key argument is that 
the “U.S. presidency is indeed broken” (p. 1), and similar 
problems of executive leadership can be found in the various 
cases. For Campbell, despite the differences of these systems, 
particularly between presidential and parliamentary models 
and between federal and unitary government, there has been 
a convergence of executive styles. Not least is the rise of 
outsiders as leaders, their growing monocratic styles, and 
their increasing tendency to appeal directly to the public. 
Consequently, they face similar problems or, in the author’s 
dramatic language, crises. 

Campbell identifies the key problem as the difficulty that 
executives now have in managing the environment and 
therefore in achieving desired policy goals. The book begins 
with the problems facing the U.S. presidency. During the 
Reagan/Bush years, the manifestation of the crisis was a 
consequence of two factors. The first was gridlock, as both 
Reagan and Bush were thwarted to varying extents by 
Congress. The second was the New Right ideology that 
resulted in the Reaganites’ misunderstanding of the nature of 
governing, which meant they failed to operate properly the 
two “gearboxes” of the presidency. The first gearbox is the 
relationship between the executive and the administration. 
According to Campbell, the administration believed that this 
relationship had been resolved (in Reagan’s first term, when 
it worked well) through the public choice prescriptions of the 
New Right. This mistaken view led the administration down 
the “garden path” (p. 47). 

The second gearbox, the relationship between the presi- 
dent and his team, was apparently handled relatively well by 
Reagan, despite chronic problems of executive overload, in 
the first term but broke down in the second. Under Bush it 
operated poorly throughout. The problem of the second 
gearbox became even greater under Clinton, who failed to 
pay any real attention to the issue of his relationship with his 
advisers and was in effect “his own most trusted staff 
member.” The problem Campbell identifies appears to be 
essentially a problem of management. Because U.S. presi- 
dents are unable or unwilling to pay sufficient attention to 
their “gearboxes,” they are not able to overcome the prob- 
lems of gridlock. 

The book then reviews, almost through political biogra- 
phies of leaders, similar crises in the other systems. In Britain 
the fundamental problem for Thatcher was the increasingly 
monocratic nature of her leadership, and for Major the 
problem seemed to be the inability to fill the vacuum she left. 
He was unable to put any institutional bones back into 
Thatcher’s personalism. In Canada the crucial issue is the 
extent to which leadership became personalized, particularly 
under Trudeau, although this seemed to work relatively well 
for him but less so for Mulroney. Nevertheless, in Campbell’s 
view, Chretien has been a successful leader. Likewise in 
Australia, personal leadership appeared relatively successful 
for Hawke but was less so for Keating. This suggests that the 
problems of governing may be a result of more than the 
personalization of government. 

To some extent the book’s comparative approach is under- 
mined by the lack of a clear analytical framework. The key 
concepts seem to be rather loose, such as “gearboxes” and 
“let’s deal” executives, with considerable focus on individual 
personalities. Little attention is paid to the external context 
and how this may have affected each of the countries. 
Perhaps more problematic are the many assumptions and 
assertions that are not adequately demonstrated: the crisis of 
the executive, the dominance of the New Right, the overload 
of the executive, the way in which party members demand 
changes of ossified government (p. 50). 

Campbell’s curious conclusion is that our systems have 
encountered considerable slippage in the capacity of leaders 
to deliver and that the fault for this rests with the leaders 
themselves. The author seems to yearn for the age of the 
heroic leader who can overcome the problems of government 
and deliver on goals. Yet, at the same time Campbell seems 
to see heroic leadership as the essence of the problem; the 
tendency of leaders such as Reagan, Clinton, Hawke, and 
Thatcher to believe too much in themselves and so not to 
manage the system leads to the failure to deliver. This 
argument essentially reduces the so-called crisis of the exec- 
utive to personal failings rather than to the structures of 
government or wider external factors, such as problems of 
governance in a changing global arena. 

The crisis of executives, if there is one, is surely a conse- 
quence of the difficulties of governing complex 
societies, polities, and economies. More important, Campbell 
fails to establish convincingly that there is a crisis. Was 
Clinton less successful than Carter? Was Thatcher less 
successful than Heath? If anything, Reagan, Clinton, and 
Blair have been successful leaders, certainly in an economic 
sense and to a degree politically. Thatcher and Hawke were 
successful for a long period, before ignominious ends, and 
they had a real and lasting effect on their countries and 
political systems. Even Major, who clearly had problems, 
maintained the Conservatives in power until 1997, just about 
held together a deeply riven party, and saw Britain out of 
recession. On some measures he was a success. 

It is clear that executive government has changed and is 
faced by new problems, especially the fragmentation of the 
processes of policymaking and policy delivery, but it is not 
proven that there is a general crisis of the executive or that 
political leaders are worse now than twenty or fifty years ago. 
The process of governing may be more fraught, and it is 
certainly more visible, but ironically, in a world of competing 
power centers, central executives are increasingly the only 
organization with the oversight and resources to have a 
generalized effect within polities. 


Democracy and Institutions: The Life Work of Arend Lijp- 
hart. Edited by Markus M. L. Crepaz, Thomas A. Koelble, 
and David Wilsford. Ann Arbor: University of Michigan 
Press, 2000. 262p. $49.50. 


Russell J. Dalton, University of California, Irvine 


Democracy and Institutions is a festschrift to celebrate Arend 
Lijphart’s contributions to political science. Often a 
festschrift is like reading about a party to honor a relative; 
many of the partygoers have familiar faces, and everyone says 
nice things about the honoree. In this case, the accolades are 
clearly warranted. 

The book’s ten chapters address two broad themes that 
characterize Lijphart’s major research interests. The first is 
the distinction between majoritarian and consociational/con- 
sensus democratic systems. This distinction has been very 
important in expanding our understanding of the democratic 
process, both in terms of theoretical concepts and quantita- 
tive methods. The chapter by Rein Taagepera, for example, 
summarizes some of the key points of Lijphart’s two books on 
this subject, and it highlights some of the ways in which this 
concept has been fruitful in expanding democratic theory and 
the empirical measurement of democratic institutions. G. 
Bingham Powell’s chapter shows the power of this distinction 
in understanding the responsiveness of electoral systems. 
Higher thresholds of electoral access (i.e., majoritarian sys- 
tems) are associated with less accurate and complete repre- 
sentation of voters’ party preferences in the distribution of 
legislative seats as well as in the policymaking process. It is 
fair to say that consociational democracy has been one of the 
most important concepts in the field of comparative politics, 
both for its value in describing alternative democratic forms 
and because these forms have a breadth of implications for 
specific aspects of the democratic process. This point is 
illustrated by many of the chapters in this book. 

I have always been troubled by the static nature of 
Lijphart’s empirical evidence, although the logic is explicitly 
causal and dynamic. Jack Nagel’s analysis of the 1993 change 
in the New Zealand electoral system presents an interesting 
opportunity to study the dynamic aspects of Lijphart’s argu- 
ment. New Zealand again proves Duverger’s Law: The shift 
from a majoritarian toward a proportional representation 
(PR) electoral system increased the number of parties and 
the number of issue dimensions represented within electoral 
alignments. What is missing, however, is more discussion of 
the theoretical implications of these changes. Does this mean 
that unrepresented issues before 1993 were a source of 
dissatisfaction that stimulated electoral reform, or did elec- 
toral reform based on other criteria create an opportunity for 
new issue cleavages to form? Most important, does PR 
represent a different way of democratic representation rather 
than a better way? That is the underlying question of the 
book. 


Nagel also offers an explanation of the New Zealand 
electoral reform that partially contradicts the Lijphart model. 
Nagel suggests that a majority-based government was the 
desired outcome for the public, but the “majoritarian” insti- 
tutions were producing governments based on less than a 
popular majority. Thus, New Zealanders saw the PR system 
as a way to create governments that represented a majority of 
the electorate, not to create a supermajority or consensual 
model. Both of Nagel’s key findings warrant more discussion 
of the theoretical and practical implications for Lijphart’s 
work on democratic institutions. 

The book’s second theme is the relation between social 
diversity and the choice of democratic institutions. Lijphart 
maintains that consociational systems are better able to 
represent political interests in a diverse society and thus are 
more likely to yield stable democracies. Milton Esman re- 
views the theoretical debate on these issues as an introduc- 
tion to this literature. Thomas Koelble examines this idea in 
the context of the new democratic system in South Africa, a 
case in which Lijphart’s writings have clearly influenced the 
constitutional debate. 

Andrew Reynolds builds on this literature and makes a 
significant contribution to it. He describes a larger framework 
of multiple forms of majoritarian and consensual democra- 
cies that goes beyond Lijphart’s original dichotomy; this is a 
valuable addition to the debate. He then evaluates the South 
African democratic system within this framework and in 
comparison to other African democracies. The result is a 
much richer understanding of the South African experience, 
potentially highlighting a case in which the relationship 
between social diversity and constitutional choices does not 
follow the predicted path. The Reynolds chapter is a jewel. 

Another jewel is Lijphart’s response to the authors in a 
final chapter. Lijphart uses this opportunity to make compar- 
isons across his various works on democratic institutions and 
to draw out the distinctions between consociational and 
consensus democracy. He also welcomes the expansion of 
alternative democratic models, such as Reynolds suggests. 
The chapter closes with his normative argument that nonma- 
joritarian democracy in its various forms and shapes is clearly 
preferable to the majoritarian or pluralitarian alternative. 
Even if the skeptical reader stops short of this conclusion 
(perhaps contrasting the experience of majoritarian Britain 
and the hyperpolarized Germany of the 1930s), one is 
convinced that the consociational/majoritarian distinction 
has proven its value as a tool to understand important 
variations in the democratic process. 

In the end, an eager graduate student might ask what is so 
creative about the consociational/majoritarian framework. 
Like many other discoveries, it seems simple and self- 
evident—but only after it has been pointed out to us. It takes 
an insightful individual to recognize a concept that others 
have not clearly seen, to give it meaning, and in this case to 
measure it empirically and show its relevance to a breadth of 
political phenomena. This has been Lijphart’s enduring and 
significant contribution to the comparative study of politics. 


Heroic Defeats: The Politics of Job Loss. By Miriam A. 
Golden. Cambridge: Cambridge University Press, 1997. 
194p. $59.95 cloth, $21.95 paper. 


Bruce Western, Princeton University 


Unemployment is the scourge of organized labor. Joblessness 
erodes trade union membership and undermines bargaining 
power. Unions that deflect unemployment become enduring 
and influential economic actors. Unemployment puts unions 
on the defensive because the creative destruction of jobs in 
inefficient firms and industries seems essential for economic 
dynamism. Consequently, strikes against plant closings or 
layoffs appear quixotic, bloody-minded, or both. Understand- 
ing how unions act when faced with workforce reductions 
thus raises basic questions about the place of organized labor 
in capitalist economies. 

Miriam Golden tackles this issue in an important book that 
makes a valuable contribution to our understanding of con- 
temporary labor politics. Whereas recent research on the 
relationship between unions and unemployment tends to be 
quantitative and highly aggregated, Golden’s work is quali- 
tative and disaggregated. Her empirical attention is fixed on 
interactions between particular firms and unions under con- 
ditions of mass workforce reductions. The empirical core of 
the book details a comparative case analysis of four main 
sites: large layoffs in the auto industry in Britain and Italy and 
pit closures in the British and Japanese coal industry. In all 
four cases unions faced large redundancies, but only the 
British auto industry avoided significant industrial strife. 

These events provide the empirical variation to support a 
simple but compelling argument: Unions will rationally strike 
to protest layoffs when union activists are at risk of dismissal. 
The pattern is contextual, however; local employers and 
unions will more likely accept the costs of industrial action if 
they are supported by third parties, usually national repre- 
sentatives. 

Contrasting outcomes in the Italian and British auto 
industries illustrate unions’ interest in protecting their sup- 
porters. Fiat’s Turin plant was blockaded for weeks in 1980 
following the announcement of 23,000 layoffs among the 
150,000 employees in the area. The union charged that shop 
stewards and militants were targeted, and the layoffs thus 
threatened not just workers but the basic principle of union 
representation. A year earlier, British Leyland announced 
personnel reductions of similar size at its plant in Birming- 
ham. In this case, retention was based on seniority, and shop 
stewards were insulated from the downsizing. By 1982, 
employment at British Leyland had been halved, largely in a 
climate of industrial peace. 

Perhaps the most famous recent example of labor resis- 
tance to industrial restructuring can be found in the British 
coal industry in the 1980s. Triggered by a strike in Yorkshire, 
the central office of the National Union of Mineworkers 
acted swiftly to precipitate nationwide industrial action. The 
strike was exceptional. British unions had accommodated pit 
closures and declining employment throughout most of the 
twentieth century. According to media reports at the time, 
and a great deal of subsequent analysis, responsibility for the 
strike lay with the union’s leader, Arthur Scargill. He person- 
ified the Old Left in the British Labour Party and, for many 
analysts, had drawn his union into an orgy of self-destruction. 
This case, more than most others, provides a stern test of 
Golden’s rationalistic account of union behavior. In convinc- 
ing fashion, her analysis shows that the British coal strike was 
a last-ditch effort to save an industry relegated to extinction 
by Margaret Thatcher’s Tory government. A temporary 
reprieve may have been available under Labour, so the union 
embarked (unsuccessfully) on a path to bring down the 
government. 

Golden compares strikes in British coal to those in Japan 
two decades earlier, which also featured a strong role for the 
national union in a local industrial dispute. The national 
employers’ federation also joined the dispute, opposing an 
emerging form of industrial unionism. With support by 
national actors on both sides, the costs of strike action in 
Japan were significantly subsidized, both for the firm and the 
union. These conditions facilitated industrial disputation, and 
the ensuing strike lasted eight months and involved about 
350,000 union supporters. 

The empirical analysis is concluded by a briefer consider- 
ation of industrial relations in the U.S. private sector. Here, 
layoffs are common, and seniority is the dominant rule for 
retention. Significant historical evidence indicates that unions 
support seniority provisions chiefly to protect the jobs of 
union activists. As the theory would predict, workforce 
reductions in the United States are largely removed from the 
sphere of industrial conflict. 

Although Golden offers a crisp and plausible account, 
labor sympathizers may baulk at several of the main impli- 
cations. Unions are treated as organizationally distinct from 
the workforces that they ostensibly represent. Is Golden 
saying that unions are self-serving, acting more in the inter- 
ests of their own cadre than in the interests of all workers? 
This would be a misreading in my view. The underlying 
thesis, which is explicit in the book’s title and the final pages 
of the conclusion, is that unemployment is as much a political 
event as it is economic. It is highly contested and enormously 
consequential for the balance of power on the shop floor. Just 
as employers call upon the natural pressures of 
market forces to target militants for job loss, so must labor 
leaders protect the principle of union representation through 
strike action. Golden does not claim, as some might charge, 
that unions are feathering their nests by shielding their 
factory lieutenants. This may sometimes be the case, but 
Golden’s unions are largely in the business of trying to retain 
some political control over workplaces made vulnerable by 
industrial restructuring. 

The prominent place of rational choice theory may also 
raise flags for some readers. In my view, the main substantive 
insights stand well enough on their own without the impri- 
matur of tree figures or utility functions. The trappings of 
rational choice hint broadly, without strong justification, at 
the generality of explanation. (Would this analysis apply 
equally in northern Europe, where unions enjoy greater 
acceptance among employers?) The logic of empirical anal- 
ysis sometimes emphasizes confirmation rather than falsifi- 
cation. These are small reservations, however. On the other 
side of the ledger, a deliberately utilitarian theory provides a 
valuable antidote to the more spectacular accounts of wild- 
eyed union radicals who eagerly commit their members to 
profoundly lost causes. 

Comparative politics now seems quite good at producing 
broad-gauge analyses of the political conditions for economic 
success, but Golden underlines the micropolitics of the shop 
floor and the fundamentally political character of unemploy- 
ment. This is a significant and critical correction that deserves 
a broad audience among comparativists and all students of 
political economy. 


New Labour and Thatcherism: Political Change in Britain. 
By Richard Heffernan. London and New York: Macmillan 
and St. Martin’s, 2000. 234p. $65.00. 


Radhika Desai, University of Victoria 


The most interesting and insightful understanding of Thatch- 
erism was to come from leftist analyses, in particular in the 
Gramscian terms of “hegemony.” Even though there were 
overstatements of particular positions, and corrections en- 
sued—such as the question of whether Thatcherite “hegemo- 
ny” meant that the ideology was accepted by the people 
wholeheartedly—the analyses had the merit of understanding 
that the change wrought by Thatcherism in Britain involved 
the whole historical conjuncture—its social and economic as 
well as its cultural and political aspects. As such, it left far 
behind much of traditional political science literature, with its 
narrow focus on “the political,” or elections and voting and 
parties and party systems. Elections are, after all, only the 
periodic and often none too faithful register of other devel- 
opments that can never be explained within their own narrow 
confines. Traditional political science did make certain im- 
portant contributions to the understanding of phenomena, 
but they had to be placed within the wider developments that 
remained outside its scope. For example, Ivor Crewe’s fa- 
mous surveys (“Has the Electorate become Thatcherite?” in 
Robert Skidelsky, ed., Thatcherism, 1988) showed the shal- 
lowness of the penetration of Thatcherism as an ideology. 
David Marsh and Rod Rhodes, in their detailed study 
(Implementing Thatcherite Policies. . . , 1992), revealed the far 
less than ideological nature of policymaking and implemen- 
tation under Thatcher. 

Richard Heffernan has set himself the task of putting 
political science to work in explaining the transformation of 
the Labour Party into New Labour, which occurred in 
tandem with the successes of Thatcherism in elections and 
policy. He seeks to explain “how and why political change 
comes about” (p. ix). He very ably and deftly deploys theories 
of change in parties and party systems, and he argues that 
Labour was passive in the face of the change wrought by 
Thatcherism, that it was remade by it rather than remade 
itself. This also means that all the talk of New Labour 
representing a “third way” is no more than a fig leaf to cover 
the abject capitulation to Thatcherism that New Labour 
really is. Today, both New Labour and Conservatives have 
“aligned themselves on the same ideological territory char- 
acterized by a shared acceptance of the reworked boundaries 
of the public-private sector and a belief in the primacy of the 
private over the public in the management of the economy” 


Briefly, the argument runs as follows. Thatcherism’s true 
success lies neither in being hegemonic in any clear sense nor 
in winning the battle of ideas, but in changing the political 
agenda of Britain around a new set of beliefs and a new 
consensus about what is possible and practicable for a 
government to do. It has gained acceptance, mostly at the 
level of policy, not as an ideology but as a guide. This is what 
it set out to do, and despite electoral and political constraints, 
it achieved this, overthrowing the postwar set of assumptions 
and political consensus. Through successive defeats, in par- 
ticular those of 1987 and 1992, Labour has been indelibly 
stamped by this Thatcherism. Up to 1987 the transformation 
of Labour under Kinnock remained within a broadly social 
democratic framework, but after that date the idea was 
accepted that it was neither feasible nor desirable for the 
government to interfere with the market, which was the 
precursor to Blair’s full-blown endorsement of “enterprise 
culture.” Despite the Smith interregnum, it was a smooth 
process, with no radical breaks. Blair’s hectically activist 
stewardship of the policy and party reform process did not 
mark any “dramatic shift... 1994 was not a ‘year zero’ for 
the Labour party because Blair’s reforms were built on the 
firm foundations established before he came to the leader- 
ship” (p. 83). Choosing to accept rather than reshape the 
Thatcherite changes in the pattern of political preferences, 
Labour’s modernization was a process and politics of “catch- 
ing up” with Thatcherism. New Labour has completed the 
shift from one consensus, a social democratic one, to the 
neoliberal one in British politics. 

Heffernan also shows very well the distance between New 
Labour and the social democratic Right of Old Labour, to 
which it is often likened: Crosland or even Jenkins were 
social democrats; Blair is a Thatcherite. And Blair is not 
Gaitskell: “In his speech to the 1959 Labour Conference, 
Hugh Gaitskell declared his party in favour of ‘social justice, 
an equitable distribution of wealth and income. . . a “classless 
society” . . . [where] the public interest must come before the 
private interest.’ In contrast, Tony Blair’s speech to the 1996 
Conference indicated the refashioning of language since 
1975: ‘A society of opportunity. A society of responsibili- 
ty... the decent Society, a new social order for the Age of 
Achievement’ ” (p. 136). Clearly, there was a choice before 
Labour in the face of Thatcherism: Given the lack of 
acceptance among the electorate of neoliberal prescriptions, 
New Labour could have sought to persuade the voter of 
different goals and values. It chose capitulation to Thatch- 
erism instead. 

All the theoretical weapons of political science that Hef- 
fernan brings to bear on the story of New Labour beg, as is 
the wont of this type of explanation, further questions. Chief 
of these, surely, is that if the political system, the consensus, 
in Britain changed because Labour capitulated to Thatcher- 
ism, why did it do so? This, unfortunately, is not a question 
that even arises within the terms of this theoretical armoury. 
But without the answer, Heffernan has not explained the 
change in Britain; he has only shown how it would be 
described by a very able and scrupulous “political scientist.” 

The answer involves a discussion of a number of issues that 
lie outside the theoretical terms of reference, although not 
Heffernan himself, if his excellent account, with Mike Mar- 
qusee, of Labour politics under Kinnock is any indication 
(Defeat from the Jaws of Victory: Inside Kinnock’s Labour 
Party, 1992). These include the fate of the social democratic 
Keynesian project in the context of a changing political 
economy and the worldwide ascendancy of neoliberalism 
upon which the Thatcherite success was built; the defeat of 
the Left within Labour; the mediatization of politics in 
general; the changing character of Labour’s membership and 
that of the unions; and accompanying changes in the social 
character of the leadership. It is at the confluence of such 
historical factors, and probably a few others, that one can 
fully explain New Labour’s capitulation to Thatcherism, the 
loosening of the party’s historic connection with the labor 
movement and abandonment of its vocation, albeit reformist, 
to humanize capitalism. 


Revolutionary Movements in Latin America: El Salvador’s 
FMLN and Peru’s Shining Path. By Cynthia McClintock. 
Washington, DC: United States Institute of Peace Press, 
1998. 219p. $59.95. 


Jeff Paige, University of Michigan 
Cynthia McClintock has written an important and disturbing 
book on the future of revolution in Latin America, and it 
stands as a sobering antidote to the confident declarations of 
the end of revolution in Jorge Castañeda’s widely influential 
Utopia Unarmed (1993). No sooner had Castañeda pro- 
claimed the end of revolution than what appeared to be the 
first post-Cold War revolution broke out in his own country, 
Mexico, in the remote southern province of Chiapas. Mc- 
Clintock argues that it was actually not the first but the 
second such revolt. The first was the Sendero Luminoso 
(Shining Path) movement in Peru, which effectively ended 
with the arrest of its maximum leader Abimael Guzmán 
(a.k.a. President Gonzalo) on September 12, 1992, a little 
more than a year after the collapse of the Soviet Union. 
McClintock’s views demand respect. She is the leading North 
American authority on Sendero and has extensive field 
experience in the southern Peruvian highlands, where the 
movement began. Her volume on the Shining Path has long 
been awaited by students of revolution in Latin America and 
elsewhere seeking an explanation of this bizarre and violent 
movement. Just such an explanation is provided here. 
Revolutionary Movements in Latin America is actually a 
comparative study of only two such movements: Sendero 
Luminoso and El Salvador’s Farabundo Martí National 
Liberation Front (FMLN). The cases are well chosen for compar- 
ative analysis, however; despite the fact that neither took 
power, they were the two most effective revolutionary move- 
ments in post-World War II Latin America, much more 
powerful as politico-military organizations than those of 
either Fidel Castro or the Nicaraguan Sandinista Front of 
National Liberation. Both came much closer to seizing power 
than is widely realized: the FMLN in 1980-81 and again in 
1989; Sendero in 1991-92 before Guzmán’s capture. In other 
respects, however, McClintock argues they could hardly be 
more dissimilar: The FMLN was the last revolutionary 
movement of the Cold War era; Sendero the first post-Cold 
War movement. The accounts of the organization, ideology, 
tactics, and leadership of the two movements are among the 
book’s strongest points—clearly written and compelling po- 
litical analysis. Even in the case of the FMLN, where 
McClintock is drawing on secondary sources in an area in 
which she is not a specialist, the account is persuasive and 
even original. 

The FMLN was, in her account, a coalition of defectors 
from the Communist Party of El Salvador, dissident Christian 
Democrats, and New Left student activists; it had an inter- 
nally collegial if not democratic structure and a pronounced 
social democratic and pragmatic orientation in at least some 
of its factions. This is a departure from most scholarly views 
of the FMLN, which is usually contrasted with the greater 
socialist heterodoxy and pragmatism of the Sandinistas, but it 
is consistent with the intellectual and ideological trajectories 
of many of the leaders as well as the ease with which the 
organization has made the transition to a democratic politics 
after the peace agreement was signed in El Salvador in 1992. 
McClintock confirms the widespread impression of Sendero 
as authoritarian, fanatical, apocalyptically violent, millenar- 
ian, and personalistic, although she never loses sight of the 
underlying inequities that drove even this bizarre offshoot of 
the Latin American revolutionary tree. In her view Sendero 
shares more with Islamic fundamentalism than it does with 
socialist revolutionaries like the FMLN, despite Guzmán’s 
slavish devotion to the Maoism of the Cultural Revolution. 

The causes of the two movements, too, differ dramatically 
in McClintock’s account. The rise of the FMLN was a result 
of the violent closure of the Salvadoran political system, 
particularly after the fraudulent 1972 election, whereas Sen- 
dero was a direct consequence of the collapse of the Peruvian 
economy as the result of a bungled response to the debt crisis 
and pressure for neoliberal restructuring. Sendero in her view 
must be seen as the dark side of the debt crisis and 
neoliberalism; Peru began the descent into poverty, inequal- 
ity, and economic chaos earlier and more dramatically than 
other beneficiaries of neoliberal restructuring, and it fell 
farther and deeper into the abyss. When Alberto Fujimori 
imposed his famous “Fujishock” in August 1991 to restruc- 
ture the Peruvian economy to satisfy foreign lenders, 20% to 
50% of Limeños were reduced to eating in soup kitchens, 
school teacher salaries had fallen to one-tenth of their 
precrisis levels, public health programs were virtually aban- 
doned, agricultural credit had dried up in the poorest areas of 
the Peruvian altiplano, and a substantial portion of the 
college graduates from provincial universities were scram- 
bling for jobs as cab drivers. 

It was this economic crisis that provided the inspiration 
and recruits for Sendero. One of the book’s strongest fea- 
tures is the original interview data from 33 Sendero militants, 
which make clear both the social composition and motives of 
the rebels. The movement was overwhelmingly composed of 
students, teachers, and professionals with a smaller but 
significant peasant fraction. This is in distinct contrast to the 
FMLN, which was overwhelmingly peasant in its mass base 
despite its origins in student politics at the University of El 
Salvador. It is clear that the Peruvian economic crisis simul- 
taneously devastated rural communities in the southern 
highlands of Peru and cut short career aspirations of the 
recently educated middle class, particularly those in remote 
provinces like Ayacucho, where the revolt began. It is clear 
that the perception of economic devastation to the poor (the 
rebels refer to it as “economic genocide”) was the primary 
ideological force in the movement, but the teachers’ and 
students’ truncated economic prospects help explain the 
extremism of the movement. 

Although McClintock tries to distinguish the “political” 
origins of the FMLN from the “economic” origins of Send- 
ero, at times the contrast is overdrawn. It is clear that many 
student rebels were attracted to the FMLN because of the 
closure of legitimate political channels (and the University of 
El Salvador itself) by the military, but they were drawn to 
activism in the first place by the profound economic and 
social inequalities of El Salvador. This is clearly reflected in 
FMLN ideology both in the 1980s, when it was a revolution- 
ary guerrilla organization, and today, when it has become a 
militant political party fighting neoliberal restructuring. Sim- 
ilarly, the peasant base of the movement was forged in the 
peasant associations of the 1970s, organized under Catholic 
or Christian Democratic auspices, that were subsequently 
radicalized through repression. These mobilizations were in 
turn the result of increasing landlessness and inequality 
caused by agro-export expansion from the 1950s to the 1970s. 
It is entirely possible that absent military repression these 
movements might have moved in a social democratic or other 
reformist direction. But without the economic and social 
demands, there would have been no movement in the first 
place. 

Nevertheless, McClintock’s principal point is well taken. 
Democracy, particularly an ineffective and illegitimate de- 
mocracy like that in Peru in the 1980s and 1990s, is no 
insurance against revolution. From 1980 to 1992, when 
Sendero came close to taking power, Peru was a reasonably 
democratic society in which leftist political parties were able 
not only to contest but also to win national political power. 
The combination of ineffective democracy and immiseration 
in Latin America has increased markedly over the last two 
decades in response to neoliberal restructuring. If Mc- 
Clintock’s general thesis is correct, it is surprising that there 
have not been more Sendero-like revolutionary movements 
in Latin America. McClintock argues that Sendero closely 
resembles fundamentalist religious movements that have 
been on the rise in response to globalization and economic 
restructuring, although less so in Latin America than else- 
where. 

Will Sendero become the prototype for future revolution- 
ary movements? Although McClintock correctly points out 
that the Zapatistas and Sendero are responding to the same 
kind of economic restructuring, the movements do not 
resemble each other in other respects. The Zapatista leader 
has taken great pains to emphasize that not he but the people 
are the leader (it is Subcomandante Marcos instead of 
Presidente Gonzalo), they have formed alliances with other 
progressive groups rather than murder them, have made 
extremely sparing use of violence, emphasize the reformist 
and democratic nature of their goals, and stress indigenous 
rights rather than messianic Maoism. Sendero may in retro- 
spect come to be regarded as a strange degenerate form of 
communism that emerged in the context of the failure of not 
only the communist and socialist Left but also the new world 
order of neoliberalism. 

Perhaps Ecuador, which McClintock uses as a contrasting 
nonrevolutionary case to Peru, is more representative of 
revolutionary movements of the future. Since her research 
was completed, Ecuador has become a test case of her 
hypothesis. It has been reasonably democratic and also has 
undergone increasing immiseration in response to neoliberal 
restructuring. The result was a massive and possibly revolu- 
tionary social movement that seized power briefly last year. 
But the movement is based on mass mobilizations not 
guerrilla war, is democratic in its organization and goals, 
emphasizes indigenous rights as well as coalitions with stu- 
dents and workers, and eschews millenarian dreams. Never- 
theless, Ecuador’s putative democracy has not been able to 
respond effectively or contain this movement thus far, and a 
wider revolutionary upsurge cannot be ruled out. Although 
Maoist fundamentalism may be gone, this in no way suggests 
that revolution is off the historical agenda. That is the crucial 
conclusion of this important and thought-provoking book. 


Poverty, Equality, and Growth: The Politics of Economic 
Need in Postwar Japan. By Deborah J. Milly. Cambridge, 
MA: Harvard University Press, 1999. 386p. $49.50. 


James W. White, University of North Carolina 


This book is an analysis of the emergence of a welfare 
state—or, as the author puts it, of the state’s response to 
poverty—in postwar Japan. The central question is: How did 
a conflict-ridden Japanese political system develop and 
choose among accommodative alternatives for dealing with 
the problem of poverty? The author’s answer is: by redefining 
poverty in a way acceptable to the major political actors. This 
redefinition cast poverty as a dependent variable of the 
economic structure and performance and cast income equal- 
ity—to be achieved through economic growth and labor 
market participation rather than individual transfer pay- 
ments—as the solution. The redefinition accommodated bu- 
reaucrats (those interested in remedying the poverty prob- 
lem, those interested in growth, and those interested in 
keeping the budget down), ruling Liberal Democratic Party 
(LDP) politicians interested in economic growth and winning 
reelection, unions interested in the employment of their 
members, and leftists interested in the legacy of egalitarian- 
ism bequeathed by the American Occupation. But neither 
redefinition nor accommodation was easy. 

Milly’s study is both thematic and chronological. It focuses 
on four case studies: unemployment insurance, relief works, 
labor market management, and the minimum wage. Milly 
follows an overview of statist, paternalistic prewar poverty 
policy with an examination of the Occupation period, during 
which the Americans brought in unprecedented and disso- 
nant democratic norms with poverty implications and exer- 
cised ultimate authority over a series of Japanese govern- 
ments that had to try to meld old and new in the face of a 
huge postsurrender poverty problem. The result, after the 
Occupation, was a decade of intense political conflict be- 
tween the Right, which wanted to roll back democratic 
reforms, and the Left, which wanted to extend them. Fortu- 
nately for the system, however, the two major forces in play 
were not monolithic; there were divisions within the bureau- 
cracy and the opposition, and the conservative ruling party 
was pulled one way by its traditionalistic norms and another 
by its desire to win elections. But the gradual formation of 
this pluralistic, potential foundation for compromise was 

by combat over political principles. 

A turning point came in 1960 with the emergence—and 
broad acceptance—of a politically directed, bureaucratically 
managed economic growth agenda within which popular 
welfare was posited as dependent on growth and full employ- 
ment. This agenda has, in broad outline, channeled poverty 
policy ever since, and it led to three decades of spectacular 
economic growth, a respectable if not lavish welfare system, 
and one of the most egalitarian income distribution struc- 
tures in the world. The question is: How did it come about? 

Milly focuses on four independent variables: institutions, 
expert knowledge, politics, and norms. The institutions of 
primary interest are bureaucratic agencies, with the differing 
goals noted above; while the politicians were fighting it out 
over conflicting norms, these bureaucrats built during the 
1950s a new definition of poverty, based on expert knowl- 
edge, which they hoped would encompass multiple norms, 
defuse the fighting, and enhance popular welfare but not 
contradict the goal—also dear to the hearts of many bureau- 
crats—of economic development. But this redefinition 
lacked appeal until 1960, a truly traumatic year in postwar 
Japanese history, when labor unrest and political protest rent 
the country’s social fabric. Actors on both Left and Right 
pulled back from the brink, and a new prime minister 
consciously adopted a growth-first policy designed to calm 
and depoliticize the situation. Suddenly the experts’ redefi- 
nition of poverty made sense, and within the space of a few 
years it won the day. 

The redefinition of poverty satisfied a number of impera- 
tives. It was consistent with egalitarian Occupation norms, 
but in focusing on labor market participation it avoided 
expanding “the dole” and promoting sloth, which were major 
fears of the conservatives. It satisfied Ministry of Welfare 
bureaucrats, who felt a duty toward the poor, and also 
Ministry of Labor personnel, who focused on the working 
population. It appealed to all those worried about social 
unrest and to many in the working class itself, which bought 
fully into the work ethic, and thus weakened the political base 
of the Left. The new development agenda promised to help 
the LDP achieve three goals simultaneously: aid the poor, 
build the economy, and defeat their opponents. It had a little 
for almost everyone. 

This story, as far as it goes, is persuasive and lucid, and 
Milly makes a major contribution by bringing Japan into the 
welfare capitalism literature. But it seems to me that three 
significant questions are begged: why 1960, why equality, and 
why this much welfare? In response to the first, the author's 
stress on continuity and redefinition, while accurate, under- 
emphasizes (despite p. 175) the question of why accommo- 
dation itself became a ruling party goal as well as the 
importance of the political turmoil of 1960 in enhancing elite 
receptivity to a new definition of poverty. A new definition 
was meaningless until the ruling party was in a mood to 
accommodate, and both accommodation in general and this 
definition—consistent with Prime Minister Ikeda’s party- 
salvation-through-growth policy—in particular became infi- 
nitely more appealing in 1960. 

Why was the solution to poverty found in income equality? 
Certainly the Occupation introduced the norm, and it con- 
tributed both to development and to LDP electoral victory; it 
also served to deflect the unwanted growth of a sense of an 
individual right to welfare. Perhaps that is an adequate 
explanation. But almost nowhere in all of Japan’s exquisitely 
hierarchical social history does one see the idea of equality, 
and in the 1950s and 1960s almost no one in Japan really 
believed that all people were created equal. So why, given the 
availability of alternative solutions to poverty (see p. 278), 
was this one chosen? 

Finally, why did the Japanese build a welfare state of this 
magnitude rather than a more parsimonious or generous 
one? This is perhaps a picky question, but if my suspected 
answer—that the LDP added just enough welfare at any 
point to keep one vote ahead at the polls—is correct, it 
supports my other suspicion that political tactics played a 
greater role in this story, and bureaucratic expertise a lesser, 
than Milly lets on. But these are questions of emphasis, and 
of what the reviewer wants rather than what the author has 
done, and they do not detract from the importance and 
relevance of this study to all those interested in the evolution 
of welfare policy in advanced capitalist democracies. 


The Political Right in Postauthoritarian Brazil: Elites, Insti- 
tutions, and Democratization. By Timothy J. Power. Uni- 
versity Park: Pennsylvania State University Press, 2000. 
284p. $55.00 cloth, $19.95 paper. 


Jorge Zaverucha, Universidade Federal de Pernambuco 


This book fills a gap in the literature on Latin American 
politics. Latin Americans and Latin Americanists have ded- 
icated few efforts to understanding the role of the political 
Right during the process of transition from authoritarianism 
to democracy. This is a pity because the Right, especially in 
Brazil, tends to retard attempts to make institutions account- 
able, thus damaging the prospect of democratic consolida- 
tion. Power’s book is a must for those who want to under- 
stand the role played by the Right in Brazil during the 1980s 
and 1990s. 

The author’s main research question is: Are conservative 
transitions to democracy conducive to eventual democratic 
stability? In order to answer this question, Power tries to 
analyze how those political elites that served the military 
regime (1964-85) perform during democracy. According to 
him, if the continuities from the outgoing authoritarian 
regime remain extraordinarily high, then there is a conserva- 
tive transition. In this kind of transition, elites retain impres- 
sive influence in the postauthoritarian regime. The mainte- 
nance of authoritarian elite interests, which varies in intensity 
from country to country, is the price paid by democrats for 
securing the acquiescence of those elites in exchange for the 
control of government. 

For Power, the fate of authoritarian elites is the most 
important variable to be studied. Therefore, he focuses on 
two institutional channels that link state and society in 
democracy: parties and parliament. Power writes up his 
comprehensive research in a smooth way. Nonetheless, al- 
though the front-page photo of his book shows former 
president Sarney with his three military ministers, Power did 
not pay enough attention to the nature of the relations 
between the party or groups that won out in the transitional 
process and the military. 

It is true that the 1988 Constitution, which was written by 
Congress, removed some institutional vestiges of dictator- 
ship. Yet, civilians kept untouched some constitutional 
clauses that deal with the armed forces and police. The 
military have regularly used the right-wing parties in Con- 
gress to block many of the institutional mechanisms that 
could promote changes. The inverse is also true. This polit- 
ical accord contributes to the maintenance of authoritarian 
elites in a position of effective political power, thus retarding 
the “consolidation” of democracy. 

Power delineates three pillars of Brazil’s conservative 
transition: (1) continuity in political personnel, (2) weakness 
of political institutions, and (3) the intensification of anti- 
institutional political practices. He proves that “high individ- 
ualist, anti-institutional orientations are not evenly distrib- 
uted across the political class but are in fact clustered 
on the right” (p. 230). Representative 
institutions do not promote a wide sense of inclusion, and a 
Congress mired in scandals erodes democratic legitimacy. 
Brazil is a political democracy, but it is far from having a 
consolidated democracy. 


Given this dim picture of Brazilian democracy, Power 
challenges the argument made by Daniel Levine (“Paradigm 
Lost: Dependency to Democracy,” World Politics 40 [April 
1988]: 377-94) that conservative transition processes may 
lead to more stable democracies. Power argues that Levine 
thinks in terms of: (1) transitions based on explicit pacts 
(Venezuela and Colombia), but Brazil is a case of a conser- 
vative transition without a pact; and (2) elite-led transitions 
with representation. Brazil, however, is an elite-led transition 
with precarious channels of representation. Given these 
differences, the Brazil case casts doubts on the claim that 
conservative transitions are more durable. According to 
Power, if Brazilian democracy has not met a “sudden death,” 
it can perish from a “slow death.” A third possibility needs to 
be considered: It may move toward neither consolidation nor 
a sudden or slow death. This equilibrium point has already 
lasted for 16 years. For how long should we expect to define 
Brazil’s democracy as “(un)stable”? 

The concept of stability is unclear. Venezuela was consid- 
ered a stable democracy until 1992. Does the fact that 
electoral democracy survived two coup attempts mean that 
democracy is very stable indeed? Or are coup attempts 
indicators of instability? And what about Colombia? Succes- 
sive governments have been elected in the last thirty years, 
but is it possible to define Colombia as a stable democracy? 
“Stability,” like “consolidation,” becomes an ex post facto 
concept. 

Negotiated transition is a concept much more fruitful than 
transition through pacts. It is true that Brazil did not have a 
formal pact, but it is widely known that former president- 
elect Tancredo Neves had informal talks with the military. 
The creation of a Constituent Assembly formed by the 
existing Congress, rather than a specially elected National 
Constituent Assembly, is one indicator of this. Lack of space 
does not allow me to mention other examples. How does one 
label the Uruguayan Club Naval talks? They were more 
explicit than the Brazilian ones but less explicit than the 
Spanish Moncloa pact, in the sense that no official records 
were kept of the conversations. For example, although the 
Colorados and the military believed that there would be no 
prosecutions, other political forces argued that no written 
clause existed, and they pressed for a plebiscite on the issue. 

The book mentions two conservative negotiated transi- 
tions: Brazil since 1985 and Spain from 1975 to 1979. In 
Spain, contrary to Brazil, concessions to the former regime 
were temporarily made. It is true that José Sarney and Adolfo 
Suárez both came from the authoritarian camp, but it should 
be emphasized how both actors discounted time differently 
for their own purposes. This may help explain the short 
period of Spanish conservativeness. 

Suárez saw himself as the head of an interim government. 
He worked for writing a democratic constitution and called 
elections quickly. He won and gained legitimacy to lead the 
process of democratization. Sarney, on the contrary, pressed 
Congress for a full five-year presidential mandate, although 
he replaced Tancredo Neves, who died moments before the 
inauguration. Without strong support from his own party, 
Sarney, with the Right’s support and military intimidation, 
got what he wanted from Congress. Hence, he postponed 
pressures for democratic advances and had time to regroup 
conservative elites, which were divided at the end of the 
military regime. 

Although I share Power’s view of the Brazilian Right, I 
think his statement that this conservative transition is not 
durable is premature. Perhaps he did not pay enough atten- 
tion to the possibility that the state could remain coercively 
authoritarian despite the procedures of democracy. It is true 
that Brazilian democracy is fragile, and that an authoritarian 
retrogression may occur, but it is also true that it can last in 
its conservative form for many years to come, as long as the 
Right does not lose its control over the national political 
agenda. As Power reminds us, the 1964 coup happened when 
the Right lost this kind of control. 


Labor and the State in Egypt, 1952-1994: Workers, Unions, 
and Economic Restructuring. By Marsha Pripstein Posus- 
ney. New York: Columbia University Press, 1997. 327p. 
$49.50 cloth, $17.50 paper. 

Ibrahim M. Oweiss, Georgetown University 

Marsha Pripstein Posusney presents a well-documented but 
narrowly focused study based on her research in the United 
States and Egypt as a graduate student. Despite a typical 
left-of-center point of view, with all its articulation, terminol- 
ogy, and vocabulary, the book is well researched, thoroughly 
documented, and lucidly written. 

In between a lengthy introduction and a brief conclusion, 
the book is divided into five chapters. The first three lay the 
groundwork for an explanation of Egypt’s labor union, whose 
economic reform is covered in chapter 4. The fifth chapter 
discusses modalities of a new era in the relationship between 
labor and the state. Several case studies are presented in 
detail after the government reversed position and adopted a 
policy of privatization. The complicated interaction between 
labor leadership and the Egyptian regime, the presidency and 
the ruling party, and some of the opposition groups, such as 
the Al-Wafd Party, is lucidly covered. 

The author divides the period of study into three segments. 
The first extended from 1952 to approximately 1955 and was 
marked by confrontation between the revolutionary leader- 
ship and the labor movement. Strikes were outlawed, and 
labor was denied the right to establish a national trade union 
confederation. Following the coup led by Gamal Abdul 
Nasser against King Farouk in 1952, the revolutionary lead- 
ership meant to consolidate its power over Egyptian society, 
including labor. 

The second period emerged gradually from the first and 
lasted until the socialist decree of July 1961 became effective. 
The size of Egypt’s public sector became extensively large, 
and the vast majority of labor came under the control of the 
government, which thought of labor as a base for its legiti- 
macy. In an attempt to create a strong linkage with labor, it 
was mandatory for any union officer to be a member of 
Egypt’s only party, the Arab Socialist Union. 

The third period started when Sadat succeeded Nasser and 
gradually consolidated power, achieved in 1971, and contin- 
ued to the end of the author’s research in the mid-1990s. In 
that period, the author discusses Egypt’s labor movement 
within the context of its political economy and its political 
reform under the aegis of the International Monetary Fund. 
The author, however, does not realize the distinctive roles of 
the International Monetary Fund and the World Bank in 
their involvement with Egypt. She views them as one coher- 
ent institution with a specific agenda and targets as far as 
Egypt’s economic liberalization is concerned. 

The author addresses the effect on Egypt’s labor move- 
ment during the third period of the Economic Reform and 
Structural Adjustment Program (ERSAP). The negative reaction 
of labor and its opposition to economic liberalization 
under ERSAP are emphasized. The book analyzes in detail 
the effect of privatization on unemployment in the short run. 
Yet, there is no coverage of the long-run consequences, that 
is, job creation through economic growth. The reforms 
pushed Egypt's growth rates to 5-6% in the second half of 
the 1990s. The author’s sympathy (p. 7) with labor opposition 
to reforms may color her evaluation. 

The book draws certain conclusions that cannot be sub- 
stantiated. The author states that the 1993 standby credit 
agreement “was Egypt's first positive review from the Fund 
after thirty years of relations” (p. 5), but in the 1970s there 
was another positive review, which led the Paris Club to 
reschedule Egypt’s debt. Furthermore, there is a contradic- 
tion between that statement and another on page 210: “Egypt 
had signed with the IMF in May 1987.” In several places the 
author describes Egypt’s monarchy, overthrown by the 1952 
revolution, as “British-backed.” Yet, under the monarchy, 
Egypt unilaterally canceled the 1936 Anglo-Egyptian Treaty, 
and consecutive monarchal governments were adamant 
about getting British troops out of Egypt. On February 4, 
1942, the British ambassador ordered a tank to destroy the 
fence of Abdeen Palace and gave King Farouk an ultimatum 
to change the cabinet. On July 23, 1952, British troops were 
stationed only miles away from Cairo and could have executed 
a rescue plan if King Farouk indeed had British 
backing. 

The book relies on many secondary and tertiary sources, 
such as the Middle East Economic Survey, although the 
author could have reviewed primary sources when she was in 
Egypt researching her dissertation. Nevertheless, this is a 
good study of Egypt’s labor movement since 1952. The 
complexities of its structure, institutions, and fluctuating 
relationship with Egypt’s political leadership are presented in 
detail and well analyzed. 

There is an extensive bibliography, and the book offers 
several case studies crucial for a deep understanding of 
Egypt’s labor movement. Although the author’s sympathies 
are evident, she gives a detailed account of how the labor 
movement was affected by the political environment in Egypt, 
especially the substantial changes from socialism to a market 
economy. The book fills a gap in the literature on Egypt’s 
labor movement within the context of political change. 


Without a Map: Political Tactics and Economic Reform in 
Russia. By Andrei Shleifer and Daniel Treisman. Cambridge: 
MIT Press, 2000. 223p. $25.95. 

Stephen White, University of Glasgow 

It is, of course, still too soon to say, but this has not stopped 
students of postcommunist transition from reaching some 
rather different conclusions. When reformers were in the 
saddle, the optimists held sway. Russian government policy 
was itself based upon their recommendations. Western governments 
supported Boris Yeltsin, even when he was unconstitutionally 
dissolving the Russian legislature. Inflation shot 
up, but (we had always been told) you could not cross a 
precipice in two steps. A massive program of privatization 
was launched in 1992 and pressed further in the years that 
followed. Yeltsin, the architect or at least the guarantor of 
change, won Russia’s first presidential election in 1991 and 
won again in 1996 on the basis of a new and postcommunist 
constitution that invested him with far-reaching powers. 
But the pessimists were already gaining ground. The 1993 
election was a nasty surprise: Enthronement of a newly 
confected ruling party was expected, not the success of 
Zhirinovsky’s Liberal Democrats. In 1995 it was the Communist 
candidates who came first, with Yeltsin supporters in a 
humiliating third place. More than this, the recipe for 
economic transformation was beginning to look a little 
suspect. There was no swift recovery, as in Poland, but a 
steady and historically unprecedented fall in GDP and living 
standards. Social indicators declined in parallel; most nota- 
bly, the population began to fall in 1992 and has been falling 
ever since. Industrial output was concentrated in even fewer 
factories than before; traditional industries held up better 
than electronics; and exports depended even more heavily 
than before on oil and gas. The collapse of the currency in 
August 1998, and of the Kirienko government, appeared to 
represent a final and devastating verdict on everything that 
had happened under the label of reform since the start of the 
decade. 

Shleifer and Treisman are optimists, and ones who through 
their direct involvement in Russian events are well placed to 
present a plausible account from this progovernment per- 
spective. But this is a second-edition optimism, more cau- 
tious: No longer are reforms simply bound to succeed, the 
question is whether they exceeded reasonable expectations 
and whether they were politically feasible at the time. In this 
reading, the reformers enjoyed some remarkable early suc- 
cesses, particularly the privatization of most of state industry 
in 1992-94. Then they outmaneuvered a coalition of specu- 
lative banks and subsidized enterprises to get inflation down 
to more reasonable levels in 1995. They managed to achieve 
these successes by winning over key opponents and margin- 
alizing others. But after Yeltsin’s reelection in 1996 the 
reformers encountered stiffer resistance among the industrial 
barons that privatization had created and among regional 
governors. So tax reforms slowed down, and public finances 
weakened. 


Perhaps the leading merit of the carefully crafted analysis 
in Without a Map is the emphasis placed upon political 
constraints of this kind and upon the stakeholders who 
together constituted the proreform and antireform coalitions. 
There were four in particular: industrial ministries, industrial 
directors, workers, and regional and local governments. To 
run an enterprise in 1990s Russia, it was necessary to bring 
together most of these stakeholders, and certainly the last 
three (the role of industrial ministries weakened as enterprises 
gained the right to elect their own management). 
Somewhat confusingly, there were five stakeholders in the 
“system of federal tax collection and economic control that 
existed in Russia in the 1990s” (pp. 137-8): the federal 
government, regional governors and legislatures, local governments, 
enterprises, and state tax officials. Still more 
confusingly, “four sets of actors dominated Russia’s politics 
in the 1990s” (p. 178): regional governments, the central 
political leadership, the central bank, and the powerful firms 
that dominated natural resource extraction. 

The value of Without a Map lies more in the detailed 
exploration of the role of these stakeholders than in the 
larger discussion of reform. It exaggerates the extent of the 
changes that have occurred. Private enterprises, certainly, 
accounted for 88% of the total by 1998, but they were 
responsible for no more than one-quarter of industrial output 
and one-third of the workforce (mixed forms of ownership 
were rather more important). Privatization was in any case 
largely a paper transaction, given that the second and most 
favored option in the legislation was an employee-manage- 
ment buyout. Vouchers were distributed, but they brought 
few benefits to ordinary people, and the general view (84% in 
a representative U.S. Information Agency survey) was that 
the whole exercise had mainly benefited the mafia and 
members of the CPSU nomenklatura. Agriculture was little 
affected, and by the late 1990s no more than 2% of output 
came from private commercial farms. 

Shleifer and Treisman were “unable to find a single study 
that does not show positive effects of privatization on restructuring 
in Russia” (p. 36). Joseph Blasi and his colleagues, in 
a study frequently cited in Without a Map, conclude that 
managers were just as keen as their Soviet predecessors to 
retain state subsidies, cheap credits, and protection from 
foreign competition (Kremlin Capitalism, 1997). More recent 
findings suggest that privatization “failed to bring any signif- 
icant change in the way Russian companies were managed” 
and “had very little (if any) effect on gross output and average 
output per employee” (Vladimir Tikhomirov, “The Second 
Collapse of the Russian Economy,” Europe-Asia Studies 52 
[March 2000]: 222). There is certainly little evidence in 
macroeconomic performance of the dynamic effects that 
privatization was supposed to have had: National income 
plunged to half its previous level under the guidance of the 
reformers, investment fell even more sharply, and growing 
numbers of workers lost their employment. 

For a political scientist, one surprise of Without a Map is 
that so little attention is given to the electoral legitimation of 
the policies of the Yeltsin government. The parliament, for 
instance, was “hostile to market reforms from the start” (p. 
viii), but at least it had been popularly elected, whereas 
Gaidar owed his position to a presidential decree. Yeltsin 
himself was elected in summer 1991, but with a Communist 
running-mate, and there was no indication that he would 
shortly embark upon a fundamental change in the economic 
and social system. There was some indication in the spring 
1993 referendum that Russian voters broadly supported the 
new leadership and its policies, but this was after the event. 
Promarket reformers received little support in the elections 
of December 1993 and even less in December 1995. Is it 
surprising that there were “constraints” in implementing 
policies that were not approved by the Russian people at a 
general election and that clearly reduced many of them to 
destitution? 

Shleifer and Treisman are impatient with the idea that 
there may be special countries or that Russian cultural or 
historical circumstances may have made a difference to 
economic strategy. The only conclusion is that opposition to 
the reformers was misconceived and often politically moti- 
vated. But the objections to the Yeltsin-Gaidar strategy— 
from a group of critics that included Nobel laureates—were 
not necessarily in terms of the Russian soul but in terms of 
Russian objective conditions. As James Tobin and others 
argued at the time, there should have been more attention to 
competition than privatization and a more gradual approach 
toward the process of change rather than a continuation of 
the radical measures that had led to a “deep crisis” (Nezavisimaia 
gazeta, July 1, 1996). Indeed, one wonders why the 
whole process is labeled reform at all; it would beg fewer 
questions to speak of it as the (attempted) construction of 
capitalism, even if the outcome was neither reform nor a 
functioning market economy. 


Democratizing Communist Militaries: The Cases of the 
Czech and Russian Armed Forces. By Marybeth Peterson 
Ulrich. Ann Arbor: University of Michigan Press, 1999. 
292p. $57.50. 


Jonathan Adelman, University of Denver 


This relatively slim volume (187 pages of text) provides a 
useful and valuable guide to an often overlooked aspect of 
the post-Cold War international transitions: that of the vital 
transition of militaries from communism to democracy. Ul- 
rich makes a good case that militaries, with their control of 
the instruments of violence, also need to be studied, if only 
because they can block or hinder the democratization process. 
The literature on democratization and on civil-military 
relations pays little attention to the democratization of 
communist militaries. Ulrich shows that Samuel Huntington’s 
thesis (The Soldier and the State, 1957) needs to be modified 
in that military professionalism and civilian control differ 
significantly between communist and democratic militaries. 

Ulrich highlights well the deficiencies of traditional ap- 
proaches (such as those of Huntington) that assume the 
military can stand aside from and be unaffected by great 
societal changes. She shows the critical differences between 
civil-military relations in democratic and communist systems 
as well as the importance of democratic political control and 
democratizing postcommunist states. In a very good table 
(pp. 24-5) Ulrich shows the marked differences between 
military professionalism in a democratic and communist state 
in such areas as recruitment and retention, promotion and 
advancement, officership and leadership, education and 
training, norms of political influence, prestige and public 
relations, and compatibility of military and societal values. 
Thus, democratic political control and democratic military 
professionalism are critical to democratic militaries that can 
overcome the legacies of the past. 

Yet, there are difficulties with this volume. The first, and a 
major one from the viewpoint of comparative historical 
analysis, is whether the fundamental comparison of Russian 
and Czech cases makes sense. Both countries are on the 
periphery of Western Europe and underwent democratic 
overthrow of the communist regime, but Russia has been a 
great and vast power, even a superpower, whereas the Czech 
Republic (itself only a recent creation) has been for the most 
part either a smallish part of a larger empire (Hapsburg) or 
a relatively dependent smaller state in Central Europe. 
Russia made history, whereas the Czechs had history made 
on them. Culturally, the Czechs were an integral part of 

culture, and the Russians were peripheral outsid- 
ers. Ideologically, the Russians embraced communism in the 
October Revolution, but the revolution in Czechoslovakia, 
although much more popular than elsewhere, still in 1948 
relied on Russian power. On the eve of communism, Czecho- 
slovakia in 1948 was historically one of the more advanced 
industrial powers in Europe (renowned for its Skoda works), 
whereas Russia in 1913 was an industrial laggard with high 
rates of illiteracy and semiliteracy. 

Furthermore, there is an exaggerated emphasis on the 
power of exogenous factors—especially American power and 
influence—to transform the nature of militaries, communist 
or not. Ulrich argues that “the United States’ inability to 
overcome its own Cold War legacy as evidenced in the 
persistence of Cold War bureaucratic inertia accounts for 
much of the lack of success” of Russian and Czech military 
reform. She is even more explicit in laying the blame on 
American policy by adding: “The United States was unable to 
release adequate resources from its defense arsenal... to 
fund and staff sufficiently efforts to help post-Communist 
militaries make the ideological and organizational shifts 
necessary to consolidate democracy in the region” (pp. 
180-1). But, by her own later statements (p. 184), “the 
prevalence of democratic values and expectations as evi- 
denced in the oversight capabilities of the developing demo- 
cratic institutions, the media and the society at large deter- 
mined the extent of democratic political control of the armed 
forces.” 

The United States has some influence, but institutional 
change, unless directed by an occupying power (as in 
Germany and Japan, but not in the post-Cold War cases), 
must largely come from within a society and an institution, 
especially a total institution such as the Russian military. The 
Soviet military was a powerful and successful institution 
proud of its great victories in World War II and its role in 
elevating the Soviet Union to global parity with the United 
States, space exploits, and superpower status. Democracy, as 
the author does mention at one point, is seen as destroying 
the power status, glory, and budgets of the military, whereas 
Marxism-Leninism brought it military glory, high budgets, 
and great status. Thus, even much greater American funding 
would have had little effect and perhaps even, coming from 
the triumphant enemy, a negative role in changing the 
Russian military. 

Ulrich argues at the beginning that “the military institu- 
tions of the former Soviet bloc must overcome patterns of 
interaction between civilian authorities and military leaders 
that contrasted sharply with the norms of interaction that 
their Western democratic counterparts experienced” (p. 1). 
The problem is whether there is only one possible model for 
modern countries, that of the United States, or whether 
countries with very different levels of economic and political 
development and different cultures, geography, and histories 
can evolve in a somewhat different direction. If Japan can 
develop models for democracy (dominant one-party system) 
and capitalism (strong government role and zaibatsu driven), 
why cannot Russia create its own models that incorporate 
some Western notions and others compatible with deep 
Russian traditions and history? 

Furthermore, has enough time elapsed to make a serious 
judgment about the two militaries? Although political scien- 
tists do not adhere to historians’ 30-year rule, Ulrich is 
making judgments based on less than seven years’ experience 
in postcommunist Russia (and even more a pre-Putin Russia) 
and merely ten years for the postcommunist Czech Republic 
(only five years in its post-Slovakia phase). At a similar point 
after the American Revolution, the United States (1788) still 
had not consolidated and was wrapping up the Articles of 
Confederation. Seven years after the French Revolution 
(1796) Napoleon had not yet appeared on the scene, and 
seven years after the English Revolution (1656) Oliver Cromwell 
was still lord protector and the restoration of Charles II 
was four years away. It may be too soon—and the material 
too fresh—to understand fully what from the viewpoint of a 
decade or two may be much clearer. 

We do walk away with a heightened appreciation of the 
difficulties of the transition regimes, of the resistant power of 
institutions to change even in the face of failure and disaster, 
and especially the parlous state of the Russian military. 
Corruption, low morale, terrible living conditions, public 
disapproval, and poor battlefield competence hinder the 
largely untransformed army at every turn. Overall, then, this 
is a valuable book in achieving its goals, but other issues need 
to be addressed. 


Confessions of an Interest Group: The Catholic Church and 
Political Parties in Europe. By Carolyn M. Warner. 
Princeton, NJ: Princeton University Press, 2000. 249p. 
$55.00 cloth, $17.95 paper. 


Stathis N. Kalyvas, University of Chicago 
The empirical puzzle at the heart of this book is the diverging 
postwar behavior of the Catholic Church in France and Italy 
(1944-58). Why did the French church, which needed far 
more political help to recover lost ground after World War 
II, link only superficially with a political party before abandoning 
it altogether, whereas the Italian church, which exited 
the war in much better shape, forged much stronger connections 
with a party? Phrased more generally, why and how do 
churches pick their political allies? 

To address this question and the more general issue of the 
links between interest groups and parties, Carolyn Warner 
adopts a two-pronged strategy. First, she offers an “economic 
metaphor” (p. 5) that draws on the microeconomic theory of 
the firm: The Catholic Church is seen as an interest group 
whose actions can be “modeled” as if it were a firm in a 
market seeking a supplier of goods. The question is then 
restated in economic terms. Second, there is an extensive and 
nuanced historical narrative of the two cases, together with a 
short excursus in Germany, along the following lines. Al- 
though both churches supported Christian Democratic par- 
ties (the Christian Democratic Party [DC] in Italy and the 
Popular Republican Movement [MRP] in France), the 
French church did so much less vigorously than the Italian 
one, due to a complex combination of factors that included 
past legacies and divergent structures and preferences (such 
as traditions of anticlericalism, more or less centralized 
church structures, and leadership preferences). Eventually, 
the French church abandoned the MRP altogether when it 
failed to promote the church’s policy agenda. In short, “the 
key difference was that the Christian Democratic Party did 
not fail the Church on the Church’s core policy concerns; the 
Popular Republican Movement did” (p. 118). But why did 
the MRP do so? Warner argues that its ideological concerns 
did not match those of the church. The loss of church support 
led to the demise of the MRP. 

If Warner’s account is correct, the empirical puzzle that 
motivates the book is not much of a puzzle. If the MRP 
defaulted on its side of the deal, why is it surprising that the 
church abandoned it? More generally, if the French church 
was multifariously plagued by its past (and one encounters 
variations of this argument in many historical accounts), what 
is surprising about its failure to establish a strong political 
bridgehead? In fact, the real puzzles turn out to be the 
demise of the MRP (as compared to the success of the DC) 
and its decision to forgo the support of the church, which led 
to its own demise, both of which Warner addresses. 

Both the MRP and the DC have been the objects of 
numerous historical studies, and the failure of the MRP in 
particular has motivated a number of them. Warner offers by 
far the most thorough and ambitious comparative treatment 
of these two parties during the crucial 1944-58 period. 
Nevertheless, the book has a number of problems. First, it is 
not entirely convincing that church support is the only 
variable that explains the MRP’s failure. Indeed, the diver- 
gent trajectory in the two countries is heavily overdeter- 
mined. For example, the reason the French church was not as 
anticommunist as the Italian one may be found in the 
variation of the threat posed by the Communist Party in each 
country; the war experiences of the two churches, their 
prewar politics, which in one case go as far back as the French 
Revolution; or how specific organizations and leaders inter- 
preted this past. It is not always easy (or even possible) to 
discriminate among all these variables, given the research 
design. 

Second, some historical arguments, in particular those 
concerning the pre-1944 period, are debatable. They include, 
among others, church perceptions of the French Third Re- 
public, the effect of the Dreyfus affair, and the legacy of the 
prewar Catholic political mobilization. 

Third, the account of the withdrawal of church support 
from the MRP is problematic in the following way. Whereas 
the two Catholic churches and the DC are treated as rational 
actors, the MRP is (evolutionary) irrational: It defaults on its 
deal with the church, effectively committing suicide. The 
reason, Warner argues, is that the MRP was blinded by 
ideology; it was an “idealistic” party (p. 184) whose view of 
itself undermined its office-seeking interests (p. 168). This 
may well be possible, although one wonders why the MRP 
did not behave as idealistically in the first place (i.e., in 1946). 
Still, treating the MRP as an exceptional type of actor raises 
serious doubts about the overall argument, both substantively 
and analytically. Moreover, explaining the MRP’s demise by 
pointing to its self-defeating preferences verges on the tau- 
tological. 

Finally, the historical narrative is not fully consistent. It 
reads at times like an ideational story, in which ideology 
dominates, and at times like a structuralist-determinist account, 
in which existing structures and past legacies determine 
the present (e.g., pp. 140-1). In both instances, and 
contrary to what Warner asserts in the conclusion (p. 207), 
there is hardly room for any strategic interaction. 

On the positive side, Warner is to be commended for her 
exemplary archival research in France and Italy. The material 
from primary sources is of a quality one rarely finds in 
political science. The high point of the book is chapter 7, 
which provides a concise yet extremely rich depiction of the 
mobilization techniques used by the church in the two 
countries that should be read by students of party politics. 
Contrary to much work in the rationalist tradition, Warner 
does not assume away the actors’ preferences but scrutinizes 
them carefully. She historicizes her interpretations by provid- 
ing a plausible account of how past and present contexts 
shape actors’ understandings of the world around them as 
well as their own self-understandings. These are not easy 
tasks, and Warner generally carries them out in a competent 
manner. 

Yet, Warner's combination of a rationalist framework and 
a historical narrative ultimately falls short. The analytical 
payoff of the economic metaphor neither generates nonob- 
vious or counterintuitive predictions nor resonates in the 
historical narrative. Notwithstanding claims to the contrary, 
the economic metaphor is hardly used as a “tool” (p. 220), 
even less “a model” (p. 12). Do we really need economics to 
figure out that “the Churches chose the Christian Democratic 
parties because the parties’ programs and capacities most 
closely matched Church preferences” (p. 115)? The numer- 
ous qualifications of actor “rationality” due to various con- 
straints (prior choices, leaders’ ideology, institutions and 
structures) further rob the rationalist framework of whatever 
analytical and explanatory leverage it may have; in short, it 
feels like an artificial structure that has been superimposed 
on the historical narrative. 

Research that aims to stride the fine line between deduc- 
tive analysis and historical narrative faces a crucial dilemma: 
Either its analytical framework is functional (it is primarily 
deductive and yields nonobvious predictions), in which case 
the narrative ought to be “harnessed” by it and made as 
transparent as possible (even at the cost of simplifications); 
or the narrative is better left without a confining explicit 
framework (either because the story is too rich and multifac- 
eted or because the analytical framework may not be geared 
toward the particular class of events described or attuned to 
the kind of evidence produced). By disregarding this di- 
lemma, Warner is pulled in two directions at once. She has 
produced a fine electoral history of political Catholicism in 
postwar Italy and France rather than a theoretical treatment 
of linkages between parties and interest groups. 


Forging Democracy from Below: Insurgent Transitions in 
South Africa and El Salvador. By Elisabeth Wood. Cambridge: 
Cambridge University Press, 2000. 247p. $64.95 
cloth, $22.95 paper. 

Jenny Pearce, University of Bradford 

Scholarly communities have become increasingly segregated 
over the last decade or so. A key issue such as democratization 
is explored from a multiplicity of intellectual angles, 
academic lenses, and disciplinary epistemologies. There are 
the political scientists who have specialized in democratic 
transitions, with an emphasis on elite bargains and pacts, as 
well as sociologists and social movement theorists, who focus 
on new social actors and the redefinition of the boundaries of 
politics “from below.” There are political theorists, some of 
whom assume that democracy is a limited set of procedures 
and the institutions that enable them to work, and others for 
whom democratization means “more democracy” measured 
by wider and more active political participation. There are 
the scholars of revolutions and guerilla movements who tend 
to see El Salvador not as a case of successful democratization, 
as it is for Wood, but as an example of a failed 
insurgency. 

Comparative historical sociology seeks to identify and 
systematize specific paths to democracy through the use of 
two or more cases in which the critical variables can be 
isolated against the background of otherwise very divergent 
contexts and conditions. When the democratization process 
is linked to the end of a conflict or civil war, there are those 
who see it through the interdisciplinary lens of peace studies 
or through the policy lens of governmental or nongovern- 
mental peace programs. How can peace agreements restore 
democratic procedures of conflict management or build them 
where they have never existed? Indeed, can a peace agree- 
ment exist without some prior commitment to democracy? If 
so, how does that commitment come about when there have 
been no antecedents? 

It is the great merit of this comparative study of the paths 
to democracy in El Salvador and South Africa that Elisabeth 
Wood bridges all these approaches while retaining intellec- 
tual coherence. In addition, she does this through rigorous 
scholarship and fieldwork in two countries, and particularly in 
the case of El Salvador, which she knows best, she links 
detailed microlevel ethnographic work with discussion of 
large-scale macrolevel processes. The book is an undoubted 
academic success, which should encourage fruitful and all too 
rare engagement across the different intellectual communi- 
ties outlined above. Our understanding of the relationship 
among issues of peace, democracy, and historical change 
could greatly benefit from such engagement. 

The core argument derived from the two case studies is 
that democracy can be brought about from below by an 
alliance of militant workers, urban poor, and peasants who 
are prepared to confront historically repressive elites. The 
alliance thus represents a specific insurgent path to democ-
racy. But the author’s contribution is not this relatively
straightforward claim. Others have pointed to the role of 
subaltern classes in struggles for democracy. Wood identifies 
the conditions in which such militancy can be successful and 
explains why it was possible to bring the civil conflicts in her 
two cases to a negotiated solution, which has eluded many 
other civil conflicts in the world today. 

In the cases of El Salvador and South Africa, which have 
evident dissimilarities in terms of size, social divisions, and 
the pattern and level of development, there is one basic 
similarity. They share a history of oligarchic rule, in which 
economic elites have relied on extraeconomic coercion, or
political rather than market forms of labor control. Indeed,
this enabled them to generate much higher profits than would 
have been possible under liberal market forms. It also led 
them to oppose with particular virulence and brutality at- 
tempts to liberalize and democratize, as these would have 
challenged the labor-repressive system on which they relied
and which had created an economic interdependence be- 
tween them and the workers, peasants, and urban poor they 
exploited. Only when the latter were able to mobilize politi- 
cally and militarily in such a way that the challenge to this 
economic interdependence rendered the entire historic edi- 
fice of accumulation and production unsustainable were the 
economic elites able to force regime elites to negotiate with 
the insurgents. 

In other words, it was not that the model had become 
unprofitable; rather, mass rebellion by the exploited ruptured 
it and made it unworkable. It was not, as Theda Skocpol 
argues with respect to the classic revolutions and to support 
her structuralist emphasis, that the regime was already weak 
when the revolutionaries rose up. Wood places the revolu- 
tionaries in the center of the picture but locates their 
“success” (her criteria are discussed below and of course are 
very distinct from Skocpol’s) in the structural features of the 
prevailing social relations of production. 

In the course of the civil conflicts, other factors convinced 
recalcitrant elites that the old economic model had to be 
abandoned, such as international sanctions in South Africa. 
In El Salvador the oligarchy could not prevent a first phase of
agrarian reform, which was part of a counterinsurgency 
strategy promoted by the U.S. government, and it repre- 
sented the first breach in their historic concentration of 
landownership and power. In various ways, the war provoked 
the collapse of traditional agro-exports, and remittances from 
refugees and economic migrants to the United States offered 
up new economic opportunities in finance and commerce
that those members of the elite who were less dependent on 
agro-exports could seize as an alternative. 

Wood refutes rather convincingly the liberal argument 
with respect to South Africa that the transition to democracy 
was due to the increasing economic irrationality of apartheid. 
The argument is less clear in the case of El Salvador, where 
the crisis of traditional agro-exports was apparent before the 
insurgency began, particularly with respect to cotton, as 
various authors have shown (e.g., Wim Pelupessy, ed.
Perspectives on the Agro-Export Economy in Central America, 
1991). This was not necessarily understood at the time as a 
terminal decline, however, and the argument that it was 
insurgent actions that brought this home is credible. Never- 
theless, this is one area in which there is more room for 
debate. Is it the path to democracy or to capitalist modern- 
ization that is at stake in these two case studies? And does it 
matter? 

I do not have any problem with the notion that ultimately 
it was insurgents who forced recalcitrant elites to accept 
democracy by making the coercive labor model and politi- 
cally repressive state unsustainable. In addition, this argu- 
ment usefully challenges those who see only failure in the 
Salvadorean guerilla movement's abandonment of its initial 
socialist aspirations and acceptance that it could not over- 
throw the Salvadorean oligarchy (although the same people 
would see the African National Congress as a successful 
movement, given its victory over apartheid). Yet, I wonder 
whether the goal of liberal democracy can be judged in the 
same light today as it was, for instance, when Barrington 
Moore (Social Origins of Dictatorship and Democracy, 1966)
wrote about the social origins of dictatorship and democracy, 
taking the 1940s as his point of reference, a moment when
winning state power was still a highly meaningful objective. 

Free elections and the right of previously excluded groups 
to participate politically were important, but the ability to 
turn this into effective political power and a domestically 
accountable economic project had been severely reduced by 
the time the first free elections were held in the two countries 
in 1994. Wood is fully aware, of course, that the context of 
neoliberal globalization favored a negotiated settlement in 
both cases. It reassured elites that by relinquishing the 
labor-repressive model of accumulation they were not by any 
means being forced to give up economic power. “Elites could 
assume with some confidence that international financial 
agencies and the global mobility of private capital would 
discipline any post-transition government that encroached 
too severely on holders of wealth, professionals, and other 
globally mobile economic elites” (p. 200). International aid 
and financial flows were forthcoming in both cases to assist 
elites in the economic transition to global competitiveness. 
This context limited the redistributive effects of the settle- 
ments on the poor, as Wood acknowledges in her epilogue. It 
also limited the relevance of electoral participation and 
democracy to their everyday lives. 

Winning control of the state (which the former Salvador-
ean guerillas of the Farabundo Marti National Liberation
party have not yet done but certainly could in the future) was
no longer a guarantor of propoor policies. Both El Salvador
and South Africa are today renowned for their high level of 
social violence and crime, the continuing ineffectiveness of their
judiciaries, and the persistence of political corruption. It is 
the capacity to gain access to the informal U.S. labor market 
that provides livelihoods for many Salvadorean families, not 
Salvadorean government policy or economic productivity 
within El Salvador. In this sense, Wood’s emphasis on the 
goal of democracy may prove to be misplaced as a measure of 
insurgent success and achievements. 

The main accomplishment of the insurgents may have been 
to force the path of capitalist modernization. The costs of 
economic restructuring were diminished when balanced 
against the costs of defending the old repressive political 
economy. This economic transition may prove in historical 
terms to be at least as significant as the political one. It may 
also give us important insights into the contemporary civil 
conflict in Colombia; Wood concludes that if a peace process 
there is to endure, it must be self-enforcing. Colombia also 
can be seen against the background of an economic elite 
confronting the costs and challenges of capitalist modernity. 
Will the insurgents tip the balance in favor of a peace process 
that would enable this to happen at the cost only of the 
political participation of guerilla forces, who lack an alterna- 
tive economic project to cause elites much anxiety at the 
ballot box? 

Disarmed Democracies: Domestic Institutions and the Use of
Force. By David P. Auerswald. Ann Arbor: University of 
Michigan Press, 2000. 184p. $44.50. 


James Lee Ray, Vanderbilt University 


This book focuses on the evaluation of six hypotheses regard- 
ing the effect of variation in democratic domestic institutions 
on foreign policy decision making and crisis management. (1) 
If a national election is immediately forthcoming, then 
executives will be reluctant to initiate international conflicts 
or make threats. (2) If an executive’s office tenure is subject 
to legislative confidence, then that executive will be reluctant 
to use force or engage in coercive diplomacy absent a strong 
[...] (3) [...] legislative confidence will have domestic incentives to send
strong signals of resolve internationally. (4) Executives who 
deal with legislatures that can overturn or hinder the use of 
force will be reluctant to use force or make coercive diplo- 
macy threats. (5) If the legislature attempts to overturn or 
curtail the executive’s coercive diplomacy threat, then that 
signals international opponents of a divided democracy, 
regardless of subsequent executive actions. (6) If the legisla- 
ture can overturn or hinder the executive’s use of force but is 
extremely divided, then the executive will be less reluctant to 
use force than he might otherwise be. 

Auerswald derives these hypotheses from a theoretical 
structure based on the assumptions that executives value 
personal survival over national survival, national survival over 
staying in power, and holding onto power over any single
policy preference. Because he examines no cases in which
personal or national survival are at stake, the most important
assumption for this book is that executive political leaders
want to stay in office.

Auerswald acknowledges that his hypotheses might best be
investigated with large N aggregate analyses, but he argues
that there are not enough cases in which democracies
threaten or use force against opponents who could not
endanger national survival to permit such analyses. There-
fore, he selects crucial cases in which a few democracies with
different institutional structures deal with the same or similar 
crises. He focuses on domestically strong pure presidential 
systems, premier-presidential governments, majority parlia-
mentary governments, domestically weak pure presidential
and premier-presidential governments, and coalition parlia- 
mentary governments. 

To evaluate the effect of these different institutional ar- 
rangements and the validity of his hypotheses, Auerswald 
analyzes the policies and reactions of the United Kingdom, 
France, and the United States to the 1956 Suez Canal Crisis 
and the 1995 Bosnian War. He also examines a set of cases, 
or examples, of coercive diplomacy, such as the American 
attempt to compel Syria to withdraw from Lebanon in 1983, 
the attempt by the United States to compel Iraq to remove its
forces from Kuwait in 1990 and 1991, the successful attempt
by the British in 1964 to persuade the Indonesians to halt 
their raids on Malaya, the crisis leading to the 1982 Falklands 
War between the United Kingdom and Argentina, and 
French threats aimed at compelling Libya to withdraw its 
troops from Chad in 1978 and 1983. In the course of these 
case studies, Auerswald attempts to evaluate the relative 
effects of power disparities and realpolitik, and public opin- 
ion, as well as variations in domestic institutional arrange- 
ments on foreign policy decision making. 

Auerswald finds for the most part that his hypotheses are 
valid. Some of his conclusions are more convincing than 
others. At least one of his hypotheses—how legislative efforts
to overturn an executive’s attempt at coercive diplomacy
signal to international opponents that the “democracy is
divided”—seems to verge on the tautological. The logic
behind another hypothesis—if the legislature can overturn or
hinder an executive’s use of force but is extremely divided, 
then the executive will be less reluctant to use force than he 
might otherwise be—is not all that persuasive. What if the 
legislature is not divided but firmly united in support of the
executive’s use of force? If united support is the alternative to
a divided legislature, then one might argue that a divided
legislature actually would make an executive more reluctant 
to use force. 

The hypotheses tend to be worded in such a cautious,
guarded fashion that “invalidating” them is difficult. One 
might wish, for example, that they had focused on the actual 
probability of using force, rather than on whether the exec- 
utive is more or less reluctant to use force. (It is easy to 
suspect that for almost every international crisis in which 
force is contemplated, some official somewhere at some time 
expresses reluctance.) When the focus is on single cases, one 
cannot deal very well with such probabilities. Force either 
was or was not used in a given case. 

On occasion the author appears reluctant to see discon- 
firming evidence. One of the more straightforward hypothe- 
ses asserts that as elections approach, executives will be 
reluctant to initiate conflicts or make threats. In the discus-
sion of the Falklands crisis from the British point of view,
Auerswald fails to see evidence that tends to confirm a 
contrary hypothesis: As elections near, executives will be 
tempted to initiate conflicts in order to divert attention from 
domestic problems and/or to achieve some international 
success to improve electoral prospects. He does note that 
when the British government threatened Indonesia in order 
to compel it to cease attacks on Malaya, the upcoming 
elections seemed to provide a motivation for the government 
to “threaten massive reprisals and... quickly implement 
them” (p. 103). The author does not point out that this 
example runs counter to the hypothesis about the alleged 
pacifying effect of imminent elections. 

Nevertheless, the case studies are in general carefully and 
competently done, and they will serve to enhance the reader’s 
understanding of the dynamics and outcomes of all the 
decisions and events on which they focus. Occasionally, one 
gets the feeling that the author seems destined to find either 
that a case analysis supports one of the hypotheses or that the 
implicit “all else being equal” clause appended to each 
hypothesis is violated in a way that makes it clear why 
apparently disconfirming evidence is not really disconfirming. 
Still, the logic of the case selection and the author’s lucid 
explication of crisis processes and outcomes make this book 
overall a successful attempt to evaluate the effect of institu-
tional variations among democratic governments on foreign
policy decision making. 


Nuclear Strategy in the Twenty-First Century. By Stephen 
Cimbala. Westport, CT, and London: Praeger, 2000. 211p. 
$65.00. 

Power versus Prudence: Why Nations Forgo Nuclear Weap- 
ons. By T.V. Paul. Montreal and Kingston: McGill-
Queen’s University Press, 2000. 227p. $60.00 cloth, $27.95
paper.

Nuclear Monopoly. By George Quester. New Brunswick, NJ,
and London: Transaction, 2000. 234p. $44.95.

Fred Chernoff, Colgate University

The arrival of a new administration in Washington typically 
sparks debate about possible redirection of a range of 
American policies, including strategic nuclear policy. These 
three books come at a time of reevaluation of American 
strategy, which also has prompted American allies and others
to reexamine entrenched assumptions about mutual assured 
destruction, the relationship of nuclear offense to defense, 
and the international power structure. Nuclear Strategy in the 
Twenty-First Century fits into a long tradition of general 
strategic reevaluations. The volumes by T.V. Paul and 
George Quester focus on nuclear restraint—the former on 
why some states have denuclearized and some have refrained 
from building nuclear arsenals, and the latter on why the 
United States did not forcefully prolong its nuclear monopoly 
when it appears it could have done so. 

Stephen Cimbala has produced many valuable works on 
nuclear strategy and international conflict. The present vol- 
ume explores key questions facing American strategists at the 
start of the new century, including the value of weapons of 
mass destruction in the “postnuclear age”; the ability of the 
United States to retain its unique status over time, given the 
challenges that may be expected to arise from China and 
Russia; and the future of nuclear proliferation. Cimbala 
draws on Western, Chinese, and Russian writers on strategy 
to interpret the evidence of the post-World War II era, with 
special attention to the Cuban missile crisis. 

Cimbala argues that the nuclear peace since 1945 has been 
much more a result of luck than of any predictable, system- 
atic effects or well-established theory of nuclear deterrence. 
Taking issue with the systemic theories of Waltz and others, 
Cimbala argues that nuclear competition has sprung up in 
places such as South Asia and the Middle East, where the
rivals view one another with a visceral hostility that far 
outstrips the Soviet-American rivalry when it began. He takes 
up a version of the agent-structure question and asks whether 
it is wise to consider system structure as a distinct framework. 
He notes that systems theories have certain advantages, such 
as allowing simpler expression and fewer variables on which 
to focus. But systemic approaches are acceptable only if two 
criteria are satisfied: The theory must lead to insights that are 
“accurate” and “not otherwise attainable” (p. 74). Cimbala 
contends that neither is met. 

In the chapter on the Cuban missile crisis Cimbala argues 
that nuclear arsenals are much more dangerous than the 
peaceful resolution has led many authors to conclude. He 
insists that the crisis was not a moment of glory for either 
side; instead, it was a result of serious and mutual misman- 
agement. He judges the principals quite harshly, charging 
that the crisis resulted from “a series of political and military 
blunders committed by governments in possession of large 
arsenals and small brains” (p. 62). This represents a distinct 
rethinking of Cimbala’s earlier account (Military Persuasion: 
Deterrence and Provocation in Crisis and War, 1994), in which 
he praised the two leaders’ abilities.

Cimbala argued in Military Persuasion that, in light of 
intelligence that the United States had a substantial strategic 
forces advantage (17 to 1), President Kennedy’s decision not 
“to press Khrushchev to the limit . . . demonstrated an innate 
shrewdness on his part plus an intellectual comprehension of 
the problem of inadvertent escalation” (1994, 14). Cimbala 
adds that Khrushchev was “smart enough” to see the poten- 
tial dangers of recklessness. Khrushchev’s initial gamble of 
emplacing missiles in Cuba “miscarried . . . [but] Khrushchev 
played this weak hand very skillfully. In fact, from the 
standpoint of a process-oriented approach to decisionmak- 
ing, he played it brilliantly” (1994, 15). The account of 
Kennedy’s advisors differs in much the same way in the two 
books. ExComm’s structure and performance are lauded in the
1994 book (p. 107), but in the current volume Cimbala speaks 
of “erratic personalities, uncertain decision-making pro-
cesses, preposterous military advice, and wishful thinking in
favor of preferred scenarios” (p. 63). 

Although Cimbala makes his case well in his latest book, it 
would have been interesting to see him address the reasons 
for his reappraisal. The disparaging of Soviet and American 
leadership fits into Cimbala’s position on nuclear prolifera- 
tion, as he argues that in the Cuban crisis escalation to 
superpower war was avoided not because of nuclear deter- 
rence but because of conventional deterrence: The Soviet 
leadership saw clearly that American superiority in the 
Caribbean was overwhelming. 

Political realists deny that states’ external behavior results 
from their internal structure. Cimbala rejects this position 
but at the same time endorses a view of human nature 
accepted by realists (at least by those who take a position on 
human nature). Cimbala uses the latter view to argue that the 
greater the number of states with nuclear weapons, the 
greater is the danger of nuclear war. He holds that this is true 
for two reasons. First, the states that seek nuclear arms are 
those that are least trustworthy. Second, the requirements of 
information warfare are likely to yield a very dangerous and 
destabilizing combination of forces, since leaders may be 
tempted to use nuclear weapons earlier in a crisis in a world 
of high technology and information-intensive weaponry. 

Cimbala argues that the U.S. successes in the Gulf War 
and the Kosovo campaign have impressed upon other major 
powers, notably Russia and China, that they should work 
hard to close the gap in the area of information warfare. 
Cimbala cites Robert Gilpin’s notion of hegemonic cycles to 
argue that there will likely be challenges to U.S. dominance 
from these states. (He does not mention that proponents of 
Gilpin’s theory would expect military dominance like that of 
the United States to arise from “hegemonic war.”) Cimbala 
makes a strong case for the interesting and suggestive claim 
that the system is transforming in such a way that nuclear 
weapons are becoming tools of the poor and weak rather 
than the rich and powerful. 

Overall, Cimbala’s volume offers a clearly argued, often 
provocative, integrated set of conclusions to produce a 
coherent account of where nuclear weapons strategy stands, 
how it got there, and where it appears to be heading. 

In Power versus Prudence, T.V. Paul raises one of the more 
interesting and neglected questions in security studies, that of 
nuclear restraint. The first two chapters lay out the theoret-
ical explanations of armament, nonarmament, and disarma-
ment, and the next six apply theory to a series of case studies. 
The comparative method leads Paul to group states that are 
similar in economic orientation, alliance relationships, and 
role in the international system (Germany and Japan; Can- 
ada and Australia; South Africa, Ukraine, and South Korea; 
and so on); Paul shows relationships with their (non-/de-) 
nuclearization patterns. 

Instead of taking realist and liberal assumptions as his 
starting point, Paul begins with “situational variables,” the 
most important of which are “the level of conflict and 
cooperation and the level of politico-military interdepen- 
dence in a given region” (p. 15). It is not surprising that levels 
of conflict are associated with patterns of nuclear arms 
acquisition. (Cimbala, as noted, identifies this as well.) Paul is 
free to look for associations between whatever independent 
and dependent variables he likes, but level of conflict and 
cooperation is usually taken as a dependent variable. His 
approach seems to limit the policy applicability of his study, 
since policymakers still need to answer one of the major 
questions in international relations: Why is there a high level 
of conflict in some regions and high cooperation in others? 

Paul finds that nuclear acquisition is linked with protracted 
conflict and long-term rivalry. He argues that the theoretical
assumptions of realism are useful to explain some states’ 
behavior, and the assumptions of liberalism are useful for 
others; their use depends upon where the state is situated 
(high-, medium-, or low-conflict zone). It also depends on 
whether the state has global or regional ambitions and on 
whether the state has a great power protector. The study 
deals only with nongreat powers, as Paul believes great 
powers make decisions based on different—primarily balanc- 
ing—considerations. He attributes to states in low-conflict 
regions an awareness of the security dilemma, whereby 
higher arms levels for one’s own state may lead to decreased 
security due to the resulting arms competition. Such an 
awareness, in a zone of low conflict, may lead to conformity 
to a security regime. The merits of regimes seem to be under- 
valued, however, since Paul appears to employ a rather narrow 
definition of that term. He regards the norms of a regime as 
efficacious only if policymakers explicitly invoke them in debate. 
Proponents of such theories see norms as frequently exercising 
considerable influence in much subtler ways. 

Paul draws from a range of literature on security and 
strategy. He distinguishes the “hard realism” of Hans Mor- 
genthau and John Mearsheimer from the “soft realism” of 
Snyder and Zakaria. He argues that hard realism can explain 
the nuclear restraint of states that rely on the arsenals of their 
great power protector. But since the end of the Cold War the 
flurry of activity has not centered on acquisition of nuclear 
arms (out of a fear of the end of bipolarity) but on renunci- 
ation of nuclear arms by technologically capable states. 
Moreover, the states that both have relied on their protec-
tors’ arsenals and have now become the most capable of
affording their own nuclear weapons, Germany and Japan,
have not taken the nuclear path. Thus, Paul endorses a 
version of soft realism, namely, “prudential realism,” accord- 
ing to which states generally consider the most probable 
threat rather than the worst possible scenario. 

Paul seems to treat the domestic politics explanation rather 
unevenly. He clearly emphasizes external forces but at- 
tributes strong reinforcing roles to the nuclear programs of 
some states, such as Argentina and Brazil, and 
dispatches the hypothesis in the case of others, notably India. 
In the latter case he does not appear to engage the domestic 
politics argument of such authors as Scott D. Sagan (“Why do
States Build Nuclear Weapons,” International Security 21
[Winter 1996/7]: 54–86)—even though Paul notes a 91%
approval rating for the 1998 Pokhran tests. Although the
points Paul raises in these passages are cogent, a more 
genuine comparison of his prudential realism with domestic 
politics explanations would strengthen his conclusions. Nev- 
ertheless, Paul presents a generally insightful review of the 
nuclear programs of the states he selected for study. 

Quester’s Nuclear Monopoly takes up a special case of 
nuclear restraint: Why did the United States not forcefully 
protect its nuclear monopoly in the years immediately after 
World War II? If states seek to maintain relative advantage, 
as widely held theories of international relations suggest, 
then why did the United States “allow” its nuclear monopoly 
to lapse after just four years? The Pax Britannica lasted for
seven decades, and the Pax Romana more than four centu- 
ries. In the period just after World War II, U.S. territory was 
invulnerable to nuclear attack, and U.S. strategic forces were 
capable of striking targets around the globe. The issue is 
timely, since many states are now concerned by the U.S. 
decision to proceed with a missile defense system designed to 
restore the invulnerability of U.S. territory to attack and
the ability to strike targets around the world.

Why did the United States not threaten Stalin with war in 
exchange for a nonnuclear pledge? One popular answer
focuses on the problem of extended deterrence; American 
territory was invulnerable in 1945–49, but the European
allies could be overrun by larger Soviet conventional forces. 
Quester points out inadequacies in this account, arguing as 
do others that the East-West numerical imbalance was a 
symptom of the different political systems and moral values, 
not inherent capabilities. Quester examines various other
answers and concludes that the reason for the restraint was 
that the United States, which saw itself as a moral nation, 
could not entertain outright blackmail and the threat of 
nuclear destruction against a state with whom it was not at
war. 

There were advocates of the use of American force to 
preserve the nuclear monopoly who argued that the long- 
term benefits of stability would outweigh the short-term 
destruction. Significantly, few of those advocates occupied 
important positions in government. Exceptions were in the 
Pentagon, such as General Leslie Groves, General Curtis 
LeMay, and Secretary of the Navy Francis Matthews. It is 
interesting to note that two of the most brilliant minds of the 
twentieth century, in my view, Bertrand Russell and John
Von Neumann, are among Quester’s outright advocates of 
preemption. 

The central question of Nuclear Monopoly raises two 
difficult methodological problems. First, Quester seeks to 
understand a nonevent, that is, why something did not 
happen. Second, he develops his argument by considering 
numerous counterfactual conditionals, such as: “If Stalin had 
not been so politically threatening, would all of our thinking 
on the need for assured second-strike retaliatory forces 
perhaps never have emerged” (p. 156)? Or: “Would Stalin 
have been as patient and cautious,...if he held a nuclear 
monopoly, facing the possibility that the United States was 
working to break this monopoly” (p. 169)? The unique 
problems raised by these sorts of questions perhaps should 
have been addressed (cf. Philip Tetlock and Aaron Belkin, 
eds., Counterfactual Thought Experiments in World Politics, 
1996; or Gary King, Robert Keohane, and Sidney Verba, 
Designing Social Inquiry, 1994). What are the standards, 
criteria, and methods for testing these hypotheticals and for 
determining when one has reached an answer? 

When testing hypotheses, one typically tries to find com- 
parable cases (perhaps using a version of John Stuart Mill’s 
methods) that will shed some light on the matter under 
investigation. Such cases may be hard to find, especially given 
Quester’s central question. A more detailed comparison with 
the Pax Romana or Pax Britannica, which Quester mentions, 
would help. Also, there is a modern liberal democracy that 
views itself as moral and did choose to bomb a dictatorial 
rival in order to prevent it from developing nuclear weapons, 
namely, Israel in its 1981 attack on Iraq’s suspected nuclear 
weapons installation at Osiraq. It is surprising that Quester 
does not use this potentially useful case as a comparison. 
Overall, Nuclear Monopoly is, like most of Quester’s works, 
an insightful and original exposition of an important problem 
with a well-argued answer. 

Each of these books has its own strengths and weaknesses, 
but all three take up important questions, advance the 
existing debate, present clear expositions and adequate em- 
pirical background to make the theoretical issues vivid for 
advanced undergraduates, and offer much to debate—and 
with which to disagree—for scholars of nuclear politics and 
history. 

Idealism and Realism in International Relations: Beyond the
Discipline. By Robert M. A. Crawford. New York: Rout- 
ledge, 2000. 198p. $85.00. 


Karen A. Mingst, University of Kentucky

Robert Crawford, Lecturer of Humanities and Social Sci- 
ences, University of British Columbia, presents a devastating 
critique of the discipline of international relations as cur- 
rently practiced and offers a revised conception that accepts 
the “inherent pluralism of international relations theory” (p. 
135). The argument is complex, grounded in theory and 
history, and it will be appreciated only by those well schooled 
in the theories and major theorists. The book is a difficult 
one, to be savored by committed international relations 
scholars in a careful reading. 

Crawford’s powerful critique revolves around three inter-
related arguments. First, idealism and realism have been
“severed from normal usage and transformed into ‘para- 
digms’ within a unitary science” (p. x). Theorists have 
incorrectly drawn upon the ideas of traditional political 
theorists, selectively appropriating concepts for their own 
purposes. The result is that idealists have become “a synonym 
for gullibility and naivete” (p. 64), and realists, captured by 
dominant positivist methods, have been divorced from social 
theory and wedded to the notion that IR is an unchanging 
phenomenon. If idealism and realism were examined within 
their proper contextual roots, we would find that these are 
“not competing perspectives within the discipline, as much as 
fundamental ineradicable faultlines that run under it” (p. x).

Second, Crawford musters his most devastating critique for 
those committed to a science of international relations, 
namely, American theorists. In the search for law-like gen- 
eralizations, scholars such as Kenneth Waltz eschew tradi- 
tional normative questions of political philosophy. The dom-
inance of realism approaches “intellectual totalitarianism”
(p. 17), which puts the discipline “in exile from the usual 
methods, debates, and concerns of social-political theory” (p. 
63). “A science of IR, or any branch of politics, that 
concentrates only on how things are (or how they appear to 
be) is not merely one-sided, but misses the whole point of 
political analysis” (p. 34). 

In contrast, the British international relations community 
is more intellectually questioning and tolerant of new ideas. 
Whereas Americans see a discipline in crisis when confor-
mity is not achieved, the British see liberation in “breaking
free of consensus-based disciplinary models” (p. 135). 

Third, Crawford does not find solace in critical theorists, 
united only by criticisms of structural theory, or deconstruc- 
tionists, who see IR exclusively as individualized perspectives. 
The latter, including Richard Ashley, are particularly subject 
to disdain. Although Ashley alerted many to the crisis in 
American international relations, he proceeded to do pre- 
cisely what he criticized: “deepening commitment to a total- 
izing conception of cognitive and moral relativism that 
merely mirrors, and reverses, the sweeping categorical im- 
peratives characteristic of mainstream IR, making this ‘lib-
erationist’ agenda oddly reminiscent of the very project it
purports to reject” (p. 142). 

Crawford’s unique theoretical contribution rests with the 
first argument. Both realists and idealists often disagree
among themselves about the proper interpretations of specific
texts, but neither group has come to the conclusion that it has 
fundamentally misinterpreted the historical texts. Both 
schools would conclude that theoretical progress has been 
made. For realists and idealists within the IR discipline, 
progress is seen in trying to transcend the idealism-realism 
schism, in such exercises as neoliberal approaches to regime 
theory. In contrast, Crawford sees progress in IR discourse as
accepting the contested nature of political theory and of 
rejoicing in an “ambiguous and open-ended activity” (p. 34). 

Although the critique of the international relations disci- 
pline and the American scholarly community's hegemonic 
role in its perpetuation does not represent an original 
contribution, Crawford does make the case as convincingly as 
anyone. Even the most committed scholar who reads the 
book (and many may not due to the high price) has to be not 
only intellectually impressed by his argument but also chal- 
lenged by his assertions. 

The author does not present his own perspective of 
disciplinary inquiry. A sketch of what IR theory should entail 
is presented, but not in a single chapter or section of the text. 
IR theorists, like those in the British school, should be more 
amenable to the new ideas and interpretations found in social 
and political theory; they should accept multiple intellectual 
traditions as a healthy sign of scholarly activity; and they 
should not try to impose disciplinary unity and intellectual
hegemony. They should listen to the growing diverse voices 
of critical theorists, and they should acknowledge the need to 
study the postmodern age as an historical era, punctuated by 
a commitment to engaging in cultural analysis. That is
Crawford’s sketch of a pluralistic discipline. I look forward to 
the next book in which the contours are filled in. Not only 
must the underbrush be cleared, but also new trees must be 
planted. 


Counting the Public In: Presidents, Public Opinion, and 
Foreign Policy. By Douglas C. Foyle. New York: Columbia 
University Press, 1999. 368p. $52.00 cloth, $22.00 paper. 

William H. Baugh, University of Oregon 

In a landmark 1992 article (“Public Opinion and Foreign
Policy,” International Studies Quarterly 36 [December 1992]:
439-66), Ole Holsti surveyed and assessed recent research
that challenged what he termed the “Almond-Lippmann
consensus.” Later (in Public Opinion and American Foreign
Policy, 1996), more broadly but more descriptively, Holsti
labeled it the post-World War II consensus about public
opinion’s general nature, structure, and influence upon for-
eign policy. The consensus held that public opinion (1) is
volatile and therefore a dubious base for stable foreign
policies, (2) largely lacks structure and coherence, and (3) has
very limited effect upon foreign policy. Foyle brilliantly
pursues one of the four research paths Holsti (1992, 459)
identified as needed to build toward some new consensus:
“case studies employing archival sources that will provide the
most directly relevant evidence on the impact of public
opinion.”

Foyle studied under Holsti at Duke University, and the 
origins of this book lie in the author’s Ph.D. dissertation, now 
reworked and significantly and substantively expanded. He 
argues that the influence of public opinion on foreign policy 
outcomes is not direct but mediated by the interaction 
between (1) decision makers’ beliefs about the proper role of 
public opinion in foreign policy formulation and (2) the 
prevailing “decision context” in which policy must be made. 
Using qualitative content analyses of beliefs expressed in 
decision makers’ statements and writings, Foyle categorized 
those beliefs according to the normative question (whether 
they believe it is desirable for input from public opinion to 
influence foreign policy choices) and the practical question 
(whether they believe that public support is necessary). 

By dichotomizing and cross-tabulating these questions, 
Foyle creates four categories: delegate (desirable/necessary, 
e.g., Bill Clinton), executor (desirable/unnecessary, e.g.,
Jimmy Carter), pragmatist (undesirable/necessary, e.g.,
Madeleine Albright), and guardian (undesirable/unnecessary,
e.g., Ronald Reagan and, significantly in agreement with the 
old consensus, Walter Lippmann). Drawing primarily upon 
the private papers of Eisenhower and Dulles, and upon 
public source materials for others, Foyle analyzes and classi- 
fies the beliefs of all presidents from Truman through Clinton 
as well as four secretaries of state (Byrnes, Dulles, Rusk, and 
Albright). 

Decision context is first defined in classic Charles Her- 
mann (“crisis”) terms: level of threat, decision time available, 
and surprise or anticipation. Focusing on presidential deci- 
sion making on national security cases effectively limits the 
study to situations of high threat, which leaves us with only 
one face of Hermann’s crisis cube, and four situations: short 
versus long decision time, crossed with surprise versus antic- 
ipation. Cases examined in depth for the Eisenhower admin- 
istration include the Chinese offshore islands (short decision 
time, surprise: crisis); Dien Bien Phu (short, anticipated: 
reflexive); Sputnik (long, surprise: innovative); and the “New 
Look” defense, emphasizing nuclear firepower over troops 
(long, anticipated: deliberative). 

For other administrations, and more debatably, Foyle 
argues (p. 16) “the surprise/anticipation factor is not ex- 
pected to interact with beliefs in determining [a president’s] 
reaction to public opinion.” This reduces decision context to 
short versus long decision time, with decision situations 
labeled reflexive (or crisis) and deliberative. A crisis case and 
a deliberative case are examined for the administrations of 
Carter (Afghan War, the Panama Canal treaty), Reagan 
(Beirut bombing, Strategic Defense Initiative), Bush (Persian 
Gulf War, German reunification), and Clinton (Somalia, 
Bosnia). 

Foyle’s model development and case examination are 
considerably more nuanced than this summary suggests, and 
the predictive power of his relatively simple model is surpris- 
ingly strong. Presidents classify cleanly into one or another of 
the four belief orientations, and what they believe about the 
desirability and necessity of public opinion’s influence upon 
foreign policy matters greatly. The beliefs model correctly 
predicts (actually, postdicts) presidential behavior in essen- 
tially all the cases examined. This performance far exceeds 
that of classical realist or neorealist models, which predict 
that elites will either ignore or educate public opinion on 
national security policy, and Wilsonian liberal models, which 
argue that democratic norms require that public opinion 
should affect foreign policy formulation. Admitted limita- 
tions are that the cases all combine high threat with a 
national security focus, generally fall in periods when public 
approval of presidential performance was relatively high, and 
occur at least one year before the next presidential election.

Nonetheless, this substantial and significant work yields 
some intriguing and nonobvious findings. Use of force to 
divert public attention from other problems receives little 
support. Public opinion influence on the crisis cases is not 
dominated by either lack of influence or constraint on policy 
options. Lack of information about public opinion does not 
keep decision makers from integrating public preferences 
into the decision-making process, but they respond much 
more to long-term anticipated public views than to current 
opinion. President Reagan, whose guardian classification 
predicts that he will ignore public opinion in crises and 
attempt to lead it in deliberative situations, ignored polling data
even when his advisors followed it closely. Policymakers 
sometimes perceive constraints by public opinion even when 
the public is objectively vague or even supportive. Overall, 
although public opinion does not affect foreign policy very
directly, and its effect is conditioned by policymakers’ per- 
ceptions and sensitivity to public opinion, it influences some 
presidents very strongly, and individual presidential beliefs 
clearly matter enormously. 

Foyle’s work is highly lucid and accessible. It is well 
grounded in both the recent and mid-twentieth-century po-
litical science literature on public opinion and foreign policy,
as well as in the relevant cognitive literature. It is extensively
annotated (51 pages, plus a twenty-page bibliography). The
author generally displays appropriate and careful attention to
methodological issues, and he summarizes them in an excel-
lent appendix. All the qualitative content analyses of beliefs
were performed before Foyle examined his selected foreign 
policy cases. His concluding chapter offers interesting spec- 
ulation about lines for future research (at pp. 271-3) as well 
as implications for democratic theory, the domestic sources 
of international relations, the linkages between public opin- 
ion and foreign policy, and security policy after the Cold War. 
This is a major scholarly work that adds greatly to our 
knowledge of the linkages between general public opinion 
and foreign policy. We cannot hold it against Foyle that he 
raises a host of related questions deserving further research 
as we work toward that new consensus envisioned by Holsti.


Citizenship and Immigration in Post-War Britain. By Ran- 
dall Hansen. Oxford: Oxford University Press, 2000. 301p. 
$29.95. 


Andrew Geddes, University of Liverpool 


Britain’s postimperial and the shrinkage of the 
nation’s conceptual borders are the subject of Randall Han- 
sen’s analysis of immigration and citizenship in postwar 
Britain. In this well-researched, lucid, and at times pungent 
account, Hansen delves deeply into the archives to produce 
the definitive political history of Britain’s abandonment of a
postimperial migration regime and adoption
of restrictive, racialized immigration controls. 

The book is divided into two parts. The first analyzes the 
period between 1948 and 1962, when the absence of immi- 
gration controls and the continuation of an open migration 
regime are specified as the independent variables. Why for 
fourteen years after the British Nationality Act (1948) was 
the free entry of immigrant workers from former colonies 
allowed, even though evidence suggests that these immi- 
grants were unwanted for “racial” reasons? Hansen identifies 
four explanations: a lingering attachment to bipartisanship 
between the Conservative and Labour parties; the distribu-
tion of power within the ruling Conservative Party; continued
attachment to a Commonwealth ideal; and opposition from
within government, particularly from the antirestriction Colo-
nial Office against the prorestriction Ministry of Labour.

Hansen is particularly concerned to confront what he sees 
as partial and selective accounts of the 1948-62 period that 
focus on state racism. Hansen’s target is not so much the 
policymakers—whom he sees by and large as making the best
of a bad situation—but academic interpretations that de- 
nounce the impact of state racism on citizenship and immi- 
gration policy. Indeed, his careful and exhaustive analysis of 
archival sources demonstrates that racism reared its ugly 
head, but antirestriction voices were also resonant in White- 
hall and actually held sway until the late 1950s. Hansen is 
particularly acute in his observation that Britain’s search for
a postwar role led to a perception that the Commonwealth 
could be the route to a presence on the world stage, which 
mediated the position on immigration control. The abandon-
ment of this idea by the early 1960s also helps explain the
move toward restriction of immigration.

In the second part of the book Hansen explores the rapid 
introduction of immigration controls in the period from 1962 
onward. It is worth remembering that Britain introduced 
stringent immigration controls more than ten years before 
other European countries, and these controls were directed 
squarely at African, Asian, and Caribbean immigrants and 
were thus strongly racialized. The policy is commonly linked 
to the so-called race riots that occurred in Nottingham and 
Notting Hill in the late 1950s and to a broader eruption of 
populist antiimmigration sentiment with effects on both the 
Conservative and Labour parties. Hansen supplements this
by demonstrating the waning influence of the Colonial Office 
and prevalence of the prorestriction, technocratic arguments 
of the Ministry of Labour. 

The effectiveness of these controls has been facilitated by 
the absence of significant institutional impediments at the 
legislative, executive and judicial levels. The absence of 
constraint allowed British governments to enact Europe’s 
most stringent immigration controls. A constituency-to-legis- 
lature transmission belt has channelled antiimmigration sen- 
timent from the local level to Westminster. Majority support 
in the House of Commons has allowed both Labour and 
Conservative governments to pursue ever tighter controls 
through primary and barely scrutinized secondary legislation. 
Meanwhile, the relative weakness of the judiciary and the 
absence of a Bill of Rights left little scope for rights-based 
politics. 

This still holds good today, as demonstrated by the reluc- 
tance of either Conservative or Labour governments to 
countenance a common EU immigration and asylum policy 
that would empower the European Commission and Euro- 
pean Court of Justice over the heretofore relatively untram- 
melled executive authorities of the British state. The compe- 
tition of deterrence over asylum seeking between the Labour 
and Conservative parties before the 2001 general election is 
also evidence of the absence of constraints. Indeed, these 
kinds of factors led Gary Freeman (in Wayne Cornelius, 
Philip L. Martin, James F. Hollifield, et al., Controlling
Immigration: A Global Perspective, 1994, pp. 297-300) to refer
to Britain as Europe’s “deviant case” because of the attain- 
ment of restrictive policies with relatively few institutional 
constraints. Hansen confirms this view and also puts some
flesh on the bones of Britain’s metaphorical “awkwardness” 
about European integration, both in the 1950s (attachment to 
the Commonwealth) and more recently (suspicion of Euro- 
pean institutions). 

Underpinning this account is another, more implicit, ob- 
jective: a defense of Britain’s liberal race relations policies 
predicated, as they have been for more than 40 years, on 
strict immigration controls accompanied by antidiscrimina- 
tion legislation in pursuit of “good race relations.” The 
underlying thesis is captured well by a statement on page 16: 
“Politicians were liberal, the public illiberal.” Governments 
were driven by populist eruptions of antiimmigration senti- 
ment to enact stringent policies. Moreover, they did so in the 
shadow of the 1948 British Nationality Act, which established 
a path for the postwar politics of citizenship and immigration 
in Britain. 

Hansen (p. 135) rebuts a constructivist interpretation of 
these events: that immigration became a problem because 
social and political elites saw it as a problem (and saw it as a
problem of race). For Hansen, immigration was a social and 
political problem and one to which democratically elected 
governments needed to respond. The fact that they did so by 
acting responsibly and establishing race relations policies that 
combined controls with antidiscrimination legislation was to
their credit. 
But race relations, it would seem, could only be 
if the immigration door were closed. If further 
“unwanted” immigration were prevented, the integration of 
already settled immigrants could proceed. But there was a 
flaw in this reasoning. Family migration has continued since 
the 1960s, and asylum-seeking migration accelerated in the 
1990s. This further immigration to the heart of the 
liberal race relations paradigm. The response by successive 
governments has been ever more stringent methods to re- 
strict immigration. It seems that the liberalness of race 
relations rests on an inherently flawed assumption of zero 
immigration. The reality of continued flows appears to have 
produced illiberal responses to new and unwanted immigra- 
tion, all in the name of good race relations. The relation 
between an illiberal path to supposedly liberal objectives is at 
the heart of the British politics of citizenship and immigra- 
tion. 


Ambiguous Order: Military Forces in African States. By 
Herbert M. Howe. Boulder, CO: Lynne Rienner, 2001. 
291p. $55.00. 


Robert J. Griffiths, University of North Carolina at
Greensboro 


Africa’s conflict and instability have focused renewed atten- 
tion on the continent’s armed forces. Africa’s insecurity
derives from domestic and regional conflict that involves 
increasing numbers of combatants, the widespread availabil- 
ity of inexpensive weapons, continuing ethnic tensions, and 
the scramble to exploit natural resource wealth. According to 
Herbert Howe, these conflicts are exacerbated by such fac- 
tors as p tive self-defense, efforts to contain conflicts, 
and the lack of military professionalism. Howe’s timely book 
examines a key security dilemma that has emerged in Africa. 
The armed forces of many countries on the continent cannot 
provide security, and their behavior actually contributes to
insecurity.

The focus is a familiar one in the literature of civil-military 
relations: military professionalism or, more accurately, the 
lack of this characteristic among African militaries. Howe 
examines the way the lack of professionalism affects coun- 
tries’ ability to enhance national and regional security. This 
peer had Maer le iate because of wid d conflict 
and little likelihood of Western intervention. Howe discusses 
three recent approaches to improving security: regional 
intervention, the use of private security companies, and 
Western efforts to assist in the creation of more professional 
militaries as a step toward the establishment of a regional 

The roots of military unprofessionalism go back to the 
colonial era: “Colonial militaries do bear some of the respon- 
sibility for Africa’s presently inadequate levels of profession- 
alism” (p. 29). After independence, the propensity toward 
personal rule, which resulted in greater emphasis on loyalty 
to the leader than on military professionalism, as well as 
replication of ethnic recruitment patterns of the colonial era 
and corruption further eroded professionalism. African mil- 
itaries were also undermined by the creation of pa guards 
to insulate leaders from their own militaries. The result is 
military forces that are unable to deal effectively with Africa’s 
security challenges. 

The intervention of the West African states’ Economic 
Community Ceasefire Monitoring Group (ECOMOG) in 
both Liberia and Sierra Leone represented an effort to mount 
an African response to conflict. Howe concludes that ECO- 
MOG proved unable to end the Liberian conflict and instead 
prolonged it (pp. 164-5). Although he acknowledges that 
ECOMOG contributed to greater regional instability, he
credits its efforts in Liberia with trying to save lives and 
respond to humanitarian concerns. Nigeria, ECOMOG’s 
most powerful member, had significant interests at stake
aside from humanitarian concerns, however. A desire to play
the role of regional hegemon and deflect Western criticism of 
its own military government provided key motivations for 
intervention. Commercial opportunities for Nigerian officers
certainly became a reason for continued involvement. 

ECOMOG’s intervention in Sierra Leone was even less 
successful. Failure to learn from the Liberian experience, the 
lack of a counterinsurgency doctrine, Nigeria’s unilateral 
expansion of ECOMOG’s mandate from peacekeeping to 
peace enforcement, economic exploitation by Nigerian offic-
ers, and harsh treatment of civilians plagued that operation.
With the departure of ECOMOG in 1999, the UN-deployed 
United Nations Mission in Sierra Leone (UNAMSIL) fared 
even worse, and British troops were eventually dispatched to 
prevent Revolutionary United Front (RUF) rebels from 
taking Freetown. The seizure of UNAMSIL troops by RUF 
forces after the departure of ECOMOG and continuing 
instability in Sierra Leone certainly indicate failure of this 
regional effort. 

The difficulties of ECOMOG’s intervention will undoubt-
edly not be lost on South Africa, which is widely expected to
play the leading role in the creation of a regional force in 
southern Africa. Initially wary about its ability to contribute 
effectively to peacekeeping efforts and aware of the unease 
with which some neighbors regard it because of interventions 
during the apartheid era, South Africa is likely to continue to 
be cautious about regional intervention, especially in light of 
its 1998 experience in Lesotho.

Personal rule and the lack of military professionalism also 
have prompted the emergence of private security companies 
(PSCs) as another response to continuing security challenges. 
Mercenaries are certainly not new to Africa, but PSCs 
represent a significant change from the traditional image of 
the soldier of fortune. Executive Outcomes (EO), a South 
African firm, represents the prototype of the private security 
firm. Drawing on personnel from the former South African 
Defense Force, especially its counterinsurgency units, EO 
played a role in Angola in 1993 and Sierra Leone in 1995. In 
Angola, EO helped force National Union for the Total 
Independence of Angola (UNITA) to sign the Lusaka Pro-
tocol, although the war continues to grind on. In Sierra
Leone, EO provided a measure of stability that permitted 
elections in 1996. The newly elected Kabbah government 
signed a peace agreement with RUF that ultimately did not 
hold, and EO withdrew from the country when its contract 
ended in 1997. 

Although EO had military success in these two instances, 
the privatization of security raises several troubling ques- 
tions. These include a lack of political accountability and the 
links between PSCs and commercial interests. As Howe 
notes, the private security option also helps personal rulers 
avoid developing professionalism (p. 208). Ironically, African
insecurity created the circumstances for the emergence of the 
private security option, which works against the profession- 
alism of the armed forces, the achievement of which is 
essential to the enhancement of security. PSCs also encour- 
age resort to military solutions rather than negotiation, and 
Howe notes that it is possible that these firms might prolong
conflict to maximize their profit (p. 210). He concludes that
the supply of skilled personnel, the need for effective security, 
and the inability to monitor PSCs effectively make the need 
for military professionalism even more compelling (p. 229). 

The third option Howe examines combines external aid 
and an African response to conflict situations. The African 
Crisis Response Initiative (ACRI) is a U.S. effort to create 
more professional armed forces that could form the basis for 
regional peacekeeping. The United States supplies training
and equipment relevant to peacekeeping operations to 
armed forces from stable democratic countries. Howe cites 
several problems associated with this approach, including its 
emphasis on peacekeeping rather than peace enforcement, 
which is often required in Africa’s conflicts; the inclusion of 
governments whose militaries have been involved in regional 
conflicts and whose militaries have engaged in questionable 
behavior; and the lack of Nigerian and South African partic-
ipation. Some governments have expressed concern about
U.S. neocolonialism in deciding who will participate, and 
there is also concern that ACRI would primarily serve U.S. 
strategic interests rather than African security. Questions 
also involve command and control decisions as well as the 
willingness of African countries to contribute troops to a 
program that would increase professionalism and perhaps 
represent a challenge to their governments. Despite the 
drawbacks, Howe concludes that ACRI is a step in a positive 
direction: It addresses the issue of professionalism and 
incorporates several conceptual innovations in security assis- 
tance, including “double multilateralism,” interoperability, 
and prior planning, which may provide a template for future 
military assistance, whatever ACRI’s ultimate fate (p. 261). 

The strength of this volume is Howe’s thorough examina- 
tion of three recent approaches to insecurity in Africa. In the 
process, the author clearly illustrates the complexity of the 
security environment on the continent. He also highlights the 
economic motivations of both military personnel and PSCs 
and their role in conflict on the continent. Having examined
three approaches and found them all lacking to some degree,
Howe comes back to his original assertion that military profes-
sionalism is the key to Africa’s security dilemma. If there is a
weakness here, it is a lack of specifics on how to accomplish 
the goal of professionalism. Howe suggests the answer lies in 
greater democratic control, but he does not fully develop this 
theme. The conclusion contains a very brief case study of 
Nigeria that concentrates more on the obstacles to military 
reform than the process by which it might be achieved. The 
way in which accountability, transparency, and control can 
best be assured is left largely to speculation. Although the 
circumstances in South Africa are somewhat different, dis- 
cussion of its efforts to create a civilian defense secretariat 
and the efforts to establish effective parliamentary oversight 
might have been more instructive. 

Howe’s analysis goes to the heart of the security problem in 
Africa, the lack of democratic institutions that foster military 
professionalism. The author believes that democracy facili- 
tates military professionalism, and we are left with the 
conclusion that Africa needs something it is lacking, which is 
hardly a revelation. Does this say more about the state of the 
literature in the field or the state of African militaries? Howe
raises more questions than he answers, but this is valuable 
nonetheless. No single volume can address all the dimensions 
of the complex security environment in Africa and their 
implications, but Howe focuses attention on important prob- 
lems and makes a solid contribution to the literature on 
African security and civil-military relations. 


The Dynamics of Economic and Political Relations between 
Africa and Foreign Powers: A Study in International 
Relations. By Tukumbi Lumumba-Kasongo. Westport, 
CT: Praeger, 1999. 133p. $55.00.


Peter J. Schraeder, Loyola University Chicago 


This book adds to a growing literature on African interna- 
tional relations written by African scholars. The project was 
an outgrowth of Lumumba-Kasongo’s involvement in a con-
ference sponsored by the Council for the Development of
Social Science Research in Africa (CODESRIA), a leading 
pan-African research center located in Dakar, Senegal, that 
has sought to promote the scholarship of a new generation of 
Africans. 

Lumumba-Kasongo seeks to contribute to the dependen- 
cy-decolonization debate, that is, the degree to which the 
colonial era still influences contemporary African interna- 
tional relations (p. 10). According to the dependency school, 
the granting of legal independence that began in the 1950s 
did little to alter the constraining web of economic, political, 
military, and cultural ties that continue to bind African 
countries to the former colonial powers. This conceptualiza- 
tion of African international relations, often referred to as 
neocolonialism, is especially prominent in writings about the 
relationship between France and its former colonies, primar- 
ily due to policies designed to maintain what the French term 
their chasse gardée (exclusive hunting ground) in francoph- 
one Africa. Even in former colonies of European powers 
either too weak (e.g., Spain) or uninterested (e.g., Britain) to 
preserve privileged ties, the Cold War and superpower 
intervention ensured the gradual replacement of European
neocolonial relationships with a new set of ties dominated by 
Moscow and Washington. According to this perspective, 
direct colonial rule has been replaced by neocolonial rela- 
tionships that perpetuate external domination of African 
international relations. 

Scholars of the decolonization school argue that legal 
independence was but the first step in an evolutionary 
process that has permitted African leaders to assume greater 
control over their political and social systems. Although 
external influences were extremely powerful immediately 
after independence, layer upon layer of foreign control is 
slowly being peeled away. Noting that individual African 
countries can follow different pathways, decolonization pro- 
ponents argue that the most common pattern of political 
self-realization begins with legal independence, followed by 
efforts to assure national sovereignty in the military, eco- 
nomic, and cultural realms. “In this view, each layer of 
colonial influence is supported by the others, and as each is 
removed, it uncovers and exposes the next underlying one, 
rendering it vulnerable, untenable, and unnecessary,” ex- 
plains I. William Zartman, one of the most prominent 
decolonization scholars. “Thus, there is a natural progression 
to the removal of colonial influence: its speed can be varied 
by policy and effort, but the direction and evolution are 
inherent in the process and become extremely difficult to 
reverse” (see Zartman, “Europe and Africa: Decolonization 
or Dependency,” Foreign Affairs 54 [January 1976]: 326-7).

Lumumba-Kasongo’s book clearly falls within the depen- 
dency side of the debate. “Nearly four decades after most 
African countries earned nominal political independence, the 
social and economic conditions of the majority of people 
have not improved, (p. 12)” he states. “Regardless of signif- 
icant efforts by some states to change social and political 
conditions, and the adoption of the stabilization programs of 
the IMF [International Monetary Fund] and the SAPs [struc- 
tural adjustment programs] of the World Bank of the 1980s, 
all statistics from the national, international, and regional 
organizations indicate that the living conditions of the ma- 
jority of the African people are worsening (p. 12).” 

The reasons this is occurring—and how the situation may 
evolve in the future—are discussed in four relatively brief
chapters. Chapter 1 explores a variety of theoretical strands 
from the dependency school that emphasize Africa’s contin- 
ued marginalization within the international system. An 
emphasis is placed on the economic and political dimensions
of dependency, particularly the rising influence of interna- 
tional financial institutions such as the IMF and the World 
Bank. Two chapters then focus on specific case studies: 
Liberia’s relationship with the United States from the colo- 
nial era to the collapse of the Samuel K. Doe regime in 1990, 
and the Democratic Republic of the Congo’s (formerly 
Zaire) relationship with Belgium from the colonial era to the 
collapse of the regime of Mobutu Sese Seko in 1997. A final 
chapter assesses how the decline of the Cold War’s bipolar 
international system and subsequent emergence of an in- 
creasingly multipolar international system will affect African 
international relations. 

As is the case with all provocative books, one can quibble 
with a variety of issues. These range from simple errors 
associated with copyediting (e.g., it is stated on p. 74 that 
President Clinton “has not made any firm commitment to 
visit Africa,” but on p. 27 it is recognized that an extended 
presidential visit to six African countries occurred in 1998), to 
more substantial matters, such as the time frame (e.g., the 
analysis of Liberia does not cover events subsequent to 1990, 
but an important focus of the book is the post-Cold War era) 
and case study selection. For example, why focus on the more 
unique and limited colonial experience of the United States 
and Belgium as opposed to Great Britain and France, not to 
mention the more minor roles played by Germany, Italy, and 
Spain? Of greater interest, however, are three broader 
questions that emerge from Lumumba-Kasongo’s analysis. 

In keeping with dependency theory, a dominant theme of 
the book is Africa’s increased marginalization within the
international system, a process deemed to have gathered
strength during the post-Cold War era. Although most 
Africanists would agree that marginalization is a key opera- 
tive process in general, several would question whether one 
can broadly generalize across the universe of African cases 
(e.g., a resurgent South Africa under democratically elected 
administrations since 1994) or whether all outcomes of such 
marginalization are necessarily negative. To cite just two 
cases, corrupt dictatorships can no longer play the ideological 
card to the extent possible during the Cold War, when the 
superpowers competed for realms of influence, and African 
countries themselves are increasingly focused on African 
solutions for African problems, most notably a greater com- 
mitment to regionally based security efforts. Indeed, Lu- 
mumba-Kasongo notes that marginalization is “providing 
opportunities for various African social groupings to invent 
new options” (p. 29), but this counterargument to the basic 
thrust of his book is not developed in the case studies or in 
the concluding chapter. 

A second issue revolves around Lumumba-Kasongo’s in- 
terest in determining how multipolarization of the interna- 
tional system since the Cold War will affect the ties of 
dependency inherited from the colonial era. The dependency 
literature suggests a negative outcome, most notably because 
France, Germany, Japan, and the United States—the four 
major powers with extensive economic links to the conti- 
nent—are all capitalist systems. My work on francophone 
West Africa suggests, however, that multipolarization en- 
ables African elites to take advantage of the highly compet- 
itive international economic system (see Schraeder, “Cold 
War to Cold Peace: U.S.-French Competition in Francoph- 
one Africa,” Political Science Quarterly 115, no. 3 [2000]: 
395-419). In contrast to the expectations of dependency 
theory, the decline of the rigid bipolar system has afforded 
African elites more maneuverability, especially in negotiating 
access rights to their highly lucrative telecommunications, 
petroleum, and transportation industries. In short, they can 
play foreign firms against each other. Lumumba-Kasongo 
alludes to this when he notes that the Cold War’s end has
prompted “the search for new alignments in power relation- 
ships” (p. 113), but the overall thrust of his book seems to be 
that such realignments will do little to alter dependency ties. 

The most noteworthy omission from Lumumba-Kasongo’s 
analysis, and one that carries important implications for
dependency theory, is the tendency of African countries to 
intervene militarily in their neighbors’ affairs at a time when 
such intervention by the great powers is declining. A decade 
after the Cold War, Africa’s so-called new bloc of leaders— 
including Isaias Afwerki of Eritrea, Meles Zenawi of Ethio- 
pia, Yoweri Museveni of Uganda, and Paul Kagame of 
Rwanda—as well as other regional military powers, most 
notably Nigeria and South Africa, are becoming the new 
power brokers in African international relations. A prime 
example is the continuing conflict in Congo-Kinshasa, which
is one of Lumumba-Kasongo’s two case studies. Foreign 
observers now commonly refer to this as Africa’s “First 
World War” due to the introduction of ground troops by at 
least five countries (Angola, Namibia, Rwanda, Uganda, and 
Zimbabwe). Precisely for this reason one wishes that Lu- 
mumba-Kasongo had included the post-1997 period in Con- 
go-Kinshasa. 

In sharp contrast to great power conduct in the region 
during the 1960s, when the United States launched a series of 
very aggressive covert (and overt) operations to counter the 
spread of perceived “radical” influences, the Clinton admin- 
istration employed its Security Council veto to block the 
dispatch of any UN-sponsored peacekeeping force. This is a 
clear signal that the United States (and other great powers) 
are either unwilling or perceive themselves as unable to 
impose peace in this vast arena. It appears that peace can 
only emerge from the evolving regional military balance of 
power and the mutual political, military, and economic 
interests of regional actors, contrary to the dependency 
arguments of Lumumba-Kasongo and others, who stress the 
overriding influence of foreign powers on African interna- 
tional relations. 


Open States in the Global Economy: The Political Economy 
of Small-State Macroeconomic Management. By Jonathon 
Moses. New York: St. Martin’s, 2000. 259p. $69.95. 


William D. Coleman, McMaster University 


This well-written book makes an important contribution to 
the growing literature on how changes in the international 
economy, particularly globalization, affect domestic eco- 
nomic sovereignty. Moses argues, correctly, that the experi- 
ences of small states are particularly interesting in this 
regard. Long faced with problems of adjustment to interna- 
tional markets, they have had little ability to shape interna- 
tional markets and politics. Their example is instructive 
because they are more experienced in dealing with such 
phenomena as globalization than are larger states, such as the 
United States or Japan. Moreover, as economic globalization 
proceeds and the markets for goods and capital become more 
integrated, even larger states are finding their policy auton- 
omy to be shrinking. In this respect, understanding the 
experience of small states in maintaining policy autonomy 
will have increasingly broader relevance. 

The author situates his argument in relation to three 
schools of comparative political economy. The first, termed 
the “Left/labor” approach, looks at relative class power to
explain the electoral strength of leftist governments in Eu- 
rope. A strong labor movement is seen as crucial to the 
election of social democratic governments that have pursued 
full employment while building a comprehensive, universal
welfare state. The second school, the “small states” approach, 
complements the first but stresses the importance of corpo- 
ratist domestic institutional arrangements for securing the 
cooperation between classes needed to adjust successfully to 
international economic constraints. The third approach, the 
“politics matters” school, departs from the comparative case 
study method of the first two and uses cross-national, pooled 
time-series analysis. Proponents seek to demonstrate how 
small open-economy polities with social democratic govern- 
ments and corporatist labor union structures have main- 
tained autonomy in finding an acceptable tradeoff between 
fighting unemployment and rising inflation. 

In reviewing this literature in the light of oncoming glob-
alization, the author stresses two important but problematic
assumptions that have been made. First, the literature relies
almost exclusively on closed-economy assumptions about the 
effects and likely influence of macroeconomic policy instru- 
ments. Even in what is perhaps the most sophisticated study 
of this genre (Fritz Scharpf, Crisis and Choice in European 
Social Democracy, [1987] 1991), external factors enter the 
discussion on an ad hoc basis, and policy solutions to 
unemployment are interpreted in a closed-economy frame- 
work. Second, following from the first assumption, scholars 
assume that the various policy instruments for macroeco- 
nomic fine-tuning will have the same effects over time. This is 
consistent with the idea of a closed economy. In an open and 
globalizing economy, however, a given policy instrument, 
such as the exchange rate, may have different and even 
opposite effects as the extent of globalization changes. 

Moses seeks to correct these shortcomings by arguing that 
the increased capital mobility characteristic of globalization 
has led small, open-economy polities to use new policy 
instruments to maintain full employment. He adds that 
authorities have to give priority to external events in inter- 
preting policy choices but recognize that these choices are 
influenced by both domestic and external pressures. In 
making this argument, the author shows that these polities 
did not rely as heavily on deficit-financed, Keynesian-inspired 
economic policies as is commonly understood. In fact, he 
suggests, these policies were seldom needed. 

Central to the development of this argument is a theoret- 
ical framework for understanding policy choices in small, 
open states. This framework is developed in two steps. First,
small open economies face two linked balances: maintenance 
of domestic output at the full employment level and a balance 
of payments with the outside world. Authorities must design 
economic policies that take both into account but give 
priority to the external one. For example, a small corporatist 
state needs to keep its export sectors competitive in the 
international economy. To do so, it may need to balance its 
budget and ensure that prices in the sheltered or protected 
sectors do not undermine the productivity and competitive- 
ness of the export sectors. One approach described by the 
author involves a corporatist-bargained incomes policy that 
keeps wages down in the sheltered sector and targets sectoral 
and regional industrial policies to address pockets of unem-
ployment.

The author prepares for the application of this revised 
model by introducing a framework in chapter 3 that distin- 
guishes different types of international regimes in terms of 
their relative degree of globalization. Based on both traded 
goods and capital flows, the postwar period is divided into 
four regimes: 1948-58 (high national autonomy), 1958-71 
(the Bretton Woods regime), 1971-86 (flexible adjustment), 
and 1986 to present (approaching laissez-faire). The author
argues that this challenge to the established theories requires 
a close reading of the historical record of policy change. For
this reason, he opts for an informed case study of policy 
change in Norway under each regime. Norway was chosen 
because it best fits the orthodox theories: Politics have been 
dominated by the social democratic Labor Party; corporatism 
is well entrenched; the country has consistently maintained 
full employment, and in not joining the European Union, 
Norway has sought to maintain maximum policy autonomy. 
Norwegian policy is followed through the four international 
eras in chapters 4-7, and the book ends with a strong 
concluding chapter. 

The empirical analysis is very convincing. Moses shows that 
Norway rarely followed the expected Keynesian policies. He 
also demonstrates the consistent concern of political author- 
ities with the external balance, whether in terms of trade (the 
current account) or investment (the capital account). The 
analysis reveals that Norway relied much more on sectoral 
industrial policy and regional policy in addressing full em-
ployment than on a Keynesian approach. Furthermore, the
decision to use particular macroeconomic policy instruments 
(monetary, fiscal, income) changed as the international econ- 
omy liberalized. Under the two earlier and more closed 
international regimes, for example, fiscal and monetary pol-
icy instruments could be coordinated; in the two later re-
gimes, these policies often worked at cross-purposes, partic-
ularly as movements of capital became liberalized.

The convincing analytic narrative of the case study shows 
the strength of the theoretical innovations introduced by the 
author. In his conclusion, he notes the problems of extrapo- 
lating to broad generalizations based on a single case study. 
This methodological caution is welcome. Nevertheless, the 
next step is to apply the theory to other small, European 
social democratic states. Perhaps an even more interesting 
and challenging application would be to such polities as 
Canada and Australia, where leftist parties have been less 
dominant than in parts of Europe and corporatist structures 
less well entrenched if at all. Certainly, the commitment to 
Keynesianism in these countries has been spotty at best,
which might be explained by the model proposed here, that 
is, policymakers give priority to the external accounts but also 
look carefully at industrial and regional policies as a means to 
fight unemployment without undermining the competitive- 
ness of export sectors. The relative strength of corporatism 
may once again be shown to be crucial for explaining why 
some states, such as Canada, have had less success in using 
these kinds of policy tradeoffs to keep unemployment low. 


Decisionmaking in a Glass House: Mass Media, Public
Opinion, and American and European Foreign Policy in
the 21st Century. Edited by Brigitte L. Nacos, Robert Y. 
Shapiro, and Pierangelo Isernia. Lanham, MD: Rowman & 
Littlefield, 2000. 361p. $79.00 cloth, $26.95 paper. 


Marc A. Genest, University of Rhode Island


This edited volume is an ambitious and engaging attempt to 
fill a gaping hole in political science literature regarding the 
effect of mass media and public opinion on American and 
European foreign policymaking in the post-Cold War era. By 
stressing comparative studies of public opinion and foreign 
policy, the work explores the role that will be played by the 
global telecommunications revolution in international policy- 
making during the twenty-first century. These essays by 
leading European and American scholars offer few concrete 
answers but provide important insights and raise provocative 
questions that will prove essential as a foundation upon 
which other scholars will base their research. The editors 
have assembled a diverse collection that constitutes the first
serious endeavor to investigate systematically the complex 
relationship among the media, public opinion, and foreign 
policymaking in a globally integrated information age. 

The work is divided into three sections. Part 1 focuses on 
whether the media has a new, expanded role in twenty-first 
century world politics. The primary question raised in this 
section is whether the confluence of globalization and the 
demise of the Cold War have enhanced the power of the 
media as an independent transnational actor in influencing 
international policymaking. Two notable essays, one by Mar-
tin Shaw (chap. 3) and the other by Robert Entman (chap. 2),
posit that the media have gained power at the expense of the 
state. Shaw asserts that the end of the Cold War facilitated 
the creation of a “global media space” (p. 36) in which 
numerous actors are now able to compete with governments 
(especially the United States) in framing issues. This provoc- 
ative theory holds that advances in communication technol- 
ogy have widened the public arena, which allows journalists, 
interest groups, and other actors to participate in informing
and shaping public opinion. Unfortunately, Shaw and Ent- 
man fail to produce compelling evidence to support their 
argument, and later articles seem partially to refute their 
contention. 

The articles in the second and third sections tend to cast 
doubt on whether the demise of the Cold War has enhanced 
the media’s ability to frame foreign policy news at the 
expense of government policymakers. A major theme of part 
2, which investigates public and elite attitudes after the Cold 
War, is that policymakers often fail to understand public
attitudes. One might conclude that, despite all the techno- 
logical advances of the last decade, policymakers remain 
confused over or resistant to public preferences. 

In part 3, “The Public Opinion—Foreign Policy Linkage” 
(pp. 223-304), both Natalie La Balme (chap. 16) and 
Pierangelo Isernia (chap. 17), using studies involving
Italy and France, respectively, find that the end of the Cold 
War has changed the opinion-policy relationship, but the 
linkages between the two are less transparent and more 
tenuous in Europe. These contributors demonstrate that the 
global media space discussed in the first section of the book 
may not be global at all and is applicable mainly to the United 
States. 

Like far too many political science texts, this volume is
filled with technical jargon that is familiar to only a small 
number of specialists. The problem is somewhat unavoidable, 
but the editors could have incorporated some simple peda- 
gogical tools to help expand the audience for this important 
work. For example, a glossary of key terms would be of 
enormous assistance. Moreover, the editors’ introductory
chapter does a commendable job of presenting a useful 
summary, but it might have offered a brief overview of past 
scholarship on the subject. The significant intellectual contri-
butions of this volume may not be very accessible to a wider
group of scholars and policymakers. 

These essays ultimately raise more questions than they 
answer. For example, why does the media seem to play a 
more central role in U.S. foreign policy processes than in 
those of other democratic societies? Also, does political 
culture, religion, or public access to new technology play a 
central part in contributing to a “global public sphere” (p. 
87)? Furthermore, the editors readily admit that this volume 
largely neglects the role of the Internet in the debate about 
the interrelationship among the media, public opinion, and 
the foreign policy process. Nevertheless, Decisionmaking in a 
Glass House is a vital contribution to our understanding of 
this complex and evolving relationship in the twenty-first 
century. 


A Rising Middle Power?: German Foreign Policy in Trans-
formation, 1989-1999. By Max Otte, with Jorgen Grewe.
New York: St. Martin’s, 2000. 324p. $45.00. 


Thomas Berger, Johns Hopkins University 


Over the past decade a near consensus has emerged among 
academic analysts of German foreign and security policies 
that the tradition of realpolitik is dead, at least in the Federal
Republic. [...] Simon Reich (The German [...]) and this
reviewer (Cultures of Antimilitarism, 1998) draw on the
constructivist literature in international relations
to argue that post-1945 German political culture has made
classical balance-of-power thinking a taboo in German polit-
ical discourse. Others, using neoliberal institutionalist argu-
ments, maintain that growing complex interdependence and
the proliferation of international institutions have impelled
Germany to adopt a cooperative approach in its relations
with the outside world (see, e.g., Jeffrey J. Anderson and
John B. Goodman, “Mars or Minerva? A United Germany in
a Post-Cold War Europe,” in Robert O. Keohane, Joseph S.
Nye, and Stanley Hoffmann, eds., After the Cold War: Interna-
tional Institutions and State Strategies in Europe, 1989-1991,
1993). Regardless of their theoretical orientation, a growing
number of observers agree that the Federal Republic is
constitutionally inclined toward a low-key approach to for-
eign affairs and favors diplomacy and economics to achieve
its national objectives rather than more traditional military 
instruments. 

Not Max Otte. In this bold new book he challenges this 
nascent conventional wisdom and argues that Germany very 
much fits the realist paradigm of a country whose foreign 
policy is driven by considerations of power and national
interest. In contrast to other realist scholars, such as Kenneth
Waltz and John Mearsheimer (see, e.g., John Mearsheimer,
“Back to the Future,” International Security 15 [Summer
1990]: 5-56), Otte argues that Germany will not necessarily
revert to an [...] or more confrontational set of
foreign policies. It is the strongest power in Western Europe 
in terms of population and economic and military might, but 
it is only a middle power in the larger scheme of things. Even 
after reunification it remains far weaker than the United 
States, Russia, the People’s Republic of China, or a combi- 
nation of other European military powers. Moreover, given 
its strategic environment and underlying demographic and
economic realities, the Federal Republic is arguably past its
zenith. Unlike the first half of the twentieth century, today 
Germany cannot aspire to a role of global or even regional 
hegemon. Instead, Otte contends, German leaders are con-
strained to try to preserve the gains achieved in recent
decades, which make the Federal Republic a typical status 
quo power. 

This means that Germany will continue to rely heavily on
multilateral structures, such as NATO and the European
Union. At the same time, however, it remains sensitive to the
balance of power and will have few qualms about the use of
military force to maintain these multilateral structures. From
the standpoint of international relations theory, this creates a
paradox: The Federal Republic pursues essentially a liberal
foreign policy because it is the “realist” interest of a
middle power Germany to do so (p. 90).

To substantiate his argument Otte offers a detailed analysis 
of different dimensions of the Federal Republic’s geostrate- 
gic situation (chap. 3) as well as a detailed and often 
insightful overview of the evolution of German foreign and 
security policy from 1989 to 1999 (chaps. 4-7). Especially 
interesting is the discussion of Germany's growing military 
involvement in former Yugoslavia. The Schroeder govern-
ment’s decision to support the NATO campaign in Kosovo
provides particularly compelling evidence for Otte’s thesis
that Germany is becoming a “normal power” that does not 
eschew the use of force in the pursuit of its national interests. 

Despite its many merits, Otte’s case is not entirely convinc- 
ing. The very notion of a middle power is rather reminiscent 
of what is sometimes called a “Goldilocks theory” in the 
social sciences—not too hot, not too cold, but just right. 
Germany is too large to act like a small power, but it is too 
weak to behave like a great power. Consequently, it behaves 
like a middle power, that is, the kind of cautious status quo 
power that it happens to be—just right. Since Otte does not 
operationalize the term and offers no comparisons with other 
examples of middle powers, past or present, it is hard to judge 
exactly what makes Germany a middle power and what would 
cause it to fall into another category. Germany is a middle 
power, and middle powers act the way Germany does. The 
argument risks becoming tautological. 

Moreover, despite the abundant use of realist language 
and imagery, Otte is hardly consistent in his use of realist 
categories of analysis. Throughout the book he pays a great 
deal of attention to domestic political variables in ways that 
suggest he believes they have a significant causal effect on 
German foreign policy. Moreover, Otte often stresses the 
importance of nontraditional (i.e., nonrealist) norms and
values that shape German elite and public views of the 
international environment (see, e.g., pp. 129, 158). At one 
point he even makes the very unrealist claim that power is a 
social construct (p. 76)! In the end, one cannot escape the 
conclusion that Otte has basically realist instincts and be- 
lieves that power, especially military power, counts. Yet, he 
seems far too deeply immersed in the realities of German and 
European politics and international relations to believe that 
crude material forces mechanically determine policies, as 
some realists, especially of the neorealist persuasion, main-
tain. He evidently believes that ideas play a critical role in
shaping actor preferences and perceptions. At the same time 
Otte recognizes that these ideas are sensitive to realities that 
lie outside the cultural system per se, including critically the 
distribution of military resources. He thus seems as close to 
the so-called thin constructivism of Peter Katzenstein, 
Thomas Risse, or, for that matter, this reviewer as he is to 
realism. 


These criticisms aside, Max Otte is to be complimented on
writing a provocative yet informative study. It is a welcome 
contribution to the literature on German foreign 
policy and will be of use both in the classroom and to analysts 
of German politics and European regional affairs in general. 
It is certain to stir further debate on this complex and 
important topic. 


Human Rights: New Perspectives, New Realities. Edited by 
Adamantia Pollis and Peter Schwab. Boulder, CO: Lynne
Rienner, 2000. 259p. $55.00 cloth, $22.00 paper. 

The Power of Human Rights: International Norms and
Domestic Change. Edited by Thomas Risse, Stephen C.
Ropp, and Kathryn Sikkink. New York: Cambridge Uni-
versity Press, 1999. 318p. $59.95 cloth, $22.95 paper.

Globalisation, Human Rights, and Labour Law in Pacific
Asia. By Anthony Woodiwiss. New York: Cambridge Uni- 
versity Press, 1998. 316p. $64.95 cloth, $24.95 paper. 


David P. Forsythe, University of Nebraska-Lincoln 


Over the past fifty years human rights has emerged as a major 
area of study in the fields of international relations and
foreign policy. There is much international law on the
subject. International organizations increasingly link many of
their programs to these legal norms. Almost all states now 
pay lip service to the desirability of human rights, and almost 
all states now have a bureaucratic unit that deals with human 
rights in international perspective. Much serious diplomacy, 
sometimes linked to military force, is directed to the protec- 
tion of internationally recognized human rights. Agents of 
transnational civil society are numerous and highly active in 
this domain. 

The Power of Human Rights presents a constructivist argu- 
ment about one slice of this reality, Western-based transna- 
tional pressure on non-Western states that violate rights of 
personal integrity (summary execution, forced disappear- 
ance, torture, and so on). The authors do not address 
transnational efforts that focus on Western violations of 
human rights, such as racism in the United States or mis-
treatment of indigenous people in Australia. The book
argues for the growing importance of international human 
rights norms and the transnational advocacy networks that
operate in relation to them. The authors believe that the 
discourse about human rights generally produces beneficial 
change over time regardless of culture, economics, and other 
particularities of time and place. This is true, they say, 
because the human rights discourse, which is increasing in 
scope and saliency, affects how political actors define and 
pursue their identities and interests. 

One strength of The Power of Human Rights is that it tries 
to specify the process by which social learning occurs so that 
human rights ideas become institutionalized in state practice.
Another strength is the transnational focus, since the book
tries to integrate international and domestic political factors. 
Yet another commendable feature is the linking of social 
science theory with a list of practical policy recommendations 
for human rights actors. 

In a nutshell, The Power of Human Rights argues that 
transnational networks in behalf of human rights alert West- 
ern states to gross violations of personal integrity in non- 
Western states. This network is made up of both interna- 
tional (e.g., nongovernmental organizations) and national 
(e.g., domestic political opposition) elements. The target 
states, facing both domestic and foreign criticism, are led first 
to accept the validity of human rights norms for defensive/ 
instrumental reasons, then to engage in moral or substantive 
argument regarding the appropriate application of the 
norms, and finally to national institutionalization. Thus, 
unlike other interpretations of the evolution of human rights 
norms, such as the unacceptability of slavery, it is not 
traditional coercive power (e.g., the British navy) that pri- 
marily accounts for change toward more principled behavior. 
It is the moral power of the human rights ideas themselves 
that greatly accounts for such change. Six core chapters 
present case studies of various countries, pairs of countries, 
or regions. 

The Power of Human Rights is a sophisticated and impor- 
tant book, no doubt seminal, that deserves close scrutiny. The 
following questions are merely illustrative of many concerns 
raised by the creativity of the authors. Does the book take the 
easy case, that of rights of personal integrity, and would other
rights, such as democratic rights or welfare rights, show a 
much less optimistic interpretation of linear movement in 
favor of national acceptance of human rights norms? Does 
social learning really occur as stipulated, since in a number of 
cases the target elites left office before desirable change 
occurred (did targeted elites learn something new, or did 
more sympathetic elites replace them)? Are traditional notions
of power and national interest so unimportant as this
account allows, particularly in the light of U.S. decisions 
about action or inaction in places such as Rwanda; was it the
moral power of human rights ideas that significantly affected 
key behavior, or traditional notions of self-interest and 
power, avoiding sacrifice for others, and so on? Are culture 
and economics so unimportant, as pictured, given that the 
domestic opposition to repression is very weak in precisely 
those countries that lack a cultural tradition of human rights, 
and given that the success of human rights networks, partic- 
ularly in Eastern Europe and the Soviet Union after commu- 
nist regimes there, has been discredited by economic failure? 
Is defensive nationalism usually so weak, as pictured, despite 
the fact that brutal dictators such as Pinochet in Chile (or 
Fujimori in Peru, or Milosevic in Yugoslavia) manipulated 
nationalism effectively to reject foreign criticism of human 
rights performance for a considerable time? 

Human Rights is also concerned with ideas, but it is much 
more oriented to national and regional cultures. Pollis and 
Schwab have long raised questions about whether the inter- 
national human rights discourse is valid as a universal public 
good, or whether it reflects a misguided Western cultural 
imperialism. Concerned by the use of the notion of cultural 
relativism by East Asian authoritarians to justify their abuse 
of individuals, Pollis and Schwab now revisit the subject of
international human rights. The editors still do not fully 
endorse international human rights as they now exist in 
international law, but they inquire into whether some revi- 
sion of that corpus of norms might better accommodate some 
non-Western cultural values. They believe that the West has 
taken individualism too far as a core value, and the notion of 
community not far enough. Unlike Risse et al., they do not 
want to forge ahead in seeking protection of international 
norms; rather, they want to pause and rethink the norms 
themselves (but there is agreement on the validity of rights of 
personal integrity). 

The eight chapters by various authors in Human Rights do 
not always stick to this central theme, and the concluding 
chapter by the editors does not pull things together into a 
satisfactory and fully developed synthesis. This leaves the 
reader mainly with a very stimulating chapter by Pollis to 
start the volume. She argues that “the universalists ignore 
community” (p. 18) and forget that “national interest . . . can 
transcend and preempt individual freedoms” (p. 16). She also 
thinks that non-Western cultures have derived “spheres of 
personal autonomy” (p. 18) that ought to be acknowledged in 
a revised international bill of rights. She seeks “a synthesis of 
fundamental shared elements of... human rights doctrines 
incorporating community and individual autonomy” (p. 22). 
But she argues that, beyond common agreement on rights of 
personal integrity, “human dignity entails different behavior 
and social expectations in Islam and Christianity . . . treating 
persons with respect is culturally specific and cannot be 
generalized” (p. 23).

Although Pollis would no doubt deny it, her view seems 
very similar to that of Singapore’s ruling elites: There is 
universal agreement on a very small core of basic human 
rights norms (e.g., no genocide or political murder); most of 
the rest reflects too much individualism taken strictly from 
Western experience. Pollis, while praising international crim- 
inal courts and the arrest of Pinochet regarding violations of 
personal integrity, mentions the negative effects of economic 
globalization, particularly with respect to its effects on labor. 
She is not altogether keen on the civil right to private 
property. 

This latter point is the central concern of Globalisation.
Woodiwiss, whose ungraceful style combines the jargon of 
both Marxist and sociological theory, explicitly states that his 
vision for a revised international code of human rights 
approximates a new and better Singapore. What drives him is
his belief that labor in Singapore is in much better condition 
than labor in putatively more “free” countries, such as the 
Philippines. 

Woodiwiss is a type of Gramscian or Coxian Marxist who 
combines the importance of ideas with material conditions 
and who favors a transnational analysis that stresses econom- 
ics and classes. He seems to argue for the merits of enlight- 
ened despotism: “a mode of governance where, whilst social 
relations remain distinctly hierarchical, the content of benev- 
olence is democratically decided and its delivery legally 
enforced” (p. 3). Not unlike Tito’s defense of Yugoslav 
communism, Woodiwiss accepts authoritarianism or “patri- 
archalist discourses” in the overtly political realm, but he 
looks to a more democratic labor movement, as in Japan, for 
“industrial governance” (p. 13). “Industrial justice” and 
“enforceable benevolence” are synonyms. Compared to such 
concerns, he is not much interested in participatory rights in 
the governmental realm. Like it or not, he says, most Asian 
states are too patriarchal to adopt individualized notions of 
political freedom taken from the West. He does not pay 
much attention to political freedom in Japan, South Korea, 
and Taiwan. 

It is clear Woodiwiss does not care for American hege- 
mony and its effects in places like the Philippines. He disdains 
“American ideological bankruptcy” (p. 81). The Philippines 
symbolizes American-inspired “mendicant patriarchalism” 
(p. 87). But if one believes that “capitalism as a form of 
domestic, international and transnational economic organi- 
zation is intrinsically subversive of respect for human rights” 
(p. 247), then one can hardly expect much good to come out 
of a West led by the United States. He explicitly attacks such 
scholars as Jack Donnelly and Rhoda Howard, who find the 
current International Bill of Rights to be an appropriate 
means to achieve human dignity. 

Woodiwiss, like Pollis, manifests valid concerns about the 
fate of labor under global capitalism, but he is unpersuasive. 
Not only does he incorrectly state that the highly subjective 
index of human rights performance developed by Charles 
Humana is “the most widely used of human rights indices” (p. 
4), but also, finding little merit in the American model and its 
variants that stress individual liberty, he nevertheless says 
that his analysis is, after all, a “thought experiment” (p. 17).
Everything he writes, he says, is a projection that “might” or
possibly could lead to a synthesis in the discourses between 
Asian patriarchy and Western-style democratic-capitalism. 
He wants to levy biting and sweeping criticism but then say it 
is all just an academic exercise—a matter of thinking out 
loud. It seems he wants to have his critical cake but also 
partake of the personal freedom it brings. Even in his 
account, one scenario for East Asia “would mean the end of 
patriarchalism but by its own hand” (p. 263). Genuine reform 
in favor of broad political and civil rights is neither ruled out 
nor, more significantly, condemned as a horrible develop- 
ment. Cultures, he says, after all do change. 

Among these books, The Power of Human Rights has the 
most complete and thought-provoking theory of political 
behavior in relation to human rights. It is a prime example of 
why constructivist analysis, with its emphasis on ideas and 
intersubjective understanding, is so much discussed these
days by students of international relations. The other two 
books, and related scholarship by such as Abdullahi A. 
An-Na’im, try to rethink many of the human rights norms 
themselves in the light of cultural differences in the world. 
Human Rights remains underdeveloped, however, as an ar- 
gument for synthesis between the cultural norms of the West 
and the rest. Globalisation, for all the erudition of the author, 
is filled with both too much obscurantist jargon and too many
debatable propositions to be broadly persuasive. 


Managing Ethnic Conflict in Africa: Pressures and Incentives
for Cooperation. By Donald Rothchild. Washington,
DC: Brookings Institution Press, 1997. 343p. $49.95 cloth, 
$19.95 paper. 

Harvey Glickman, Haverford College 

Casual and even seasoned observers of reported events in
Africa can be excused if they believe that all conflict there
comes down to ethnic differences. The Rwanda genocide
threatens to color the interpretation of politics in all African
states. When race and religion are added to potentially 
combustible, socially constructed identities, a basic question 
becomes how to explain nonviolent and tractable conflict 
over the years. Recent investigators have noted that the 
influence of state-centered elites in a number of African 
countries is waning and that some states are failing (William 
Zartman) or fragmenting (William Reno). This study lays out 
the analytical basis of a balanced assessment of the elements 
of integration as well as disintegration. It shows how external 
mediators can contribute to reducing internal conflict and to 
peace-building. As such, it will be cited in current debates on 
prospects for internal security and democratic politics in
African states and in the literature on political conflict
management.

The author has been a central figure for many years in
research and analysis of African politics, African interna- 
tional relations, American policy toward Africa, and most 
particularly the politics of ethnocultural conflict in Africa. In
previous works he has shown that ethnically based conflicts 
have been and can be managed by African states, that politics 
in this realm can assume a regularized pattern and become a 
sustainable, pluralistic form of state-society relations. He 
calls this “Pattern 1,” which focuses “on direct bargaining
relationships between state and societal elites” (p. 9). (“Pat- 
tern 2” involves suprastate bargaining, in which external 
mediation intervenes.) 

Regarding domestic conflict, Rothchild summarizes and 
schematizes previous work. He devotes the bulk of this study 
to the breakdown of domestic patterns that require greater 
investments in internal bargaining, which turns into an in- 
crease in negotiations, and then often into international 
intervention. Central to this approach is a view of institutions 
as structures of incentives to bargaining; much of ethnic 
politics can be seen as bargaining strategies within regimes of
patterns of restrictions and resources. There is much


tions, state softness, perceptions, regime strategies, and pol- 
icy outcomes. 

The conclusions offered are modest and restrained. Within 
the regularized patterns of intrastate relations, different types 
of regimes can produce conflict management systems of 
varying effectiveness, largely via patterns of ethnic bargain- 
ing. Despite the longer term effectiveness of constitutional- 
ism and institutional compromises, hegemonic impositions 
retain rationality in the short and medium term, but they 
invite outside mediation of seemingly intractable disputes. 
The conditions under which regime forms and resource 
allocation compromises hold up vary and even decay over 
time. Reciprocity in interest exchanges is altered as the 
environment changes. Rothchild is clear that effective conflict 
management does not necessarily help economic development
or capacity-building. We learn what can happen, what
has happened, and what has the potential for some durability.
When internal management fails, external mediation may 
help. 


In external mediator situations, several coercive and noncoercive
incentive strategies as well as combinations of these
apply, but with mixed results. Results are gauged both in 
terms of the effectiveness measured in this study and in terms 
of anecdotal evidence of the management of conflict in 
African states, since the research on outsiders’ ability to 
reduce disorder remains modest but important. A unique 
contribution is the author’s yoking of the techniques of
conflict resolution to its stages and to its strategic interac- 
tions. 

The analytical thrust is reflected in the attention paid to 
internal versus international situations: There are 88 pages 
on domestic politics versus 190 pages on external mediation. 
The bulk of the discussion is about third-party mediation in 
four cases of violent disputes—Angola, Rhodesia, South 
Africa, and Sudan. Intervention was required because inter- 
nal elite-society relations broke down. The four cases vary in 
scope. For example, in South Africa the aim was regime 
transformation to avoid civil war; in Sudan the aim was to 
repair a breakdown in coalition efforts during a brief period 
in the 1970s, when that country’s civil war seemed resolvable. 

Each case is reviewed and reanalyzed with the benefit of 
the author’s original interviews and documentation. Experts 
in diplomacy will be fascinated. Others may wonder whether 
contexts and time periods reduce the analytical sharpness of 
case comparability. In Angola, for instance, the sea change in 
the relationship of the superpowers would seem to over- 
whelm and restructure the intricate strategic maneuvers to 
end civil war and extricate foreign armed forces in the 1990s, 
which Rothchild recounts in some detail. In Rhodesia, Roth- 
child does emphasize the special status of Britain and that 
country’s interpretation of exercising its responsibilities: “Determined
mediators need not passively wait for a ripe moment
to emerge” (p. 148). One might question (as intimated
by the author) whether Sudan is an example of success, since 
the mediation lasted ten years but is only an incident in 
almost a half-century of civil war. In South Africa, amid the 
enormous influence of external events, third-party mediation 
seems largely to illustrate the importance of legitimacy 
incentives in aiding internal compromise, which tends to be 
overlooked in the literature on interethnic negotiations—a 
most suggestive contribution by Rothchild. 

The study is in the tradition of applied political science or 
generalized public policy, and it reflects their limits as well as 
possibilities. First, the author informally assumes the utility of 
rational choice theory; he frames his discussion of bargaining 
among elites as altering preferences, mixing incentives, and 
leading toward cooperative bargaining behavior. The propositions
advanced are suggestive but may be more case sensitive
than general. When they are general they are
usually conditional. Negotiated solutions depend on context 
and leverage as well as the mediational will and skill of 
negotiators. Second, the management of conflict moves in 
stages, in which roughly similar patterns of activity can be 
charted. Third, pressures and incentives by third parties to 
advance “ripeness” for conflict resolution can help, but the 
applicability of the “mutually hurting stalemate” and the 
“ripeness” tests do not seem appreciably advanced. Some 
mediators can force the pace and the direction, and some- 
times ripeness turns to rotten, as agreements fall to pieces. 
All this is illustrated here. 

Rothchild accepts the reality of ethnopolitical conflict, but 
his separation of internal from international processes, while 
evocative, may create some analytical ambiguities. The struc- 
turing of elite bargains internally usually reflects an interna-
tional constraint and influence in African states, due to their 
dependency, or the Cold War, or structural adjustment 
programs. The combinations of incentives and decision rules 
within the several regime types can be shaped by extraversion 
and state instability that may be more invariant than noted. 
The author makes the distribution of group benefits central 
to comprehending the management of conflict, but the ethnic 
character of the groups gets melded into class and intrastate 
regional conflicts in the course of the discussion. 

Given “frail institutions, heady expectations and grinding 
economic scarcity” (p. 20), it is remarkable that a number of 
African states have managed to achieve some semblance of 
stable reciprocities. Rothchild lays out how some regimes can 
structure incentives, largely through resource allocation and 
political coalitions. Several examples of constitutional arrangements
and institutional compromises during certain
periods are noted, but it is clear that the logic of hegemonic
and elite power sharing regimes is less durable than polyarchal
regimes. But polyarchal regimes are an elusive category
in African politics—South Africa, Botswana, perhaps
Senegal—and the in-built instabilities of all other regime 
types in Africa invite external mediators when the costs of 
intergroup violence grow excessive. The experiences de- 
scribed and systematized in this volume will be useful to 
observers and practitioners alike. 


Democracy and International Relations: Critical Theories/ 
Problematic Practices. Edited by Hazel Smith. New York:
St. Martin’s, 2000. 278p. $69.95. 

Critical Theory and World Politics. Edited by Richard Wyn 
Jones. Boulder, CO: Lynne Rienner, 2001. 259p. $53.00. 


David G. Becker, Dartmouth College 


The end of the Cold War and a proliferation of nonstate 
actors have transformed the structure of the international 
system. Many of the new players—multinational corporations 
(MNCs) are the most obvious but far from the only exam- 
ples—are pursuing interests that have been directly defined 
in a transnational plane and cannot be reduced to the 
interests of states or national societies. The human collectiv- 
ities that stand behind transnational nonstate institutions 
thus have to be conceptualized as international elites or 
classes, even though elements of them are also present within 
national societies. Moreover, these groups exercise real 
power in international affairs and do so independently of 
states; we know this because the actions of institutions they 
control, as with MNC decisions about investment and pro- 
duction siting, affect states’ interests and constrain their 
actions. 

We are arguably witnessing the early development of 
something new: an increasingly social international order that 
exhibits forms of political action heretofore confined to, and 
studied only in the context of, national societies. For those 
who understand globalization in such terms, the ahistorical, 
state-centric, unitary-actor models of international relations
(IR) that constitute the discipline’s mainstream are ill suited 
to comprehend the nature and evolution of the new structure 
of relations. No better suited, however, are “progressive” 
theories that persist in viewing political relations in terms of 
political geography, as do all theories of imperialism, “de- 
pendency,” and statist development. 

Theories of politics that focus on relations of domination- 
subordination, hence power (especially power whose work- 
ings have been concealed ideologically and made to look like 
consent), and that find in the dynamics of these relations the 
forces that drive systemic evolution or change, may prove
useful for comprehending an emerging international order 
whose power relations, although social in origin and form, 
are transcultural and also have not yet been institutionalized 
or converted into legitimate authority. But it should be 
evident by now that no alternative explanatory framework 
stands a chance of supplanting current mainstream thought, 
at least in U.S. academia, if it entails not merely accepting the 
idea of human equality as a valued political norm but signing 
on to a specific ideological program of radical system transformation,
or abandoning the belief that claims about political
phenomena can be tested empirically by objective methods
whose results are independent of the investigator.

Critical Theory and World Politics is, for IR scholars, the
more interesting of these two works. It has value both for 
researchers and as a text for introducing graduate students to 
alternative theories, not least because its chapters form a 
fairly coherent whole and the contributors actually take up 
the concepts and issues set forth by Wyn Jones in his 
introductory chapter. Three principal strands of critical the- 
ory are compared and contrasted: derivatives of Habermas’s 
theory of communicative action, which originates in Frank- 
furt School Marxism but which has long since transcended 
Marxian orthodoxy’s insistence on the explanatory primacy of 
economic relations; the Gramscian Marxist theory of capital- 
ist ideological hegemony; and Foucauldian or postmodernist 
theory (which finds critics but no exponents in this book).

All but two contributors to the Wyn Jones volume debate
the ontological, epistemological, and normative (but not, 
save for a few signal exceptions, the practical) implications of 
the three approaches, sometimes relating them to construc- 
tivism as well. There is mention of feminist theory as a fourth 
approach, but the authors are hard pressed to find a uniquely 
feminist contribution to IR theory, besides its irrefutable 
insistence that domination-subordination along gender lines 
is distinct from other forms and requires separate analysis. 
(Unless one thinks that military deployments for such mor- 
ally sound purposes as peacekeeping are thereby exempted 
from negative side effects, such as the tendency of soldiers 
posted to strange countries to frequent prostitutes and 
spread HIV infections, one will learn little of IR-theoretical
significance from the sole feminist chapter in the book.)

On the whole, Andrew Linklater and the other Habermasians
(the majority in Critical Theory) provide a broader,
more nuanced, and, consequently, more realistic interpreta- 
tion of the social forces at work in the international sphere 
than does Robert Cox, the book’s chief advocate for Gram- 
scian Marxist theory. Cox’s Marxism (a term he uses rarely 
nowadays) is quite orthodox and regards international poli- 
tics as epiphenomena of globalizing modes and relations of 
production; the Habermasians argue, correctly in my opinion, 
that economism of any kind is incapable of coming to grips 
with the power-charged conflicts over identities, cultures, and
moral-ethical standards that bedevil international—that is, 
intersocietal—relations in our time. In this regard I applaud 
Jeffrey Harrod for reclaiming the critical dimension of clas- 
sical “power realism” in the writings of E. H. Carr and, 
, Hans Morgenthau and Georg Schwartzenberger 
(pp. 111-25); he compels us to recognize that Waltzian 
neorealism is less a formalization of classical realism, as 
conventionally assumed, than a negation of it. As for post- 
modernism, it receives respectful attention, but the contrib- 
utors find that its insistence on the oppressive aspects of all 
social relations, along with its extreme moral relativism and 
tendency to conflate transgressions of established order 
(irrespective of origins, purposes, and strategies) with resis- 
tance to it, is unhelpful for understanding the nature of IR 
and of little assistance in steering the international system in
normatively preferable directions. 

Most of the contributors to Critical Theory deny the 
possibility of a science of politics and reduce all political 
theorizing to a question of whose interests are served, “whose 
side are you on?” Before abandoning the book in despair, 
however, positivistically inclined thinkers should read the two 
last chapters—the responses of Chris Brown and Alexander 
Wendt. Brown replies in Habermasian terms: For him, the 
only social truths in which we can have confidence are truths 
“establish[ed] by consensus, but a consensus achieved via 
dialogue under circumstances in which considerations irrelevant
to truth [e.g., power, populist as well as elitist] are
excluded, in which the force of the better argument can come
through” (p. 198); the testing of explanatory claims against
empirical evidence certainly has a key part to play in determining
which arguments are better.

Wendt answers complaints that his version of constructiv- 
ism is intrinsically conservative because it competes with 
mainstream thought on the latter’s own positivistic epistemo- 
logical ground and encourages research that can inform 
liberal-democratic foreign policymaking (a Bad Thing insofar 
as it relieves some of the international system’s stresses, thus 
allowing its fundamentally unjust relations to persist that 
much longer). The second complaint translates into IR 
language the Left’s eternal dilemma, “reform or revolution”; 
Wendt’s position is that, given the opacity of the future, all 
who want to improve the human condition must try to do so
in the here and now, to whatever extent and in whatever ways 
are presently feasible. As regards positivism, he stands solidly 
on pragmatic ground: If the study of politics is to guide 
practice of any sort, our explanatory frameworks must be 
tested against whatever is “out there” (in the sense of our 
being unable to control it simply and directly), and the tests 
must be “scientific” (objective, potentially replicable by any- 
one) if we are to have any confidence in our frameworks’ 
ability to serve their intended purposes. 

Democracy and International Relations, too, compares and 
contrasts existing currents of critical thought—in this case,
with respect to the seemingly paradoxical international promotion
of democracy by the same institutions and social
forces whose program of global economic liberalism appears 
to have accentuated inequalities—without breaking new the- 
oretical ground. The book's problematic, according to Smith, 
involves understanding the nature of the democratization 
being pursued, the reasons it is being pursued, and “What 
does it mean for both the theory and practice of international 
relations” (p. xii)? The first of these issues is addressed by the
already burgeoning literature in comparative politics on
democratization. The second is explained by Richard L.
Sklar’s noneconomistic, power-centered conception of bour- 
geois political action in our time, postimperialism (to which I 
immodestly admit to having contributed), a conception too 
often overlooked by other writers. The third issue is not 
seriously theorized at all except for a tentative nod by Smith 
in the direction of “democratic peace” theory (p. 2) and, 
more to the point, helpful efforts by Vivienne Jabri (pp. 
51-70) and Stephanie Lawson (pp. 71-89) to work through
the contradiction between moral universalism, which under- 
pins any hope of institutionalizing truly democratic gover- 
nance worldwide, and cultural relativism, which deeply 
clouds any such project. Kimberly Hutchings shows how 
international democratization is complicated by the traditionally
close association of democracy with the sovereign
state, which, Janus-like, serves as both “the legal articulation 
of universal principles of right” with respect to the territorial 
population it governs and as the public face of “a particular
people or nation” to the rest of the world (p. 41). She also
cuts through the interminable debate over the relative merits
of various forms of democratic governance by reminding us, 
citing Chantal Mouffe, that democracy is an essentially 
contested concept (p. 48). 

The other six chapters in Democracy are empirical case 
studies in an international context. Their unity, however, is 
no more than thematic, and they seldom acknowledge the 
editor’s framework. They include a nice discussion of Russia 
and “democratic peace”; two reasonably subtle analyses of 
the internal tensions involved in efforts by international 
donor agencies in El Salvador, Guatemala, and Zimbabwe
simultaneously to promote economic liberalization (“mar- 
ketization”) and strengthen civil society, which is a welcome 
change from the unidimensionality and demonization too 
often encountered in critical writings on this topic; a solid 
critique of lack of democracy in the PLO; and a rather brief
and superficial treatment of Hong Kong’s problems of pre- 
serving democracy after absorption by the People’s Republic 
of China. Although the whole of the book is less than the sum
of the parts, in the sense that theorizing about the interna- 
tional aspect of democratization has hardly been advanced at 
all, the individual chapters do have value for specialists in 
their particular areas. 

On balance, both books deserve to be read by scholars who 
seek a better understanding of the sociopolitical processes 
involved in globalization and their significance for the evolu- 
tion of the international system. Their value, however, lies 
more in the stimulus to thought they provide and the issues
they raise than in the answers they supply. I cannot help 
observing that of the twenty-six authors represented in these 
two books, just four are based in U.S. universities. Might not 
the scientific skills for which the U.S. discipline is justly 
renowned provide a worthwhile balance to some of the 
frankly populistic and antirationalist thinking that, although 
largely absent from these books, is present in so many 
discussions of the globalization phenomenon? 


State Institutions, Private Incentives, Global Capital. By 
Andrew C. Sobel. Ann Arbor: University of Michigan 
Press, 1999. 304p. $49.50.


William Roberts Clark, New York University 


Many a tree, not to mention a few Starbucks’ windows, has 
fallen victim to public debate regarding “globalization.” 
Despite the volume of written output, however, we know very 
little about this phenomenon, which, like “capitalism,” seems 
to be transformed into whatever kind of creature its beholder
desires. The lack of progress can be attributed to three 
related tendencies in popular discourse, and the scholarly 
literature tly does not avoid them. First, there is little 
agreement about what globalization is. Second, discussions 
rely more on normative assertions than positive analysis. 
Third, even when positive analysis occurs, it often lacks 
rigorous logic or systematic evidence. The first and the 
second are related in that the term has come to mean 
everything about the world that will lead to our salvation or, 
alternatively, ruination. The second and third tendencies are 
related in that normative assertions, in the absence of logic 
and evidence, can be embraced forever and eventually come 
to be treated as descriptive analysis. 

State Institutions is Andrew Sobel’s second book and, like 
his first, is an important exception to the trends mentioned 
above. First, Sobel says something clear about just one aspect 
of globalization, the access to global capital. Second, he 
focuses more on how global capital markets work than on an 
evaluation of their fairness or efficiency. Third, he supplies 
considerable evidence relevant to his argument and uses 
sophisticated statistical tools to analyze this evidence in a 
careful manner. 

The overarching theme is that the growth of global capital 
markets was neither inexorable nor monolithic. As does 
Eric Helleiner (States and the Reemergence of Global Finance, 
1994), Sobel argues that the reemergence of global 
capital markets was the result of state decisions as much as it 
was the result of technological change and market competi- 
tion. He pushes this line of reasoning one step farther by 
arguing that participation in global markets varies consider- 
ably across states and cannot be fully explained by economic 
factors. Aspects of the “regulatory” and “participatory” states 
in borrower nations also influence the degree of access to 
global capital markets. 

Sobel begins with a detailed discussion of the advantages 
and disadvantages of alternative financial instruments from 
the viewpoint of both lenders and borrowers, including a 
fascinating discussion of the multiple strategic interactions 
involved in bringing initial public offerings of disintermedi- 
ated instruments to market. He argues that because these 
interactions are complex, lending decisions occur under 
conditions of incomplete and asymmetric information. First, 
there is uncertainty about future states of the world that may 
influence the ability of borrowers to repay loans (which Sobel 
terms “risk”). Second, there is uncertainty about attributes of 
borrowers that may affect their ability to fulfill their financial 
obligations (which Sobel refers to as “uncertainty”). These 
two factors interact to influence lending decisions. When 
uncertainty is high, lenders are unable to distinguish good 
risks from bad, and market failure can occur. Governments, 
according to Sobel, can help borrowers gain access to inter- 
national capital by constructing institutions and implement- 
ing policies that reduce uncertainty. 

Sobel takes three passes at the empirical evidence to 
evaluate his claim that policy decisions and institutions make 
a difference. First, he demonstrates that access to interna- 
tional capital and the type available depend on where the 
borrower is located. Participation in global capital markets in 
general and disintermediated markets specifically grew much 
more quickly in the last quarter-century for borrowers in 
OECD countries than in developing countries. This alone 
suggests that capital markets are not as global as some may 
think. The author argues that OECD countries received 
more than their fair share of access, whether one uses 
population or economic measures to generate expectations 
about what allotment would be fair. 

Second, Sobel uses data from two financial trade publica- 
tions to develop measures of country credit risk and uncer- 
tainty. Analysis of pooled cross-sectional time-series data 
finds that these measures of economic uncertainty explain 
some, but not all, of the cross-national variance in access to 
capital among developing countries. 

Third, Sobel adds indicators of the “regulatory” and “par- 
ticipatory” state to the baseline economic risk model to 
demonstrate that political factors also have a significant effect 
on lending decisions. By regulatory state Sobel means a set of 
factors that influence the enforcement of contracts, the rule 
of law, the risk of expropriation, corruption of government, 
and bureaucratic quality. The term “participatory state” 
captures the extent to which policies and institutions repre- 
sent the wishes of the members of society. Sobel constructs 
measures to capture how both of these affect lending and 
finds that they supplement economic determinants of lending 
to developing countries. Specifically, he concludes that par- 
ticipatory and regulatory states that reduce political risk help 
borrowers within their jurisdiction gain access to global 
capital. 

The strength of the book lies in the creative use Sobel 
makes of existing evidence and the care he takes in drawing 
inferences from the data, much of which he includes in a 
generous appendix. He argues persuasively that his reliance 
on data on economic and political risk collected by the 
finance industry is appropriate. In addition, he correctly 
identifies many attributes of the data that impede unbiased 
inference using standard linear techniques. Most important, 
he recognizes that his dependent variable is truncated because 
many countries in the sample went through years when 
no borrowing occurred. As a consequence, the effects are 
underestimated when linear models are used. 

To correct this problem, Sobel pursues a two-step proce- 
dure. First, he classifies observations into cases in which 
borrowers were and were not granted access to capital and 
then estimates a logit model to gauge the effects of economic 
and political determinants of lending. Second, he uses regres- 
sion on the subsample of lending observations to model 
differences in the amount of funds borrowed. In this latter 
model he uses LOESS regression (or robust, locally weighted 
regression) to correct for nonnormally distributed residuals 
that remained even after censoring and log transformation of 
the dependent variable. Unfortunately, the LOESS technique 
does not permit interpretation of individual regression coef- 
ficients, so we are left with the dubious task of inferring the 
causal effect of individual variables from their marginal effect 
on the overall fit of a series of models. 

Sobel’s concern for the structure of the data and his use of 
diagnostics are laudable, but a reconsideration of some of his 
research design decisions could prove fruitful. The author 
adopts the two-stage procedure described above after a 
rather summary rejection of the use of Tobit—the standard 
response to truncated data. In addition, although many of 
his decisions are driven by a concern about the ability to 
draw valid inferences from data that display nonstandard 
error structures, no attention is given to the threats to 
inference that typically accompany time-series cross-sectional 
data: autocorrelation, panel heteroskedasticity, and 
contemporaneous correlation. Had he addressed these stan- 
dard problems, the result might have been an error structure 
that would have supported valid inferences in an ordinary- 
least-squares model in the second stage of the analysis. In 
addition, given that his theoretical argument emphasizes the 
interaction between risk and uncertainty, it is surprising that 
this interaction does not receive detailed attention in the 
empirical analysis. Furthermore, by measuring access to 
financial markets in terms of the lending agreements actually 
consummated, Sobel seems to make the implicit assumption 
that the demand for credit is constant across time and space, 
and all variance in lending behavior can be explained by 
supply-side decisions. It is at least possible that some in- 
stances of nonborrowing were as much a reflection of the 
decision not to borrow as of a decision not to lend. 

Sobel’s study invites such criticism because it deals with 
threats to valid inference in a straightforward manner. There- 
fore, it is likely to encourage future research to determine 
whether the conclusions reached are robust with respect to 
the research design choices. That is one of the great strengths 
of this study. The rigorous examination of the causes and 
consequences of increased financial integration is in its 
infancy. By providing a framework for understanding differ- 
ences in borrowing experiences, State Institutions makes an 
important contribution to this research program and pushes 
it forward by demonstrating the value of careful empirical 
work. 


Russia and Europe: Conflict or Cooperation? Edited by 
Mark Webber. New York: St. Martin’s, 2000. 233p. $69.95. 


Adam N. Stulberg, Georgia Institute of Technology 


Over the past decade, scholarship on post-Soviet Russian 
foreign policy has been unable to reconcile the two faces of 
Moscow’s European diplomacy: growing anti-Western rhet- 
oric and geostrategic assertiveness, on the one hand, and 
routinization of cooperation across an expanding range of 
political, economic, and security issues, on the other. Amid 
confusion about Russia’s national identity, leadership poli- 
tics, domestic institutions, and great power status, much of 
the writing on Moscow’s European policy is descriptive and 
derives from oversimplified images of Russia’s neoimperialist 
ambitions and declining resource base, which offers little 
systematic explication of this puzzle. 

The volume edited by Mark Webber joins a burgeoning 
literature aimed directly at breaking out of this analytical 
straightjacket. The authors address Russia’s hot-tempered 
but pragmatic engagement of Europe through the prism of 
the debate between realism and liberal institutionalism. The 
first suggests a conflict-prone relationship in which coopera- 
tion exists on the margins of Russo-European relations and is 
rendered ephemeral by “disparity, distance, and even es- 
trangement” (p. 1). The second view is more sanguine about 
cooperation and emphasizes Russia’s historical and institu- 
tional links to Europe despite its distinctive political, eco- 
nomic, geographic, and cultural traits. Accordingly, Russia’s 
nondemocratic past and Eurasian expanse are no less com- 
patible with European institutions and identities than are the 
idiosyncrasies of “accepted” states, such as Germany, Italy, 
and Spain. 

Webber's introductory chapter lays down the analytical 
markers for the volume and embraces the second perspec- 
tive. A fragile but enduring pattern of cooperation is attrib- 
uted to the institutionalization of a common European 
political, economic, and security order. The web of estab- 
lished rules and practices lowers the transaction costs of 
cooperation and promotes mutual Russian and European 
interests, effectively narrowing the realm of discord to the 
design and purpose of different multilateral organizations. 

The ensuing chapters deftly expound on this theme with a 
set of institutional and issue-oriented case studies. Mike 
Bowler explains that Russia’s weakness at home and abroad 
sustains the commitment to regularizing constructive rela- 
tions with Europe among a Russian elite that collectively 
accepts this as the only realistic option for reclaiming the 
country’s great power status and advancing its recovery. 
Caroline Kennedy-Pipe highlights the dichotomy of Russia’s 
excoriating rhetoric yet pragmatic pursuit of a special rela- 
tionship with an enlarged NATO (outside the Former Soviet 
Union); she attributes Moscow’s restraint, in part, to the 
realization that it is more effective to work toward setting the 
agenda within the alliance structure than to protest from the 
sidelines. 

The chapter by Dov Lynch argues that the Organization 
for Security and Co-operation in Europe augurs well for 
Russia’s inclusion and equal standing in a wider European 
security dialogue, and it provides a mechanism for Moscow to 
preempt a de facto and hierarchical division of labor on 
“hard” and “soft” security issues among evolving European 
security institutions. Jackie Gower demonstrates how various 
institutional mechanisms combine to cultivate a separate but 
equal partnership between Russia and the European Union, 
reassuring Moscow and allaying anxieties associated with 
growing tensions over trade protectionism and EU enlargement. 
Webber finds evidence that membership in the Council 
of Europe, and the attendant costs of expulsion, reinforce 
Russia’s otherwise nascent commitments to upholding Euro- 
pean democratic norms and practice. 

Derek Averre analyzes the role of successive arms control 
agreements in reducing the “military factor” in European 
security, noting that continued progress depends on the 
extent to which this new security-building process incorporates 
political and nontechnical measures that reassure a 
transitioning Russia. Michael Anderson’s study of successive 
crises in the former Yugoslavia succinctly debunks the myth 
of a pan-Slav impulse in Russian foreign policy. He explores 
how Russia’s principled concern for being taken seriously as 
an independent player in Europe, as opposed to its prefer- 
ences for specific developments on the ground, has consis- 
tently motivated both Moscow’s outrage at NATO unilater- 
alism and its eventual acceptance of core Western policies in 
the Balkans. 

All of this is spelled out concisely. The writing is crisp 
and the chapters provide cogent analysis. In one short book, 
the authors impressively distill a broad pattern in Russian-European 
interaction and trace its developments across 
multiple dimensions of the relationship. But brevity has its 
drawbacks. Each author references a correlation between 
European institutional developments and shifts in the debate 
among Russian policymakers, but together they fail to offer a 
roadmap for systematically relating the two processes. En- 
gagement with Europe persists, notwithstanding alterations 
in the balance of power between Russian moderates and 
centrists. Why, for example, did NATO’s commitment to 
expansion and resort to unilateralism subvert Russian moderates 
(Foreign Minister Kozyrev) but not delegitimize 
pragmatists (Foreign Minister Primakov), whose platforms 
also were seen not to be working? The authors imply synergy 
between domestic and international institutions, but the 
failure to specify the modalities of such interaction renders 
the assessment of Russia’s ongoing compromise with Europe 
somewhat ad hoc. 

The authors take a spare view of international institutions, 
privileging functional features of multilateralism but side- 
stepping alternative social-constructivist and competitive in- 
strumentalist attributes. Although the book addresses the 
dense set of European institutions, each chapter takes as 
given state identities and offers surprisingly little analytical 
insight into how Russian preferences might be altered by 
strategic interaction within interlocking organizations. At the 
same time, however, the authors do not reflect on how 
Russia’s traditional benign indifference to European institutions 
(except NATO) can be sustained amid growing economic 
and strategic asymmetries to avert conflagration of 
mutual apprehensions. If material interests continue to stray, 
and the institutions themselves fail to reconcile Russian 
preferences for using membership to promote national inter- 
ests with European preferences for forging a common eco- 
nomic and political space, it is hard to see how the existing 
multilateral framework will be well suited to strengthen 
Russian-European relations. Absent analysis or evidence of 
ideational change, it is unclear that the efficiency of existing 
institutions can withstand temptations to exploit contradic- 
tions and “institution hop,” causing further divergence in 
Russian and European objectives. 

These reservations aside, this volume illuminates the ben- 
efits of merging international relations theory with post- 
Soviet era studies. Ultimately, however, it represents the first 
steps of a much larger intellectual enterprise. 


[...] a form of remembrance: Memory occupies a vital place at the heart of justice and its 
struggle to keep the victims, crimes, and perpetrators among the unforgotten. I argue that this 
memory-justice at once informs core judicial practices and ranges beyond them in a manner that leaves 
judicial closure incomplete. It reminds us of a duty to keep crimes and their victims from the oblivion of 
forgetting, of a duty to restore, preserve, and acknowledge the just order of the world. Yet, in the shadow of 
remembrance, other human goods can wither, goods located in the temporal registers of present and future. 
This latter lesson is important, but it is one with which we are familiar. I emphasize another, with which we 
are perhaps less at home: the intimacy of memory’s bond with justice, not as obsessional or as a syndrome, 
but as a face of justice itself. 


Aharon Appelfeld’s novel, The Iron Tracks, portrays 
a son’s unceasing hunt for his parents’ Nazi 
murderers. The “iron tracks” are the memory of 
the crime, and they suggest both the rectitude of his 
memory work and the compulsion, the life-sapping 
absorption in the past, that drives him: “My memory is 
my downfall. It is a sealed well that doesn’t lose a 
drop. ... Were it not for my memory, my life would be 
different—better, I assume” (Appelfeld 1998, 9, 195). 
We recognize in him, as in other more ancient bearers 
of memory, such as Electra, what Nietzsche called the 
“sleeplessness” (Schlaflosigkeit) of too much bitter 
memory. Might it not be better, we ask, to let go of the 
past, to invest our energy and time in building a future 
rather than dwelling in the evils of an irreversible past? 
Perhaps the price of our future is that we allow the 
poisonous memory of the past, of its victims and 
perpetrators, to pass into oblivion. Our orientation in 
the world seems to take its bearings from the future or 
present. 

From that vantage point, justice as the duty to 
remember is archaic, irrational, even dangerous. We 
look with suspicion on those mired in the past: on those 
in the Balkans, for example, riven by ancient animosi- 
ties and squandering the potential of an open future 
for the sake of settling old accounts. We wonder at the 
waste of 3,000 lives in the Northern Irish conflict 
governed, it seems, by the memory of the Battle of the 
Boyne and the Easter 1916 Uprising.1 To sacrifice the 
present and future on the altar of grievances, whether 
of the recent or distant past, appears deeply irrational 
and wrong. Our reaction to Achilles, who is driven by 
the memory of Patroklos’ death, or to Electra, who 
lives and acts in the shadow of her murdered father, is 
likely of a piece with our views of the wars in the 
former Yugoslavia or of the strife in Ulster. It is not the 
quantity of the time passed between the motivating 
event and the present response that puzzles or repels 


W. James Booth is Professor of Political Science, Vanderbilt University, 
Station B, Box 8261, Nashville, TN 37235. 

The author thanks Nancy Bermeo, Larry Griffin, Jorgen Hermans- 
ton, Bo Rothstein, Kurt Weyland, and the American Political 
Science Review’s referees for comments on the arguments of this paper. 

1 “The Irish been living [sic] the last twelve hundred years on dreams 
of revenge.” Tom Wolfe, Bonfire of the Vanities, quoted in Murphy 
1988b, 185n. 


us, although the greater the temporal distance, the 
more likely we are to be perplexed at the long duration 
of the wound’s presence in their lives. Rather, we see it 
as negative, divisive, and irrational precisely because 
these individuals and peoples have lost the use of their 
future-oriented compass. Justice as memory work 
seems in need of an apologia. 

This essay considers justice in its work of remember- 
ing, grounded in a debt to the past, and resistance to 
that work, opposition made in the name of the present 
and future. In focusing on the moral claims of remem- 
brance, I move against the weight of a modern view 
with a very long ancestry: “It is the living, and not the 
dead, that are to be accommodated” (Paine [1791/2] 
1979, 64). We who pride ourselves on our posttradi- 
tional condition, on our “radical openness” to the 
future (Habermas 1998, 197), may be especially drawn 
to this view, more so perhaps than the ancients, who 
saw both the ethical and obsessional sides of memory- 
justice. These latter warn us against an all-consuming 
absorption in the past. Yet, they tell us of something 
else, too: that the remembrance of past wrongs is not 
wholly a trifling and fruitless or destructive “[laboring] 
in past matters” (Bacon 1965) but, rather, one of the 
faces of justice itself. 

Dike, justice, is in one of its key dimensions the 
memory of evil past. The Furies, Daughters of the 
Night, who “hold the memory of evil,” labor mightily to 
ensure that the evil they remember, the memory of 
miasma or guilt-pollution, does not pass into oblivion. 
In their undying search for those tainted by guilt (the 
polluted), the Furies are the handmaidens of justice 
(Loraux 1997, 275; Ramnoux 1959, 148). They are its 
memory, ensuring that the passage of time does not 
overwhelm the work of justice (Aeschylus 1971, line 
381; Simondon 1982, 224; Sophocles 1994, line 870). 
The divine Furies and the mortal Electra are driven, 
which costs them (and others) dearly, but still their 
imperatives to remember and requite evil, not to let go, 
belong centrally to justice: “The triad of Zeus, the 
Erinyes, and Justice maintain the moral order of the 
world” (Simondon 1982, 223-4, 227). Whether divini- 
ties or mortals, their actions are driven by justice, by its 
remembering eye cast back into the past, and they do 
not abandon its work until retribution is made. This is 
a valuable reminder, as is also the counsel that forgetting 
is an antidote to the dangers of too much memory, 
and I shall return to both. 

Although this essay is about justice and memory, I 
use transitions to democracy as the principal illustra- 
tive setting because there the weight of the claims of 
the past, and their clash with those of the present and 
future, are most visible. I argue for the centrality of 
memory to justice (illustrated in such transitions) 
through a discussion of its varied memory practices. 
Here, in a contemporary political transition, is an 
example of memory-justice at work in three registers: 
both institutionalized and socially diffuse remembrance 
and the opposing claims of a restorative forgetting. 
After decades of conflict in Northern Ireland, a some- 
times faltering transition is under way that is designed 
to introduce a fuller democratic life and bring an end to 
civil strife. As part of that transition, Prime Minister 
Blair created the Saville Inquiry to probe the circum- 
stances surrounding the killing of 13 unarmed Catholic 
civilians by British soldiers in Derry on January 30, 
1972 (Bloody Sunday). Blair reaffirmed the victims’ 
innocence and spoke of the importance of remember- 
ing the dead. He also set the inquiry in the context of 
his government's peace initiative: “I believe that it is in 
everyone’s interests that the truth [about Bloody Sun- 
day] be established and told. That is also the way 
forward to the necessary reconciliation that will be such 
an important part of building a secure future for the 
people of Northern Ireland” (Blair 1998a). Justice as 
the institutionalized remembrance of the past is seen 
here, as in other truth commissions, as a duty to the 
dead and as a condition of reconciliation. 

At the sites of the Bloody Sunday killings in Derry, 
the nationalist community mounted wall-sized photographic 
murals of some of the 13 civilians in the moment 
of their death (Melaugh 1997a, 1997b). This also was 
an act of memory-justice, but not of the institutional- 
ized kind. Rather, the community paid a debt of 
remembrance to its own dead and called to mind the 
injustice inflicted on both the individuals and their 
community. Such memory is unlikely to be fully as- 
suaged by the results of the legal proceedings. Justice’s 
memory in this case keeps the bitter well open, and for 
a very long time indeed. In a Catholic neighborhood in 
Belfast, there is another wall mural. Titled an Gorta 
Mor (The Great Famine), it depicts bodies being carted 
from a field and cites a line of poetry by Seamus 
Heaney: “They buried us without shroud nor coffin” 
(Rolston 1997).2 That also is a memory in the service of 
justice, a giving of remembrance and dignity to those 
who were not granted it in their time, and is part of the 
community's long memory and identity, linked no 
doubt to the fresher wounds depicted on the Bloody 
Sunday murals. 

But justice’s memories (institutionalized or informal; 
of long or brief duration) also can be seen as a 


2 The line should read “without shroud or coffin” and is from 
Heaney’s “Requiem for the Croppies” (Heaney 1980, 54). The dead 
in the poem are not victims of the Great Famine but rebels from the 
1798 Irish Uprising. I am grateful to Roy K. Gottfried for his 
assistance in identifying this passage and placing it in its historical 
context. 
destructive wallowing in the past and so be challenged 
by the call to forget for the sake of peace. Winston 
Churchill wrote of Ulster: “But as the deluge subsides 
and the waters fall short we see the dreary steeples of 
Fermanagh and Tyrone emerging once again. The 
integrity of their quarrel is one of the few institutions 
that has been unaltered in the cataclysm [World War I] 
which has swept the world” (quoted in Mandelson 
2000). Against an absorption in the injustices of the 
past, Tony Blair (1998b) urged the parties “to put our 
histories behind us, try to forgive and forget,” and he 
appealed for “a Northern Ireland free from... the 
battles of the past. Offering the children here the future 
they deserve” (Blair 1999). 

These three registers are not simply separate paths 
of memory and resistance to it. Rather, they often 
coexist and conflict, especially in times of regime 
change. Judicially institutionalized memory-justice 
seeks to address and sometimes limit the effects of the 
informal, bitter memories of injustice; the latter press 
upon the institutions of justice demands that often are 
unanswerable; and justice as forgetting attempts to still 
the voice of memory in the name of the future. 

In the last section I will return to the informal and 
diffuse presence of memory-justice. For now I want to 
mention three judicial or quasijudicial faces of memory-justice 
as it deals with the past: trial and punishment 
(criminal charges); illumination and acknowledgment 
(truth commissions); and forgetting for the sake 
of a future in common (amnesty). First, criminal 
charges may be brought against the perpetrators or 
some subset of them. Examples are Nuremberg, the 
Eichmann trial, and the proceedings against the Greek 
junta leaders, those responsible for the Argentinean 
“dirty war,” and the erstwhile head of the East German 
Stasi, Erich Mielke. Second, in truth commissions the 
strategy is not prosecution or punishment but disclosure 
and, perhaps more important, acknowledgment of 
the evils committed and of their victims (Cohen 1995, 
18). Illustrations are the Truth and Reconciliation 
Commission (TRC) in South Africa, the various official 
and unofficial Nunca Más (Never Again) efforts in 
Latin America (Chile, Argentina, Brazil), the Saville 
Inquiry, and the Study Commission for the Assessment 
of the History and Consequences of the SED Dictator- 
ship in Germany (Hayner 1995). Third, amnesty is a 
form of political-judicial forgetting that puts the past 
out of sight. The past is here moved beyond the reach 
of justice and into the shadows of civic forgetting.3 The 
objective of such amnesties is almost always civil peace, 
born of the need to protect a young and vulnerable 
democracy from being torn apart by an absorption in 
the past and an attendant spirit of revenge. Democracy 
and its future must, in this view, take precedence over 
the past and its demands that justice be done. Uruguay 
is perhaps the most recent example, although amnesties 
were employed in much of postwar Europe (De 
Brito 1997, 125-51). 


3 Amnesties and similar arrangements are sometimes used by truth 
commissions to induce witnesses to offer testimony. The disputes and 
resentment that these amnesties generate suggest the power of the 


RETRIBUTION 


In the panoply of human passions and interests that 
inform momentous and traumatic transitions, there is 
likely no single motivation to which we can ascribe 
retributive judicial proceedings. Undoubtedly, revenge 
is often a factor. Another is the need to delegitimize or 
root out the former nondemocratic elites so as to 
lessen the prospect of a counterrevolution and the 
related need to reintegrate the victims and enhance the 
credibility of the new regime by underscoring its break 
with the past.* I will consider another claim: An ethics 
of remembrance informs the calls for retributive justice 
that mark the efforts of the past century to craft a 
response to massive state-led human rights abuses and 
genocides. Retroactive justice, Carlos Nino (1996, x, 
19, 21, 33ff) argues, is essential to democratic transitions, 
whether in Argentina (where he was a leading 
figure in efforts to bring to justice those responsible for 
the “dirty war”), in Greece after the fall of the junta, or 
in the 1991 Czech lustration law. We could say that a 
trial is an assertion of the power of memory-justice. 
Memory seizes the crime, keeps it among the unfor- 
gotten, and insists on retribution. No other good or end 
to be achieved is invoked to justify this process. By the 
same token, countervailing considerations, such as the 
unhappy effects of the relentless pursuit of justice, are 
not entertained. The language is not that of healing, or 
of sustaining the reemergence of democracy, or of 
identity but, rather, of the imperative to do what justice 
demands (Minow 1998, 63; Weschler 1990, 244). 

Perhaps this is what Minow (1998, 25) means when 
she writes that to respond to mass atrocity with legal 
prosecution is to embrace the rule of law. To embrace 
the rule of law is to put oneself in opposition to the 
lawlessness and violence of dictatorship, but it does 
something more. The great fear for memory-justice is 
that the crime will be allowed to slip into oblivion, into 
the forgotten; that the passage of time will, like a 
natural solvent or a willed forgetting, free the perpe- 
trators and weaken the already weak hold of justice in 
the world. A trial is one forum of resistance to this; it 
is a venue for seeking the victory of the memory of 
justice over the will to forget, for seeking, in a sense, 
the rule of law. Justice and memory resist the passage 
of time and deny to it any power of moral/legal 
absolution (Améry 1977, 115-6; Jankélévitch 1967, 53; 
1986, 26). 

Faced with the power of the process of becoming 
and with a concern for the future, memory-justice, as 
the voice that insists on keeping the past present, must 
seek to prevent the effacement of the memory of the 
crime (Jankélévitch 1986, 39). Justice thus becomes the 
memory of evil, and it fights a desperate battle against 


* Archbishop Desmond Tutu (1998), in his Foreword to the TRC’s 
final report, makes clear the multiple concerns at work in such 
proceedings, including amnesty so as to encourage truth-telling and 
opposition to lustration of implicated persons. 
the oblivion that always threatens to engulf it, that
gives sanctuary to the perpetrators and a victory to 
injustice (Jankélévitch 1967, 23, 25, 29-30; 1986, 39). 
What at first sight appears as a straightforward conflict 
between the vengeful Furies and the demands of lawful 
justice becomes, in the light of memory, more nuanced. 
The Furies, for all the real tension between their 
pursuit of those polluted by guilt and the requirements 
of a law-governed civic order, at least in part are doing 
the work of justice. Their ferocity is directed equally 
against the criminals and the forgetting that shields 
them from retribution, and their refusal to let go of the 
past expresses a core demand of the rule of law 
understood as justice’s memory work. Their work may 
be (to borrow Francis Bacon’s term) “Wilde,” but in its 
determined insistence on the memory of crime, it is 
justice, too. 

Retribution, then, is first and foremost restorative 
not of the status quo ante of the victims but of a just 
condition as such. The Furies, Michèle Simondon
(1982, 223, 227) writes, belong to a universe ruled by 
justice, and in awakening us, in calling us back from our 
forgetfulness, they do the work of justice and restore or 
bring to light again its presence in the world. Yet, 
retribution answers not only to the call to see justice
embraced but also to the appeal of the victims. “Those 
who have been hurt need a response” (Williams 1993, 
70). Justice through trials in the aftermath of dictator- 
ship and human rights abuses is one such response. But 
that type of response raises difficult issues. It is in the 
nature of certain crimes, and especially those involving 
state-led terror, that the victims are no longer alive to 
hear the response that justice offers them for their 
sufferings. Under these circumstances, and allowing for 
the passage of often considerable time, it is a thorny 
issue to determine who the recipient of the response 
ought to be and how one gauges the appropriate 
magnitude of compensation, given the radically differ- 
ent conditions that prevailed at the time of the crime. 
And what can be given the victims in compensation for 
their loss? Human practice can alter the present and 
future but is unable to reach into the past. At best, we 
can provide only ersatz substitutes for what has been
lost (Ackerman 1992, 3, 89ff; Elster 1998, 23-7; Schlink
1998, 433-4; Waldron 1992, 12%).

How to calibrate compensation is one issue, but 
there are still deeper problems. The imperative to 
answer the victims’ need for a response is often understood
as a duty to the dead. Memory-justice’s tenacious
clinging to the memory of the crime and its victims is 
something owed to them, rather than to the living. The 
face of justice is here at least turned entirely toward the 
past, and the rightness of its present actions is under- 
stood squarely in light of a debt to the dead.5 We find 


5 This is the imperative to give the wronged (the victims of the
Holocaust, the disappeared, the victims of the apartheid regime)
their full measure of justice. A related but inflected view
of judicial response to the past is that the core debt is not (in the first
instance) to the dead but to justice itself. The dead and surviving 
victims are no longer the centerpiece, and what takes their place is an 
almost Kantian concern for a just law-governed condition as such. 
Respect for justice (and morality) demands that past crimes be 
striking the idea of indebtedness to those with whom
we have neither contract nor bargain nor even any 
tangible community except perhaps the communion of 
remembrance itself. Is the idea of a debt grounded in 
the imperatives of fidelity so very odd? Perhaps So- 
phocles’ understanding that Orestes and Electra owe it 
to their dead father, Agamemnon, to see that his 
murderers do not escape punishment captures a di- 
mension of memory-justice that is still at work among 
us. 

Remarking on the various efforts to deal with the 
past in the course of South Africa’s transition to 
majority rule, Arieh Neier (Neier, Zalaquette, and 
Michnik 1997, 3) states that the motivation was to 
recognize the dignity of the victims. To do otherwise, 
he concluded, would be to acknowledge that only the 
future has value. Related sentiments of the morality of 
memory lie behind our practice of public and private 
commemoration, monument building, and so forth. 
What is striking in all these practices is the idea of a 
debt to those who are not present, that they are entitled 
to hear the response of justice to their fate or to receive 
compensation, and of the weight accorded to this past. 
That is surely among the lessons of the Furies, Electra, 
and Achilles: Their absorption in their debts to the 
dead costs them (and others) mightily, but those costs 
and the obliviousness to the future that helps bring 
them about are (at least partially) justified and seen as 
good in light of the commitment to fidelity, to keeping 
faith with the victims (Jankélévitch 1967, 73; 1986, 60). 
We may be tempted to see in the logic of the Furies an 
obsessional immersion in the past, or an illustration of 
the need for a law-governed political order to vanquish 
their “Wilde” and private vengeance (Shklar 1990, 
94—5).6 I look at another of their faces: as memory- 
justice giving voice to the past and its victims and 
insisting on the restoration of justice in the world 
against the oblivion of forgetting.7

The work of trials as acts of memory-justice in the 
wake of massive state crimes and during the transition 
to democracy can be framed in language other than 
that of the restoration of justice. Preventative and 
didactic purposes are examples of other justificatory 





answered. In the former view, it is memory of the victims and the 
crimes against them that is central to the project of justice; in the
latter view, it is justice itself that is recalled and set against the crimes 
of the past. 

6 The Furies are the “agents of pollution who embody the anger of 
one slain by a kinsman” (Parker 1983, 107). Their conflict with the
city in Aeschylus’ Eumenides can be seen as a public/private justice
tension, much as the ban on dueling was an attempt by the modern
state to establish its monopoly on legitimate violence. I emphasize
another facet of their behavior: Not so much their private character
needs to be tamed as their destructive single-mindedness in light of
the multiple goods sought by human beings. So tamed, the transition 
to (legal) justice in the city is seamless. 

7 This is not to deny the destructiveness of the Furies’ work and 
(sometimes) of memory’s role in politics. Memory (individual and 
collective) can indeed be obsessional and destructive, not of goods 
external to justice but of the very core of justice itself. Memory, in 
brief, is readily available for abuse. Todorov (1995) provides a very
good account of this dimension of memory, and the ancient Greeks
were, as I note at the end of this essay, well aware of it. My purpose
here is rather to draw attention to the ways in which the Furies and
memory are the sisters of justice.
idioms. Both are future oriented, and their claims rest
squarely on the accuracy of their empirical assumptions 
about the long-term effects of such measures. The 
preventative rationale is perhaps most familiar in the 
recent period from the Nunca Más process in Argen- 
tina after the “dirty war.” The purpose of these trials 
was, according to Raúl Alfonsín (1996, 16), a former 
president of Argentina, not so much to punish as to 
prevent. Trials and other actions against the perpetra- 
tors can serve as a warning. Even if they do not wholly 
uproot the personnel and institutions responsible for 
state crimes and dictatorship, they put them on notice 
that there is no sanctuary from justice. Something like 
this motivation was no doubt part of the rationale for 
the Nuremberg trials and the trials that followed the 
collapse of military juntas in Greece and Argentina. 

A second and no less powerful reason for such trials 
is didactic and identity-shaping: What we remember 
will determine what we become. They can contribute to 
shaping the collective memory of a community (Osiel 
1997, 6; Smith 1997, 19). The ingathering of the past in 
the form of institutionalized memory-justice allows 
societies to cement their political-legal identity in the 
aftermath of trauma and disruption (Huntington 1991, 
211; Waldron 1992, 5-6; Wieviorka 1998, 81, 95). 
Courts can be seen as the locales of a narration or an 
account of responsibility and guilt, placed in the wider 
canvas of the community’s political identity, of what 
was lost and what needs to be restored. The restoration 
is of a (partial) status quo ante, a former version of the 
community: Republican France before Vichy, Greece 
before the coup d’état, or Chile before Pinochet seized 
power. And the narrative of remembrance takes its 
bearing precisely from that earlier condition. 

When there is no past democratic and law-governed 
regime to serve as the focal point of a restorative 
narrative, restoration means the recovery of a law- 
governed condition not from the community’s own past 
but, one might say, from justice itself. This is the 
memory of justice not in an empirical or historical 
sense but as an almost Platonic recollection of justice: 
a reminder to the community that it has strayed, not 
from its own past but from justice (Plato, Phaedrus 
249c, 249e, 254b). The courtroom then becomes the 
site of a rupture. The past, acknowledged and con- 
demned, is not seen as a regrettable interregnum, as if 
it were the work of criminal interlopers, but as the past 
of the community simply. A trial serves under these 
conditions as a call to remember the claims of justice 
on us and to repudiate one’s former self in the light of 
those claims (Jaspers 1965, 100; Schlink 1988, 66-7; 
1998, 437). 

Whether by restoring integrity to a community gone 
astray from justice or by condemning a deviation from 
its own core values, these applications of memory- 
justice are arguably of special importance to the resto- 
ration or creation of liberal and democratic regimes 
after the defeat of dictatorship. This is so because 


8 The idioms of restoration and remembrance also are important in
the language of revolutionary foundings. The recovered past serves, 
among other things, to establish filiation, to mark out an (alleged) 
properly conducted (i.e., not as democratic show trials
that grossly violate the basic norms of a law-governed
society) they reinforce the constitutional/judicial foundation
of society. In other words, they demonstrate in
practice the difference between a lawless regime and a 
mode of governance restrained by law. In its legal
guise, memory-justice holds onto the past for the sake
of retribution and restoration, of the rule of law, and of
a law-ordered moral-political universe. In so doing, it
can also help restore a political identity or announce a 
rupture and induce a new birth. It can reject the old 
self/regime in order to free the new society of the
pollution of its predecessor. 


JUSTICE AS TRUTH 


Societies undergoing regime transitions and attempting 
to find a way to deal with the past can attend to the 
demands of memory-justice in two other ways. Memory 
and retribution are closely entangled (Minow 1998, 14, 
quoting Geoffrey Hartman). Grief, remorse, revenge, 
and retribution all participate in memory as well as in
the resistance to forgetting. They cling to the past and 
fight against the erosion of memory brought about by 
the passage of time, its unceasing movement toward 
the openness of the future.9 But the memory of evil
that the Furies hold, that central part of memory- 
justice, is not bound up with retribution alone. Once 
more, the ancient Greeks offer some guidance. Their 
word for truth, aletheia, means literally the “unforgot- 
ten” (from the root leth, forgetting) (Loraux 1988a, 
37). Forgetting rather than falsity is the antonym. To 
remember is to preserve the truth of the phenomenon. 

It is not surprising that truth as the unforgotten, on 
the one side, and justice as memory-justice, on the 
other, are closely linked (Detienne 1994, 69-70, 76).
Justice seeks the truth, we might say, although con- 
cealed beneath that commonplace is the deeper obser- 
vation that justice wants truth the way memory desires 
phenomena to remain (or become) unforgotten. It is 
the work of justice to bring the truth to light, to secure 
the deeds, victims, and perpetrators in aletheia, in the 
realm of memory, of the unforgotten. To forget is to 


continuity so as to help secure authority for the new regime. 
Innovation is cast as renaissance (Le Goff 1992, 9). The importance
of doing justice to the past in the context of transitions to democracy
can be seen as part of this broader and complex relationship between
political past and present.

Because my principal concern is to understand the relationship 
among justice, remembrance, and duties to the past, I treat the core 
issues of the memory of justice as analytically separable from the 
wider canvas of regime founding. Nevertheless, these issues point to
just that wider canvas of time, memory, and regime identity. I touch
on this elsewhere (Booth 1999), and see Arendt 1963, 42-3, 196-7,
201, 207-8. I am grateful to a reviewer for emphasizing this point.
9 Revenge is a form of remembrance but is quite apart from and,
indeed, opposed to judicial, law-governed retribution. Shklar (1990,
93-4) argues it is subjective, a passion often irrational and
dissatisfied with lawful retribution. The possibility of a lawful reckoning
with past injustices sometimes depends on vanquishing the


it is a sister of justice and has a certain kinship to lawful retribution,
above all in its passion to remember wrongs done, to preserve them
and their victims among the unforgotten, and to requite them. 
live at once in untruth and injustice, which explains the
often (but not exclusively) negative valuation attached 
to forgetting in the classical literature. Forgetting is 
opposed to both memory and justice. More precisely, 
forgetting is opposed to the core of justice that lies in
the preservation of truth/memory (Detienne 1994, 6; 
Yerushalmi 1988, 20). 

In our time, this memory-truth-justice connection 
has become, if anything, even more compelling. One 
goal of the twentieth-century perpetrators of mass 
crime was to obliterate any memory of the victims 
(Minow 1998, 1). To erase the memory of crimes and 
their victims is, of course, desired by criminals great 
and small, but I refer here to something more: By 
effacing their names, histories, and fates, it was hoped 
that the fact, the truth of their existence would also 
disappear.10 We need to preserve the memory of the
crimes, in the words of the Polish poet Zbigniew 
Herbert, because only “a fatal defect in our tools/or a 
sin of memory” could leave the disappeared in the 
shadows of forgetting. “Ignorance about the disappeared
undermines the reality of the world” (Herbert
1985, 65, 67).

If the victims of mass crime are left faceless and
nameless, if the hour, manner, and place of their last 
moments are unknown, then they are outside the light 
of truth, lost to forgetting. The world is left incomplete; 
its integrity broken; its reality undermined. The very 
incompleteness or the empty spaces that such efface- 
ment creates command memory’s attempt to recover 
these persons. The need for precision (Herbert titles 
his poem “Mr. Cogito on the Need for Precision”) is 
precisely the need to restore the truth against forget- 
ting or effacement, whether willed or simply the con- 
sequence of the passage of time. What is missing and in 
need of restoration is not just a fragment of an
incomplete whole, a shard to be restored to a shattered 
vase. Rather, justice is missing. “We are despite everything
the guardians of our brothers,” Herbert writes.
To be their guardians is to keep them in the sanctuary 
of truth-memory, which at the same time preserves the 
(just) reality of the world.

The victims, especially the voiceless dead, are there in
“the immensity of historic memory, constantly menaced
by an unacceptable forgetting” (Semprún 1995b,
91). Remembering, bringing them into the light of 
truth, restores a kind of life to them. Memory is a sort 
of sepulcher that gives survival to what is remembered: 
“In spite of them [the Nazis], the souls of your brothers 
and sisters will live on, the martyrs whom they sought 
to destroy. For no one can annihilate letters. They have 
wings, and they fly around in the heights... into 
eternity” (Kugelmass and Boyarin 1998, 192). Remem- 
bering is to make what is past present, to rescue it from 
the status of what-had-happened, or more radically 
from the oblivion of forgetting. It connects what is lost 


10 The Nazis planned to transform a Prague synagogue into a
“Museum of a vanished race” once the annihilation of European
Jews was complete. They thus would have destroyed them twice over:
first, their existence, and then the remembrance of them. See
Kiderlen 1990. 
to what is here (Carson 1999, 38). Not to bring the
dead into the sanctuary of truth-memory-justice is to 
annihilate them a second time: “forgetting: a second 
death” (Lenz 1992, 10). It is a second death because it 
ratifies and deepens the oblivion to which the initial 
crime seeks to consign them. “I feel guilty that after 
they [the murdered Jews of Jedwabne, Poland] died 
they were murdered again, denied a decent burial, 
denied tears, denied truth about this hideous crime, 
and that for decades a lie was repeated” (Michnik 
2001). 

Bringing the victims into memory/truth saves them 
from being forever lost among the forgotten. Memory 
is the truth of things, whether of victims or crimes, 
because it preserves the deeds and persons. What is left 
in silence dies and languishes among the lost. What we 
do not remember is as if it never happened (Lenz 1992, 
17; Simondon 1982, 124). Out of this sense of menace 
to those who have already suffered comes an impera- 
tive of justice to remember and restore the truth. 
Illustrations abound. Nicole Lapierre (1989, 240), referring
to the extermination of the Jews in the Polish
town of Plock, states that “the effacement was total... 
what was left was my rage to write and to describe.” 
Germaine Tillion (1946, 19, 50), a survivor of Ravensbrück,
recounts that her will to live in the camp was
fueled by the desire to bear witness and to bring the 
truth out from the catastrophe. Patrick Modiano, born 
after World War II, set out to discover as much as he
could about Dora Bruder, a Parisian Jewish teen who 
was murdered at Auschwitz: “If I was not here to write 
it, there would be no trace” (Modiano 1999, 65).11

The language of truth-memory-justice is not, at its 
core, that of the “truth will set us free” kind (Soyinka 
1999, 37). What principally justifies this pursuit of truth 
through the various public and private institutions of 
memory is the language of fidelity, of what is owed to 
the dead. To quote Czeslaw Milosz (1991, 281) in his 
Nobel Lecture: “Those who are alive receive a man- 
date from those who are dead and silent forever: to 
preserve the truth about the past.” This mandate 
demands of those who survive or who come after that 
they act as witnesses to the truth of what happened, 
that they speak on behalf of those who cannot (Lapi- 
erre 1989, 10). Their responsibility to the truth is an act 
of fidelity, of faithfulness to the victims (Jankélévitch 
1986, 60, 79). Silent deeds die, whether those of 
greatness and heroism or those of abominable crime. 
All are in need of words, letters, monuments, of 
witnesses and makers of words, to preserve them 
among the unlost (Carson 1999, 40). There is an 
electiveness about this aspect of memory-justice: It is 
up to us, in the present, to give or withhold voice. But 
this voluntariness, far from diminishing the moral 
weight of the responsibility, rather redoubles the re- 
quirement to remain faithful to the victims of mass 
crime by not completing the perpetrators’ work of 
effacement. 





11 Klarsfeld (1995, 1,535) located and published a photograph of
Dora Bruder and wrote: “From now on, Patrick Modiano knows the 
face of Dora Bruder.” 
The impetus that draws us to the work of truth-
memory-justice is at once both palpable and perplex- 
ing. Palpable is the moral claim that we owe the victims 
the light of truth/remembrance; although they are 
silent, they call on us to witness the fact of their 
existence and their fate. We know and are familiar with 
the hold of truth-memory-justice over us. We see it in 
the “memory books” composed by survivors of the 
Holocaust. We see it in the work of Serge Klarsfeld 
(1995), who compiled 1,731 pages of names, photo- 
graphs, addresses, and convoys of Jewish children 
deported from France to Nazi concentration camps. At 
the legal-political level, we see it in the work of truth 
commissions and similar processes that have accompa- 
nied transitions to democracy in South Africa, Chile, 
and El Salvador as well as in the release of Stasi files in
reunified Germany (Huyse 1995, 52-3). Yet, perplex-
ing is the notion that we have a bond to which we must 
remain faithful, a bond never stated or articulated but 
that nevertheless “denies us the right to be silent” 
(Raczymow 1979, 106). That thought goes beyond our 
customary understanding of obligation and its sources, 
as does its consequence: the weight that it attaches to 
this duty over our present and future concerns. 

Is this preservation in the light of remembrance 
sufficient? For Sophocles, Electra and her brother are 
not simply the living truth and memory of the crimes 
committed against their father. They are also the 
instruments of retribution, and the justice they embody 
is complete in the unity of truth, remembrance, and a 
corrective. It is not surprising that some consider truth 
commissions and similar strategies for dealing with the 
past to be inadequate. They are found wanting both
(and relatedly) because their focus is on disclosure, not 
on punishing the perpetrators, and because amnesties 
are sometimes needed to heal divisions and encourage 
the perpetrators to come forward. Referring to the 
TRC, Wole Soyinka (1999, 13) writes: “Truth as a 
prelude to reconciliation, that seems logical enough;
but Truth as the sole exaction or condition for Recon- 
ciliation? That is what constitutes a stumbling block in 
the South African proceedings” (see also Holiday 1998, 
47). Truth alone, the mere opening of files, cannot 
substitute for the vindication that can be achieved only 
when the oppressor appears in a court before his 
victims. Some measure of restitution is necessary (Soy- 
inka 1999, 30-1, 36, 80). 

Soyinka discusses this vindication in terms of healing 
and catharsis, but a less psychological way to phrase it 
would be to say that justice-memory and justice-retri- 
bution are two faces of justice and its relation to the
past. Remembrance, as preserving the truth of the past, 
of the victims and perpetrators, at once saves the 
phenomena from oblivion and fulfills a debt of fidelity 
to the dead. But as Electra’s life makes clear, the light 
of truth has no exhaustive claim on the entirety of 
justice. We do not simply want crime to stand exposed 
in the light of truthful remembrance; we want it to be 
punished as well. Truth and retribution are what is 
wanted. 
AMNESTY


Justice and retribution stand in an uneasy relationship 
with other comportments, even virtues, that seek to 
moderate or season justice. Mercy and forgiveness are 
two of these: The former lessens the justly deserved 
punishment; the latter changes our attitude, how we 
feel, about the perpetrator (Minow 1998, 15; Murphy 
1988a, 21, 24). The first is a not uncommon part of 
judicial or quasi-judicial proceedings; the second is 
more subjective. Forgiveness is the overcoming, on 
moral grounds, of the feeling of resentment, and it is 
particularly important in allowing human relations to 
continue that otherwise would be disrupted by resent- 
ment (Murphy 1988a, 20, 24). Because of its restorative 
power, forgiveness is a virtue, even (in Archbishop 
Tutu’s phrase) a “civic sacrament,” often recom- 
mended in times of regime transition (quoted in Gut- 
mann and Thompson 2000, 29; Sachs 2000, 223-4). 
Yet, there is an uneasiness in the relationship of mercy, 
forgiveness, and justice, a tension that comes from the 
fact that although all three are counted virtues, the 
work that forgiveness and mercy strive to do is to lessen 
or moderate the full measure of retributive justice’s 
rightful claims (Murphy 1988b, 167).

There is another uneasy way in which justice may be 
seasoned, one that shares much in common with mercy 
and forgiveness but that is more explicitly bound up 
with justice as the work of memory. I stated earlier that 
keeping deeds and persons in memory is an elective 
matter, almost a matter of resistance against the natu- 
ral course of things, of fighting against the corrosive 
quality of the passage of time and our preoccupation 





12 Murphy and Hampton (1988), Gutmann and Thompson (2000),
and Minow (1998) provide valuable discussions of forgiveness and
justice. The tension between forgiving and merciful dispositions, on
the one side, and justice, on the other, is nicely illustrated in Kant’s
theory of justice and morality. Kant (1968d, 337; 1991, 145) was
suspicious of philanthropy, mercy, and forgiveness in their efforts to
limit justice. Fellow feeling is a duty but should not be allowed to
dilute the demands of morality and justice (Kant 1964, 66; 19688,
398-9; 1968d, 448ff; 1991, 243ff). Of course, and by the same token,
resentment would have been abhorrent to him, not because it is
insufficiently forgiving, but because it substitutes feeling for respect
for the moral law. Kant sees punishment as governed by a categorical
imperative, not to be modified by “eudaemonism” or indeed any
consideration of external social benefit (Kant 1968d, 331-7; 1991,
140-5).

Kant’s moral and legal philosophies are only ambiguously related 
to the arguments presented here, but they offer some fertile ground 
for reflection. In general, moral judgment and remembrance are not
related for Kant. The categorical imperative alone provides guidance
as to the demands of the moral law under any circumstances, and it
frees moral agents from the extramoral (and unreliable) calculus of
experience (Kant 1968c, 555; 1968b, 583; 19680, 286f; 1974, 53-4).
Moral conduct and justice as institutionalized morality (in relations
of external freedom) are not driven by the notion of debts to persons,
dead or living, but by respect for the (moral or external) law itself.
More speculatively, it is worth entertaining the thought that the
categorical imperative and its illustration, the Kingdom of Ends, are
devices to remind us of the claims of the moral law. They are
reminders not in the sense of calling to mind our empirical past but
just the reverse: by reminding us of justice in the midst of the false
and misleading lessons of the world as given in experience, a world
filled with the forgetting of the truth of justice (Plato, Phaedrus,
248c). These devices are, one might say, a variant of the Platonic
memory of justice.
with the future. Sophocles and other ancients reflect
upon what it would mean for children to forget the 
murder of their father, to live untroubled in the 
presence of his killers and their unanswered crime, as if 
no injustice had been done. That would undermine the 
“[just] reality of the world.” Antagonistically related to 
this retrospective glance of memory-justice is precisely 
a rejection of that orientation, that is, an effort to 
overcome the hold of the unforgotten, of remem- 
brance, on us. This is not one of the faces of memory- 
justice but is a rejection of memory and, being 
grounded in that refusal, is intimately connected to the
object of its refusal. Broadly speaking, this is the view 
that, in one fashion or another, a surfeit of memory is 
destructive of life. Remembrance draws us to what is 
dead and to the irreversible. It is nostalgia, bitterness, 
or the thirst for revenge. All these dwell in what is 
beyond human agency to modify; all, it seems, irration- 
ally resist the becoming of time. Worse, all (therefore) 
sacrifice the present and future for the sake of the past. 
Perhaps forgetting, the letting go of the past, as 
Nietzsche (1957, 7-8) argued, is essential to life. 

Forgetting can also sometimes be necessary therapy, 
a way to minimize the aftershocks of trauma. Jorge 
Semprún (1994, 292, 332), who emerged alive from 
Buchenwald in 1945, tells us that memory was deadly 
for him, and consequently “I chose forgetting. I put in
place, without too much concern for the good of my 
own identity . . . the strategy of voluntary amnesia, cru- 
elly systematic” (see also Appelfeld 1994, ix). In the 
ancient view, Mnemosyne, memory, is also the goddess
who allows us to forget our ills (Hesiod 1983, lines 
54-5; Simondon 1982, 141; Vernant 1996, 117). For- 
getting permits us to live in the present and to be open 
to the future. In the absence of this salve, memory can 
chain us to the past and, when the memory is of crime 
or injustice, to bitterness and resentment. We can see 
this clearly in the case of Electra and Orestes. They are 
the embodiment of memory-justice, but at the same 
time, or precisely because of that, they are bound hand 
and foot to the past. They are not creatures of the 
present and future, and their absorption in the past
bars them from those other human times and their 
associated joys. 

Homer’s Odyssey and Aeschylus’ Eumenides present
the case for forgetting in a more directly political 
context. After Odysseus kills Penelope’s suitors, Zeus 
tells Athena: “Let us make them forget the death of 
their brothers and sons, and let them be friends with 
each other, as in the time past, and let them have 
prosperity and peace in abundance” (Homer 1977, 
Book 24, lines 480-90; Loraux 1988a, 33). Forgetting is 
here the precondition not of individual well-being (as 
in Semprún’s account) but of a return to peace. In the
Eumenides, when Orestes is acquitted, Clytaemnestra’s 
Furies, anxious for revenge against her son and killer 
Orestes, rage against the great dishonor and ills 
(megala toi korai dustuxeis Nuktos atimopentheis) that 
they feel they have received at the hands of Athena and 
the jury. Athena, in what Pierre Vidal-Naquet (1995, 
266-7) calls the West’s first argument for prescription 
(statute of limitations), convinces the Furies (who are 
motivated both by anger over the acquittal and by
shame at being bested) not to visit conflict and death 
on the city (Aeschylus 1971, lines 825ff, 860ff, 975). 

The putting aside or forgetting of past crimes for the 
sake of peace and civic unity is not just a device of 
ancient epic and tragedy. As part of the Athenian 
reconciliation agreement of 403-402 BC that restored
democracy after the dictatorship of the Thirty, an 
amnesty was proclaimed: “not to remember evil” [me 
mnésikakein], that is, the evils of the Thirty (Aristotle 
1952, section xxix; Elster 1998, 9-13; Loening 1987; 
Loraux 1997, 174; Nippel 1997). It was clear that civic 
peace could not be restored, much less democracy, 
without a public forgetting, the “institutional oblivion” 
of the past. Memory-justice as retribution divides and 
poisons; it impedes renewal. It seems that, as the 
Athenian democrats recognized, forgetting is an essen- 
tial part of politics because it is an ally of peace and 
unity, just as the Furies, vengeance, and memory- 
justice are antithetical to civic peace (Loraux 1988a, 23, 
30; 1988b, 13; 1997, 38). 

Because it effects a rupture with the past, political- 
judicial forgetting can be an instrument of peace 
(Simondon 1982, 45). It seeks to draw a thick line 
between past and present and to debar memory-justice 
“from keeping . . . [the] Wounds greene” (Bacon 1965, 
104), from forcing the present to drink from the bitter, 
polluted cup of the past, from sacrificing renewal to 
revenge. For the authors of the Athenian reconciliation 
agreement, peace and unity were great goods and more 
important than the search for full retribution. The 
justice of the demands for retribution was not in 
dispute. Rather, at issue was a choice (to put it rather 
too starkly) between peace and justice when, as is often 
the case, the two lead us in quite different directions. 
Forgetting answers one vital need of a community, 
especially after deeply divisive political traumas; it 
allows an end, a final point, to strife. Ernest Renan 
(1992, 41) is surely right that nations are rooted in both 
the things they remember together and the things they 
have willed to be forgotten. 

Amnesty is not mercy but a “mutual forgetting” (ein 
gegenseitiger Akt des Vergessens) (Schmitt 1995, 219) 
that effectively precludes the evocation of the perpe- 
trators’ deeds.14 It is particularly useful in ending civil 


13 An equal number of votes are cast for and against acquittal of 
Orestes (Aeschylus 1971, lines 795-6). Even in the city of institu- 
tionalized justice, the Furies are not weak. 
14 The English word “amnesty” has its root in the Greek amnestia, 
“the forgetfulness of wrong” or “oblivion.” (See Plato, Menexenus, 
239c: “but those exploits . . . which lie still buried in oblivion 
[...] 21) remarks that the use of amnesia for 
cases was a much later employment. See also Loraux 
1988a, 73.4 Bacon (1951, 212) uses the term in that sense and ties 
it to reconciliation: “Reconcilement is better managed by an am- 
nesty, and passing over that which is past.” Amnesty is a legal/ 
political forgetting that allows the wrong done to pass into (legal) 
oblivion. 

“Amnesia” is also rooted in the Greek term for forgetting, but the 
relationship with amnesty is not merely etymological. There have 
been efforts to combine the goal of reconciliation via amnesty with 
the duty to remember, with a rejection of amnesia, such as the 
“amnesty—yes, amnesia—no” proposal made during the Central 
European transitions to democracy. Michnik (1997) states that this 
strife, when the need to restore unity is paramount 
(Gacon 1994, 104; Ricoeur 1995, 205; Schmitt 1995, 
218). Hence the political importance of amnesties 
following the defeat of the Paris Commune, in Ger- 
many and formerly occupied Europe after World War 
II, and in France in the wake of Algerian independence 
(Frei 1997; Gacon 1994, 98; Rousso 1990, 67-8, 145- 
6). Of course, the process of forgetting extends beyond 
such legal-political actions as amnesty. Amnesia, a sort 
of informal collective forgetting and the crafting of 
conciliatory counternarratives of the past, also may 
play an important role. According to Hans Enzens- 
berger (1990, 80), after World War II, Europeans took 
shelter behind a great amnesia; this occurred generally 
but especially in Germany, where the National Socialist 
period was left in silence; and in France, where De 
Gaulle managed to weave a tale of Vichy as a paren- 
thesis in French history and of a national resistance to 
German occupation (see also Arendt 1989; Judt 2000, 
299; Lagrou 2000). 

There can be little doubt that a combination of 
formal amnesty and informal amnesia can serve cur- 
rent interests: the pursuit of civil peace above all as well 
as the desire of perpetrators to escape justice, of 
collaborators not to be stained by their acts of betrayal, 
and of passive bystanders to find redemption in a story 
of courage and moral uprightness. Equally certain is 
that amnesty and amnesia can be instruments of polit- 
ical actors in the present who seek to install one 
memory or expunge another (Finkielkraut 2000, 37-52, 
135-6). Yet, the motivating presence of current inter- 
ests in the politics of amnesty and amnesia should not 
lead us to stop with a facile view of the “constructed” 
or “imagined” character of memory and forgetting. 
Rather, memory and forgetting speak to something 
more elemental in our lives in common than the mere 
opportunism of those who want to save themselves or 
profit from a regime change. 

Amnesty overturns the moral imperative of memory- 
justice so that we may be oriented to the present and 
future rather than (or not solely) to the fulfillment of a 
debt to the victims or to the carrying out of the 
demands of justice for its own sake. It asks: “What is 
the practical importance now of a judgment that injus- 
tice occurred in the past (Waldron 1992, 4, 27)?” As 
Raúl Alfonsín (1993, 18) argues, the consequences of 
punishing the past must be weighed against the present 
and future, and too high a present cost must be 
rejected (p. 18). The thick line between past and 
present that amnesty permits should not be seen as the 
avoidance of moral judgment but as the view that 
justice needs to be seasoned not so much by compas- 
sion for its own sake as by a concern for the future, the 
goods of which may be lost in a too strict adherence to 
the demands of the iron tracks of memory-justice. For 
example, the amnesty in Uruguay following the military 





formula turned out to be too difficult for the people of the demo- 
cratic opposition. Why? Perhaps the imperative to remember cannot 
so easily be confined to extrajudicial remembrance; in [...] 
crimes to legal oblivion, a wrong is committed that symbolic gestures 
of remembrance, such as flowers on graves, and monu- 
ments, cannot overcome (Jankélévitch 1986, 61). 
dictatorship was justified as a moral decision “to give 
priority to the possibility of a future of agreement over 
a past of division” (Senator Manuel Flores Silva quoted 
in Wechsler 1990, 184). The then president of Uru- 
guay, Julio María Sanguinetti, offered similar argu- 
ments and invoked Renan’s observation on the need 
for forgetting (pp. 175, 191). 

It may just be that amnesty and, in general, a 
displacement of the grievances of the past are of 
particular importance in transitions to (or restorations 
of) democracy (Huntington 1991, 214; Minow 1998, 14; 
Smith 1997, 11). The degree of civility, trust, and 
tolerance necessary for a democracy to flourish would 
be jeopardized, it is argued, by too overzealous a 
pursuit of either prosecutions or disclosures. Correc- 
tive justice, the memory-justice of retribution, Acker- 
man (1992, 3, 69-71, 88) writes, divides a people into 
the guilty and innocent, collaborators and resisters. He 
views it as a negative device, focused on individual 
culpability rather than institutional reform, oriented 
toward the past, and one that can easily have counter- 
productive consequences for the new democracy. In 
Ackerman’s account it is not only prosecutions that are 
wrongheaded but also policies of disclosure: The secret 
police files should be burned, he urges, not made 
available to the public as in postunification Germany 
(see also Nino 1996, 128ff). The well of bitterness and 
division must be drained. 

In this view, amnesty, political-judicial forgetting, is 
an answer to the Furies, to the vengeful face of 
memory-justice (Ricoeur 1997, 452). As in the Eu- 
menides, the claims of retributive justice are not dis- 
puted as such but seasoned or moderated. An amnesty 
does not say that (retributive) punishment is unde- 
served but that it ill serves other purposes (e.g., recon- 
ciliation). The Furies should be tamed or, better, 
brought into the city’s institutions and their claims 
moderated by other imperatives, one of which is to ask 
how retribution will affect the present and future. The 
demands for justice by victims of past wrongs must be 
weighed against the claims of today and tomorrow. 
Humans live in all three temporal registers, and all 
insistently call to us, depend on us, and insist that we 
attend to their just claims. 

For retributive justice-memory, to put aside the past 
in the name of present and future needs is unaccept- 
able. That would allow the erosion that accompanies 
time and would complete or ratify the efforts of the 
perpetrators to erase their victims (Jankélévitch 1974, 
247; 1986, 48). Amnesty and amnesia, as forms of 
forgetting, are not positions that take their bearing 
from the strictures of morality and justice; they are 
ways of yielding to the extramoral course of becoming. 
As agents capable of acting under self-given norms, we 
ought to resist forgetting and not simply collaborate 
with it (Améry 1977, 115, 116). 

Memory-justice-retribution, as the voice of the si- 
lenced victims, commands: “Do not forget truly it is not 
in your power/to forgive in the name of those betrayed 
at dawn” (Herbert 1977, 79; see also Michnik and 
Havel 1993, 25). “Not in your power”: Can anyone 
other than the victims extend the gesture of reconcili- 
ation that is amnesty?15 Samuel Pisar (1986, 72) writes 
of the Holocaust: “Who will pardon? Who could 
pardon? The dead? The survivors? The rest of human- 
ity? No one” (see also Soyinka 1999, 26, 28). And 
because amnesties typically forget the victims and the 
crimes committed against them (Ertel 1993, 121), they 
may outrage our moral sensibilities for two reasons. 
They raise the question about who can forgive and 
forget, and they seem to violate our debt of fidelity to 
the victims, which is redeemed through remembrance. 

A core vulnerability of strategies of willed forgetting 
is that injunctions not to remember evil may put crimes 
into the civic shadows, beyond judicial evocation, but 
cannot undo the fact that they were done (Jankélévitch 
1974, 289). The irreversibility of time means that past 
injustices both cannot be directly undone and cannot 
be made to disappear entirely. The past and our 
memory of it always threaten to resurface. The Furies 
sleep but can be awakened (Ricoeur 1995, 208, quoting 
Hegel). Semprún (1994, 297), cited earlier on the need 
to forget for the sake of life, also states: “Despite the 
detours, the deliberate or involuntary censoring, the 
strategy of forgetting ... despite all the pages written 
to exorcize this experience ... despite all this, the past 
preserved the shattering power of the smoke and snow 
[of Buchenwald], just as on the first day.” De Gaulle’s 
strategy of Vichy as a parenthesis overlaid with a tale of 
heroic national resistance unraveled when the true 
extent of collaboration and passivity during the “dark 
years” was brought from the shadows by films such as 
The Sorrow and the Pity. 

No society, Benjamin Stora (1991, 319) writes in 
relation to a later French national trauma, the war in 
Algeria, can live forever in amnesia. Past victims and 
perpetrators, or our memory of them, have the ability 
to return almost unbidden. The Furies are always 
there, waiting to return and rouse us from our sleep, 
should we become oblivious to the demands of mem- 
ory-justice.16 Since forgetting cannot erase what has 
been done, amnesty and amnesia are at best provi- 
sional means to deal with the past (Jankélévitch 1974, 
289). Our embeddedness in the present and future, and 
their just claims on us, are not exhaustive of the 
universe of justice. Perhaps attempts to make them so, 
to draw a thick line between past and present through 
amnesty and amnesia, only serve to awaken the Furies 
and in so doing to show how futile is the attempt to 





15 We consider forgiveness something that can rightfully be given 
only by the person (or community) wronged. As Jeffrie Murphy 
(1988a, 21) observes, it would be odd to say that those born after 
World War II, and with no connection to the victims of the 
Holocaust, forgive Hitler for those crimes. 

16 Stephen Holmes (1995) remarks on the waning passion for 
decommunization in much of Eastern Europe and on the reason- 
ableness of that lack of interest. The trajectory of the passions 
surrounding the Holocaust, from neglect or indifference in the 
immediate postwar years to a powerful [...] at the start of the 
new millennium, suggests that the quiet of the Furies should not be 
mistaken for their [...]. Michnik (2001) comments on the 
furor in Poland over Jan Gross’s (2001) book on the massacre of 
Jews at Jedwabne: “It is a serious debate, full of sadness and 
sometimes terror—as if the whole society was suddenly forced to 
carry the weight of this terrible 60-year-old crime.” 
draw such a line. The claims of memory-justice, al- 
though vulnerable to our pressing current needs and 
concerns for the future, can always transgress the 
boundaries we erect in the vain attempt to be done with 
the past. 


LAW’S LIMITS 


I have analyzed the relationship between memory and 
justice along three paths: retribution, truth preserva- 
tion, and amnesty. I also have looked at the legal or 
quasi-legal instruments through which memory-justice 
does its varied work. I now want to explore the limits of 
the legal expressions of memory-justice. Do they ex- 
haust the work that memory-justice seems to demand 
of us? Or do they leave a sense of incompleteness, as if 
law’s empire cannot reach as far as memory-justice 
requires? According to one interpretation, it is from 
the standpoint of revenge, or “Wilde Justice” (again to 
use Bacon’s phrase), that the work of the law in dealing 
with the past is judged frustratingly incomplete in 
wiping the slate clean (Shklar 1990, 93-4). 

I want to suggest that this incompleteness is the 
recognition of something important left undone. This 
may occur even when significant legal action is taken 
against the perpetrators. 

In part, of course, the incompleteness may simply 
reflect unfinished legal business, such as the concern 
that many Nazis escaped justice altogether or were 
dealt with too leniently when Germany went from 
being a defeated enemy to an ally in the Cold War. 
Similarly, there is the concern that some French offi- 
cials who collaborated with the Germans in the perse- 
cution of Jews and members of the Resistance were 
never held to account. Yet, the continuation of some- 
thing like the “Vichy syndrome” not only in France but 
in postcommunist East Europe and Germany suggests 
that the legal expression of memory-justice does not 
exhaust the claims of the past on us (Judt 2000, 308; 
Rousso 1990, 18). Indeed, the continuing politics of 
memory can be accounted for, at least partially, by the 
fear that a completion of the trials (and the passing of 
the generation that witnessed or perpetrated those 
events) will become an occasion for closure, on the 
pretext that all that can be done has been done. For 
this among other reasons, Alain Finkielkraut (1989) 
entitled his book on the trial of Klaus Barbie, the 
former head of the Gestapo in Lyon, La mémoire vaine 
(translated as Remembering in Vain). 

Trials can help draw a thick line between past and 
present, and the crimes of the past then can become 
historic rather than present (Finkielkraut 1989, 12-3). 
Consider the concerns raised about building monu- 
ments and museums to the victims of mass state crime. 
The worry is that the object, the monument or inscrip- 
tion, becomes in effect the final gesture of compliance 
with the demands of memory-justice. Those demands 
are transformed into history, the past perfect, and 
cease to be part of the lived world of justice (Young 
1993, 28-37). Remembrance itself, in other words, can 
sometimes be used to quit the debt once and for all, to 
throw off the weight of the past. Henri Raczymow 
(1985, 213-4) writes of Mathieu and his efforts to be 
done with the memory of his sister, Esther, who 
committed suicide years earlier: “My son [Mathieu 
says] will be saved from the past. He will carry no 
stigma from it. I will never speak to him of Esther. ... 
My book will have effaced her. Strangely, it required 
words for that. Words, and not silence. ... Esther is 
buried. Good and buried. Her tomb can be seen there, 
somewhere in Bagneux cemetery. Her name is on the 
tomb, and her body inside it. Localized. Esther is no 
longer in me. I expelled her.” The memoir and the 
tomb are not ways of preserving the memory of Esther 
but of freeing her brother (and his son) from her. So, 
too, trials, monuments, and days of remembrance can 
free communities from a burdensome past. 

Law’s empire does not reach as far as that of 
memory-justice. I want to sketch in some detail the 
distance between them. I will begin once more with the 
trial or retributive face of memory-justice. From the 
standpoint of the law, criminal trials of former regime 
officials may involve difficulties, such as retroactivity 
and statutes of limitation. From the vantage point of 
memory-justice, trials are hobbled by two related dif- 
ficulties: their focus on guilt and their definitiveness. 
They may be focal points for shaping collective mem- 
ory, and it may well be that the public face they give to 
memory-justice is essential to the work of justice (Osiel 
1997, 6; Smith 1997, 19). But there is at once an 
incompleteness and an excessive definitiveness about 
them, properties that help mark out their limits. 

First, as legal events that involve criminal charges, 
trials of the perpetrators of state crimes must inevitably 
look to individual accountability (Cohen 1995, 47). In 
the case of a regime with a wide grey area of collabo- 
ration and passive acquiescence or even support, that 
creates a very narrow focus: The weight of responsibil- 
ity rests only on those held to be direct authors of the 
crimes (Osiel 1997, 61). Part of the past is selected out 
by legal processes as the locale of accountability and 
may only partially overlap with the sphere of responsi- 
bility as understood in memory-justice (French 1984; 
Williams 1993, 63-5). Although the Nuremberg trials 
made clear German accountability for the war and its 
crimes, they did not condemn the German people as a 
whole, only those held to be directly responsible for 
crimes (Jaspers 1965, 98, 100). The concern with guilt, 
understood as a “clear line” and “threshold” concept 
(French 1992, 65-6), focuses attention on the discrete 
actions of an individual rather than on a responsibility 
more diffuse (but perhaps more important, politically 

Memory-justice, in its ingathering of the past, seems 
to need something else, the recognition of a responsi- 
bility that includes individual accountability but 
reaches beyond it to something not reducible to guilt. 
What is wanted, we might say with Habermas (1997), is 
a self-understanding by members of a political commu- 
nity that they are co-responsible even if not legally 
accountable. This we could call, in the aftermath of 
mass crimes, a sense of shame that emerges not from 
direct authorship of actions but from membership in a 
community implicated in these deeds (Morris 1976, 
135; Schlink 1988, 59; Williams 1993, 80). Consider 
Albie Sachs, a South African antiapartheid activist 
severely wounded by a security service booby trap. 
When approached by an Afrikaner who asked him for 
forgiveness, Sachs (2000, 222-3) was perplexed: Since 
we do not accept collective responsibility, why should 
this man feel guilt and apologize?17 Perhaps, as Stanley 
Cohen (1995, 40) suggests, he was expressing a quite 
appropriate shame, not guilt. Memory as shame seems 
to be part of the demand of memory-justice, and it 
ranges far beyond the issue of guilt at play in criminal 
trials. During her 1950 visit to Germany, Hannah 
Arendt (1980, 45) was struck by the absence of a sense 
of shame, of the recognition of community responsibil- 
ity. The lack of shame also may explain President 
Mitterrand’s reluctance in 1992 to acknowledge any 
French co-responsibility for the deportation of Jews 
from France during World War II. 

Shame and guilt differ, as does responsibility by 
virtue of membership in a community versus individual 
authorship. Communities exist across time, whereas 
the actionable deed occurs at a discrete moment; 
shame endures, whereas legal action and punishment 
of the guilty provide closure. Trials and retribution, no 
matter how thoroughgoing in the prosecution of the 
accused, do not reach a recognition of broad and 
enduring responsibility. Furthermore (and therefore), 
a trial offers premature closure: The guilty leaders are 
punished, and we are freed of any burden of responsi- 
bility. This may partially explain why the legacy of 
World War II still disturbs Europe, despite trials and 
purges of those most directly accountable. The sense of 
incompleteness is not principally due to any perpetra- 
tors left unpunished but to the limited way that legal 
action weaves the past into the national biography, into 
the memory of a community. 

Second, another way for the memory-justice of law 
to secure the past is through truth-telling, by saving the 
deeds, victims, and perpetrators from the darkness of 
oblivion and falsehood. But the truth and memory of 
the witness as well as the collective memory of the 
community may only partially overlap with the truth 
sought in legal or quasi-legal proceedings (Yerushalmi 
1988, 16). Some of this difference is suggested in 
Tillion’s (1973, 203-4) comments on the postwar trial 
of her former Ravensbrück persecutors, in which she 
writes about the tremendous distance between her 
experiences and the descriptions she heard in court. In 
Claude Lanzmann’s Shoah there is a sharp contrast 
between the testimony/memory of witnesses and the 
remarks of the sole historian to appear in the film, Raul 
Hilberg. As Tillion (1988, 12, 306) observes, part of the 
explanation lies in the difference between first-person 
memory and the transmitted, objectified past of the 
courtroom or historian. At one level, of course, we can 
describe these differences in terms of the intimacy of 


17 The “Epilogue” to Sachs’s (2000, 228-9) account of his part in the 
struggle for democracy in South Africa offers a rich discussion of 
retribution and forgiveness. Sachs notes that many of the victims of 
apartheid were not interested in [...], although 
families and people outside South Africa often seemed to be focused 
on just that. 
the remembered event for the (first-person) witness 
and the emotional content of trauma (Lanzmann vir- 
tually had to compel some of his witnesses to speak). 
Some might also suspect personal recollection of being 
partial and not always accurate. The courtroom, or the 
historian’s study, aspires to detachment, to an objective 
view, and the drive for truth is directed by the desire to 
explain or to assign culpability. 

But the differences lie not just in the opposition 
between emotion-laden intimacy and detached objec- 
tivity. Two related dimensions are of special impor- 
tance. An essential part of witnessing is the desire to 
keep the victims of mass crime in the light of truth. 
Truth-memory is, in this sense, an act of fidelity, 
typically to members of one’s own community who 
have been lost. Memory-truth is not a gesture of fidelity 
to just anyone; it is faithfulness in the context of a 
community, whether a marriage, a religion, or a nation. 
The truth of remembrance differs from the truth of law 
and history in that its core is fidelity to the victim 
(Bédarida 1993, 7). And that fidelity occurs within the 
context of a shared something, a life in common across 
generations. Remembrance serves to reintegrate the 
victims into their community and to restore that com- 
munity after the rupture induced by the crime. Faith- 
fulness and the (re)integration of the community are 
two sides of the same phenomenon. 

Memory-truth serves not to establish that such-and- 
such took place, that x was its perpetrator and y its 
victim, but to reintegrate the lost into the narrative 
unity, the collective memory, of the community. In this 
role, memory-justice-truth is the ingathering of the past 
for the sake of the continuity of the community across 
time. What is sought is not an (historical) explanation 
or a (legal) determination of responsibility so much as 
the restoration of unity to a shattered community. The 
truth at issue is the truth of that unity or identity, 
secured through remembrance. Remembrance may be 
the recovery of the lost, or the securing of those who 
risk being lost to forgetting, or the recognition and 
repudiation of a criminal regime as a precondition of a 
new life in common. Memory’s truth, as it appears in 
such contexts, is uneven, jagged, and tied to collective 
memory and the community’s autobiography. Great 
trials, and especially those in the wake of regime 
changes, can help shape that memory-identity; indeed, 
they can help awaken it. Memory-justice and memory- 
identity are not two utterly distinct phenomena; they 
are an ingathering of the past of justice and injustice as 
part both of doing justice and of affirming the continu- 
ity or identity (or rupture and new identity) of a 
community. 

Serge Klarsfeld’s testimony at the trial of Klaus 
Barbie illustrates memory-justice at work in both reg- 
isters, legal- and the memory-justice of a community 
identity, and their overlapping but differentiated char- 
acters. Klarsfeld’s work to apprehend Barbie and his 
testimony belonged to the world of law, but on the 
stand he read the names, ages, and some of the 
correspondence of the Jewish children of Izieu, de- 
ported to Auschwitz and murdered there. This went 
beyond giving evidence, although it referred to one of 
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the most tragic of Barbie’s war-time crimes in France. 
Its substance was not, Klarsfeld (1986, 132) said, a 
counsel’s speech (plaidoirie) but an “introduction”: “It 
seemed important to me to have all these children 
enter into the court.” In his address to the court, 
Klarsfeld describes the proceeding as an act of justice 
bound up with an act of memory, and the purpose is “as 
much to remember, in actions, as it is to deny a 
pleasant old age to the executioner of the children of 
Izieu” (p. 138). 

It was important to introduce and name these chil- 
dren, lest “the winds of forgetting extinguish the 
names” (Aichenrand 1993, 149). Lanzmann (1986, 55) 
refers to Klarsfeld’s speech as “the act of naming 
[nommation]. He restored their proper name to each 
child of Izieu.” The naming of the victims, so that they 
will not be lost to oblivion, is a way for memory-justice 
to fulfill its obligation to the dead. “After April 23, 
1945 [Tillion’s liberation from Ravensbrück], . . . I was 
in mourning, and I used this vast assembly of memories 
[in the recovery camp] to gather that which they 
retained about all those whom we had lost. At least 
their names, their only sepulchers” (Tillion 1988, 11). 

Having the children “enter into the court” makes the 
dead present, and Klarsfeld’s language draws on an- 
other important dimension of memory-justice. It is 
apparent in both the classical world (e.g., the dead 
Clytaemnestra appears in the Eumenides) and in the 
modern, as when André Malraux addresses Jean Mou- 
lin in the present tense as his ashes are interred in the 
Panthéon.18 Memory-justice in this role collapses the 
distance between past and present; it is, as Lanzmann 
puts it, an “incarnation” (Lanzmann 1986, 51). The 
victims are restored, if only in memory and speech, to 
their place in the community, and the living are re- 
minded of their continuity with the past: “Remem- 
brance is the proof itself of filiation” (Ertel 1993, 22). 
The naming and incarnation aspects of Klarsfeld’s 
testimony make plain the second nonjudicial register 
(Rousso 1990, 247). The memory work being done in 
his speech has less to do with demonstrating Barbie’s 
guilt than with the observance of a commitment to the 
victims; it is an evocation of their presence that affirms 
the community’s identity across time and even through 
such violent traumas. 

At the same time, the court is not merely a platform 
for a speech entirely foreign to its purposes. It is not a 
trivial fact that this testimony was delivered in a court 
of law, for it speaks to the proximity of memory-justice 
in its capacity as law and as bound up with bearing 
witness and identity. Both do the work of bringing the 
past into the light of truth through remembrance. The 
Barbie trial was an attempt to wrest that past from the 
“shroud” of history, of the past perfect, by drawing (via 
the memory of justice and injustice) his crimes into the 
“judicial present” (Finkielkraut 1989, 12-3). Klarsfeld 
contributed not only to that attempt but also, in 
naming and restoring the dead to their community 
(and its own past to the living community), to the other 


18 “Entre ici, Jean Moulin, avec ton terrible cortège” (Malraux 1971, 
135). 
tasks of memory-justice. His testimony is at once 
evidence, part of a judicial proceeding to prove an 
individual’s guilt, the fulfillment of a debt, the bringing 
into the light of truth, and the assertion of a commu- 
nity’s identity through time. 


CONCLUSION 


We can begin to understand why the extensive use in 
the twentieth century of trials, truth commissions, and 
amnesties in the wake of dictatorial and totalitarian 
regimes failed to bring closure. The memory of justice 
works in a number of ways. Within the law’s orbit, it 
can have the face of retribution and punishment, of 
exposure and truth-telling, or of public amnesia. Var- 
ied (and overlapping) imperatives drive these phenom- 
ena: to fulfill a debt to the dead by punishing the 
perpetrators; to preserve justice; to save victims from 
the second death of forgetting; to put our present and 
future goods over our ties to the past. All these, in their 
different ways, display the intimate bond between 
memory and justice. At the same time they point to the 
limits of law’s empire. 

Memory-justice demands more than what a court or 
truth commission can provide. Freed from the con- 
straints of determining individual guilt, memory-justice 
finds the conviction of only the direct perpetrators 
and/or their political masters to be too narrow an 
understanding of responsibility. Freed from prohibi- 
tions on the retroactive application of law, memory- 
justice raises its voice of condemnation (Soyinka 1999, 
14). Concerned lest trials draw so thick and final a 
concluding line between past and present that we are 
thereafter absolved of the work of remembrance, mem- 
ory-justice refuses to let this past become simply the 
historical past, the past perfect. Finally, in looking for 
the truth about the past not for its evidentiary or 
explanatory value, memory-justice seeks to make the 
past present, to bring the lost back into our midst. In 
this way we do justice to the dead and affirm the reality 
and enduring quality of the community we share with 
them. The demand for a recognition of wide co- 
responsibility across generations, of shame (to give it 
its title); for the genuinely imprescriptible character of 
these sorts of crimes, with the result that condemnation 
does not end with the conviction of one or many 
perpetrators; of memory-justice as the core of identity 
across time and even through the most radical ruptures 
in a community’s life: these three faces of memory- 
justice mark out the limits of a legal overcoming of the 
past. 

I began by observing that we could understand the 
need to keep crimes and their victims among the 
unlost, to guard them against the oblivion of forgetting, 
as one of the defining voices of justice as remembrance. 
I also suggested that this memory-justice at once 
informs core judicial practices and ranges beyond them 
in a manner that leaves their mode of closure incom- 
plete. I end by drawing again on Sophocles and Ae- 
schylus, who instruct us not only in the vital place that 
memory occupies at the heart of justice and its struggle 
to keep the victims, crimes, and perpetrators among
the unforgotten but also in the meaning of living
entirely in the shadow of memory-justice. More visible
to them than to us, perhaps, the reality of memory- 
justice was something tangible, a duty that restored, 
preserved, and acknowledged the just order of the 
world. At the same time, they underscored the way in 
which, in the shadow of remembrance, other human 
goods can wither, goods located in the temporal regis- 
ters of present and future. This latter lesson is impor- 
tant, but it is one with which we are, in our way, 
familiar. We still argue over the virtues of forgetting 
and letting the past go. We still frame the debate over 
judicial answers to state-led mass crime in the language 
of peace versus justice. I have chosen to emphasize 
what is ours, too, but with which we are perhaps less at 
home: the intimacy of memory’s bond with justice, not 
as obsessional or as a syndrome but as a face of justice 
itself, looking to preserve the perpetrators, their deeds, 
and victims among the unforgotten. 

We show that the butterfly ballot used in Palm Beach County, Florida, in the 2000 presidential
election caused more than 2,000 Democratic voters to vote by mistake for Reform candidate Pat
Buchanan, a number larger than George W. Bush’s certified margin of victory in Florida. We use
multiple methods and several kinds of data to rule out alternative explanations for the votes Buchanan
received in Palm Beach County. Among 3,053 U.S. counties where Buchanan was on the ballot, Palm
Beach County has the most anomalous excess of votes for him. In Palm Beach County, Buchanan’s
proportion of the vote on election-day ballots is four times larger than his proportion on absentee
(nonbutterfly) ballots, but Buchanan’s proportion does not differ significantly between election-day and
absentee ballots in any other Florida county. Unlike other Reform candidates in Palm Beach County,
Buchanan tended to receive election-day votes in Democratic precincts and from individuals who voted for
the Democratic U.S. Senate candidate. Robust estimation of overdispersed binomial regression models
underpins much of the analysis.


Beginning on election day November 7, 2000,
Palm Beach County (PBC), Florida, attracted 
national and eventually international attention 
because thousands of voters in the county complained 
that they had difficulty understanding the now infa- 
mous butterfly ballot. As a result, they claimed that 
they had cast invalid or erroneous presidential votes. 


Lawyers working for the Democratic Party reportedly 
collected 10,000 affidavits sworn by voters with complaints
about some aspect of their election-day experiences
in the county (Associated Press 2000b; Firestone
2000a, 2000b; Van Natta 2000; Van Natta and Moss 
2000). Shortly after election day, eleven groups of PBC 
voters filed independent lawsuits seeking relief, claim- 
ing they and others had made mistakes in their votes 
for president because of the confusing format of the 
ballot.1 Many of them stated that they had intended to
vote for Democratic candidate Al Gore but by mistake 
chose Reform candidate Pat Buchanan. The number of 
votes involved was more than enough to have tipped 
the presidential vote in Florida from Republican can- 
didate George W. Bush to Gore, thus giving him 
Florida’s 25 electoral votes and the presidency.2

PBC is a heavily Democratic, politically liberal 
county that conventional wisdom says should provide 
few Buchanan votes. Two days after the election, Bay 
Buchanan, Pat Buchanan’s sister and campaign man- 
ager, said “she was startled to hear Bush strategist Karl 
Rove argue Thursday that Buchanan has strong sup- 
port in a county where his campaign never bought an 
ad and never paid a visit” (Garvey 2000).3 Yet, the 


1 The cases filed in the Fifteenth Judicial Circuit of Florida, West 
Palm Beach, were CL 00-10965, CL 00-10970, CL 00-10988AE, CL 
00-109922AF, CL 00-11000AH, CL 00-11064AH, CL 00-11098A0, 
CL 00-1146AB, CL 00-1240AB, CL 00-1290AB, and CL 00 
11302A0. These were consolidated by Administrative Order No.
2.061-11/00. Texts of the filings and of the Fifteenth Circuit Court
rulings in the cases are available from
http://www.pbcountyclerk.com/.

2 Bush received 271 electoral votes, one more than needed to win,
and Gore received 266. One Elector pledged to Gore from Washington,
DC, left her ballot blank, reducing Gore’s count from 267

2001; Stout 2000). 

3 The story further observes that “longtime Reform members in the 
state described a party in ‘disarray’ with little organization, much less 
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FIGURE 1. 
county supplied 19.6% of Buchanan’s votes in Florida.
In contrast, only 5.4% of his Florida votes came from
PBC in the 1996 Republican presidential primary,
which did not use a butterfly ballot.4

The butterfly ballot, shown in Figure 1, was an
innovation of Theresa LePore, Supervisor of Elections
for PBC.5 The distinctive format was used only in PBC
and only for election-day ballots for president. It is a
“butterfly” because two columns of names of candidates
(the wings of the butterfly), all for the same
office, sandwich a single column of punch holes between
the names. These punch holes are alternately for
the left-hand and right-hand side of the ballot. Thus, 
the first valid punch hole (identified on the ballot as 
#3) is for Bush, the first candidate on the left-hand 
side. The second valid punch hole (identified on the 
ballot as #4) is for Buchanan, the first candidate on the 
right-hand side. On the left, however, the second 


a groundswell of support for Buchanan in a place even he concedes
is not his base.”

4 In 2000, Buchanan received 0.787% of the presidential vote in PBC 
while garnering only 0.3% of the overall Florida presidential vote. In 
contrast, Ross Perot, the Reform candidate for president in 1996, 
received only 7.7% of the PBC vote and garnered 9.1% of the Florida 
vote. These data are from the Florida Department of State. 

5 Reportedly, LePore “split the names over two pages to make the 


again” (Engelhardt 2000). 
Merzer and Miami Herald (2001) describe how LePore went about

the ballot and many other defects in the administration of 
the election in Florida. 
candidate listed is Gore, and someone who scanned
down the left-hand column without looking to the right
could mistakenly conclude that the first two punch
holes corresponded, respectively, to Bush and Gore.
Having made such an incorrect inference, a Bush voter
would still be likely to punch the first hole, but a Gore
voter might mistakenly punch the second and vote for
Buchanan.


Sinclair et al. (2000) report experimental evidence
that a double-column ballot format like the one used in
PBC can be more confusing and cause more voter
errors than a single-column ballot. Other published
research on the effects of ballot design is scarce and
does not provide much guidance regarding the errors
the PBC butterfly ballot may have induced (Campbell
and Miller 1957; Darcy 1986; Hamilton and Ladd
1996).

Did the butterfly ballot cost Gore the presidency? 
The lawsuits filed by citizens of PBC were thrown out
because the Supreme Court of Florida ruled that the
ballot was not illegal,6 but the ruling neither depended
upon nor implied anything about the ballot’s effect on 


6 The court stated: “Even accepting appellants’ allegations, we
conclude as a matter of law that the Palm Beach County ballot does
not constitute substantial noncompliance [with the statutory] requirements
mandating the voiding of the election” (Supreme Court of
Florida, Fladell, et al. v. Palm Beach County Canvassing Board, etc. et
al., Case Nos. SC 00-2373 and SC 00-2376). The cases did not
progress to hearings regarding the facts.
voter behavior. The court did not rule on whether Gore
lost the election because of the ballot. Our goal is to 
investigate that question by determining whether the 
butterfly ballot caused several thousand Gore support- 
ers to vote mistakenly for Buchanan. 


OVERVIEW 


Among scholars who posted an analysis on the Internet 
shortly after the election, a consensus quickly formed 
that the vote for Buchanan in PBC was anomalously 
large. Of the 3,407 votes that Buchanan received in the 
initial, uncertified count of PBC ballots, a typical 
estimate was that he received about 2,800 more votes 
than were to be expected based on voting patterns 
elsewhere in Florida.7 The number of apparently accidental
votes for Buchanan exceeded Bush’s official
margin of victory in Florida, which was only 537 votes
more than Gore.8 But the early analysis did not show
unambiguously that the butterfly ballot was the cause 
or that the erroneous Buchanan votes would otherwise 
have gone to Gore. 

To test whether Democratic voters mistakenly voted 
for Buchanan because of the butterfly ballot, we use 
multiple methods and diverse data sources to rule out 
alternative explanations for the Buchanan vote in PBC. 
First, we show that Buchanan’s vote was anomalously 
high on election day. Specifically, we prove three key 
facts. (1) Anomalies in the Buchanan vote as large as 
the one in PBC did not occur in any other county in the 
country in 2000. (2) The vote for the Reform candidate 
in the previous presidential election, Ross Perot, was 
not anomalous in PBC in 1996. (3) PBC voters who 
used election-day ballots (the butterfly format) re- 
corded unusually high support for Buchanan, while 
those who used absentee ballots (no butterfly format) 
evinced the expected level of support for him. 

Second, we show that Buchanan’s excess of support 
was almost entirely from Democratic voters. Here we 
demonstrate two key facts. (1) The unusually high level 
of support for Buchanan was concentrated in precincts 
with high levels of support for Democratic candidates 
for other offices and not in precincts with high levels of 
support for Reform candidates for other offices. (2) 
Individual ballots confirm these patterns: Democratic 
voters (as measured by votes in the U.S. Senate
election) who voted on election day were much more 
likely to support Buchanan than were Democratic 
voters who used absentee ballots. We conclude that the 
butterfly ballot caused at least 2,000 Democratic voters 
to vote mistakenly for Buchanan. 


7 In Brady et al. (2001) we list the early posters, including ourselves.
Brady posted analysis on November 9, and Wand, Shotts, Sekhon,
Mebane, and Herron posted on November 11. Lists of empirical
work posted on the Internet through the end of November 2000
appear at http://www.bestbookmarks.com/election (created by
Jonathan O'Keeffe), http://www.sbgo.com/election-htm (created by 
Sebago Associates), and http://madison.hes.cmu.edu (created by 


automatic recount across Florida (Wakin 2000).
Our conclusions depend upon repeated compari-
sons: across counties, between election-day ballots and 
absentee ballots, and across precincts. To determine 
whether the Buchanan vote is anomalous, we compare 
the actual number of votes received by Buchanan in a 
county to the votes predicted by a well-specified statis- 
tical model that uses past votes and demographic 
characteristics of all the counties in the same state to 
form estimates. When actual votes deviate significantly 
from what is expected, we have what statisticians call 
an outlier. Then, to isolate sharply the effect of the 
butterfly format, we compare Buchanan’s share of the 
votes on election day to his share on absentee ballots 
across all Florida counties. Next, to determine whether 
Buchanan’s votes in PBC were concentrated in areas 
with high levels of support for Democratic candidates, 
we compare how well Democratic strength in a pre- 
cinct predicts Buchanan votes versus votes for Reform 
candidates for other offices. Finally, we use a compar- 
ison between Buchanan votes on individual election- 
day (butterfly) and absentee (nonbutterfly) ballots to 
compute our minimum estimate for the number of 
Democratic voters the butterfly ballot caused to vote 
mistakenly for Buchanan instead of Gore. 

A simple way to estimate the expected votes for 
Buchanan is to take the percentage of Buchanan voters 
in each county and regress it, using ordinary least 
squares (OLS), on past votes and demographic char- 
acteristics. Then an examination of the difference 
between the actual and expected vote might tell us
whether the county was anomalous. This procedure 
might be misleading, however, because of three major 
statistical problems. First, the counties differ in size
(the smallest have only a few hundred voters, while the
largest have millions), leading to the problem of
heteroskedasticity. Second, heteroskedasticity is exacerbated
because the expected proportion of votes for
Buchanan varies over counties. Third, because OLS is
notoriously subject to outliers, any OLS-based esti- 
mates designed to detect them are suspect. All three 
problems must be solved in order to get reliable results. 

To understand why differing county size is a prob- 
lem, consider a hypothetical example. Suppose there 
are two counties, in both of which Buchanan is ex- 
pected to receive 1% of the vote. One county has 100 
voters, so the expected Buchanan vote is one, and the 
other has 100,000 voters, so the expected vote for 
Buchanan is 1,000. In the small county a 2% excess of 
votes over the expected value corresponds to an observed
total of three Buchanan votes: $(3 - 1)/100 \times 100\% = 2\%$.
In the large county an observed vote of
3,000 for Buchanan would be required to produce the
same 2% discrepancy. But two “extra” Buchanan votes
by chance in the small county are much more likely
than an extra 2,000 in the large county. For instance,
using a simple binomial model for the count of votes
for Buchanan, the z-score for the discrepancy is
$(3 - 1)/\sqrt{100 \times .01 \times .99} = 2$ in the small county but
$(3000 - 1000)/\sqrt{100000 \times .01 \times .99} = 64$ in the large
county.
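
The arithmetic of the two-county example can be checked with a few lines of Python. This is only an illustrative sketch of the simple binomial z-score described above; the function name `binomial_z` is ours and is not part of the original analysis.

```python
import math


def binomial_z(observed: int, n_voters: int, p: float) -> float:
    """z-score of an observed count under a simple Binomial(n_voters, p) model."""
    expected = n_voters * p
    sd = math.sqrt(n_voters * p * (1.0 - p))
    return (observed - expected) / sd


# Both hypothetical counties show the same 2% excess over a 1% expected share.
small = binomial_z(observed=3, n_voters=100, p=0.01)
large = binomial_z(observed=3000, n_voters=100_000, p=0.01)

print(f"small county z = {small:.2f}")
print(f"large county z = {large:.2f}")
```

The same percentage discrepancy yields a z-score that grows with the square root of county size, which is why raw percentage differences cannot be compared across counties.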

To avoid the heteroskedasticity problem, we use a 
generalization of the standard binomial model to estimate
expected vote counts, and we use statistics that
adjust for the counts’ variances when making compar- 
isons across counties. In the model the probability of a 
vote is a function of past votes and demographic 
factors. The variance of the vote count can be greater 
than in a standard binomial model. The resulting 
model is called an overdispersed binomial model. The 
statistics we compare across counties are studentized 
residuals (Carroll and Ruppert 1988, 31-4), which are 
defined as the simple difference between observed vote 
count and expected vote count, divided by the esti- 
mated standard deviation of the observed count. 

The overdispersed binomial regression model treats 
the count of votes for Buchanan in a particular geo- 
graphic area as having the mean and variance of a 
binomial random variable, except that the variance is 
multiplied by a constant scale factor. As described in 
detail by McCullagh and Nelder (1989, 125), the scale 
factor may reflect a process in which each vote count is 
a sum of vote counts produced at lower levels of 
aggregation, where each lower-level aggregate is a 
binomial random variable. According to such a moti- 
vation for the model the binomial distribution that 
describes each lower-level count may have a distinct 
probability parameter (p. 125). In this way the over- 
dispersed binomial model recognizes the heterogeneity 
among the clusters of voters whose choices comprise 
the vote counts we analyze. We describe the overdis- 
persed binomial model in greater detail in Appendix A. 
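
A minimal sketch of the idea, under simplifying assumptions, may help fix the notation. The residual below is the simple form described in the text (observed minus expected, divided by the estimated standard deviation) and omits any refinements given in Appendix A. The scale estimate shown is the conventional Pearson chi-square moment estimator discussed by McCullagh and Nelder, not the robust estimator developed in this article. All function names and numbers are hypothetical.

```python
import math


def overdispersed_sd(n_votes, p, scale):
    """SD of a count whose binomial variance is multiplied by a constant `scale`."""
    return math.sqrt(scale * n_votes * p * (1.0 - p))


def simple_residual(observed, n_votes, p, scale):
    """(observed - expected) / estimated SD; a simplified, illustrative residual."""
    return (observed - n_votes * p) / overdispersed_sd(n_votes, p, scale)


def pearson_scale(counts, totals, probs, n_params):
    """Conventional (non-robust) scale estimate: Pearson chi-square / degrees of freedom."""
    chi2 = sum(
        (y - m * p) ** 2 / (m * p * (1.0 - p))
        for y, m, p in zip(counts, totals, probs)
    )
    return chi2 / (len(counts) - n_params)


# Hypothetical county: 100,000 voters, expected share 1%, observed 3,000 votes.
binomial_resid = simple_residual(3000, 100_000, 0.01, scale=1.0)
inflated_resid = simple_residual(3000, 100_000, 0.01, scale=4.0)
print(round(binomial_resid, 1), round(inflated_resid, 1))
```

With a scale factor of 1 the residual reduces to the ordinary binomial z-score; a scale factor of 4 doubles the standard deviation and so halves the residual.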

The most important methodological innovation in 
our analysis is that we develop new methods for robust 
estimation of the overdispersed binomial regression 
model. These guard against the possibility, which is 
very real with OLS estimation, that an outlier may 
destroy the estimation. Without robust methods, a 
single large outlier may mask other outliers (Atkinson 
1986), which means that the distorted data appear to 
be the norm rather than the exceptions. Masking also 
may occur when there are several outliers that are 
similar in magnitude. For instance, in the simplest case 
of estimating the average for a set of numbers, it is easy 
to see that the sample mean can be greatly affected by 
a single exceptionally large value. Masking occurs when 
this one very large point so greatly alters the sample 
mean that other large but smaller points do not appear 
to be as discrepant as they truly are from the mean 
value that characterizes most of the sample data. 
Similarly, masking can occur when a few exceptionally 
large or small data points, all of about the same size, 
pull the sample mean toward themselves so that none 
of them are far from the sample mean value, although 
in fact they do not represent the same statistical 
distribution as the rest of the data. Masking makes it 
difficult to determine which data points really do
deviate from what we should expect. 
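
Masking by several outliers of similar size can be illustrated with the simplest case mentioned above, estimating a center and spread for a set of numbers. The sketch below uses made-up data and a generic median/MAD rule as the robust alternative; it is meant only to show the phenomenon and is not the robust estimator used in this article.

```python
import statistics


def classical_scores(values):
    """Standardize with the sample mean and standard deviation."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)
    return [(v - mean) / sd for v in values]


def robust_scores(values):
    """Standardize with the median and the scaled median absolute deviation (MAD)."""
    med = statistics.median(values)
    mad = statistics.median([abs(v - med) for v in values])
    scale = 1.4826 * mad  # makes the MAD comparable to an SD for normal data
    return [(v - med) / scale for v in values]


# Hypothetical sample: ten typical values and three similar, very large ones.
data = [9, 10, 11, 10, 9, 11, 10, 10, 9, 11, 50, 51, 52]
threshold = 3.0

classical_flags = [v for v, z in zip(data, classical_scores(data)) if abs(z) > threshold]
robust_flags = [v for v, z in zip(data, robust_scores(data)) if abs(z) > threshold]

print("flagged with mean/SD:   ", classical_flags)
print("flagged with median/MAD:", robust_flags)
```

The three large values inflate both the mean and the standard deviation enough that none of them looks unusual by the classical rule, while the median-based rule identifies all three.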

The primary reason to use robust estimators in the 
PBC situation is that the voter complaints, legal cases, 
and media reports strongly suggest that the electoral 
results there were produced by processes substantially 
different from the standard political factors (partisan- 
ship, liberalism-conservatism, policy positions) that 
cause voters to act predictably from one election to
another, and that produced the results elsewhere in
Florida. We want our models to predict what would 
have happened without idiosyncratic factors such as a 
confusing ballot form. Significant departures from 
those predictions will indicate that idiosyncratic factors 
must be at work. 

Data weakness is another reason for robust estima- 
tion. Because our models are based on the results from 
other elections, anomalies in those elections will give a 
distorted picture of the standard political factors that 
predict the vote in the current one. The robust estima- 
tors we use protect against the influence of such 
distortions. A county in which the previous election 
results are highly distorted will not affect the parameter 
estimates and, indeed, will itself appear to be an 
outlier. We describe our robust estimation methods in 
more detail in Appendix A. 


BUCHANAN’S VOTE IN COUNTIES ACROSS 
THE UNITED STATES


To assess how excessive Buchanan’s PBC vote appears 
to be when compared to outcomes across the country 
in 2000, we use the overdispersed binomial regression 
model to estimate the expected number of votes cast 
for Buchanan out of all votes cast for Buchanan, Gore, 
Bush, Ralph Nader (Green Party), Harry Browne 
(Libertarian Party), Howard Phillips (Constitution Par- 
ty), John Hagelin (Natural Law Party), or any write-in 
candidates. We predict votes for Buchanan, given the 
total number of votes cast for all presidential candi- 
dates, in each U.S. county in 2000.9 Two kinds of 
information, the results of previous elections and the 
demographic characteristics of the county, are avail- 
able and highly relevant for making such predictions. 
The previous election outcome is a proxy not only for 
the array of interests and party sentiments in each 
county but also for the strength of local party mobili- 
zation. We use two variables to represent the preceding 
election result: the proportion of votes officially re- 
ceived by the Republican candidate in the 1996 presi- 
dential election; and the proportion of votes officially 
received by the Reform Party candidate in the 1996 
presidential election. We supplement them with a set 
of nine demographic variables. Seven of the variables 
come from the 2000 Census of Population and Hous- 
ing: the proportions of county population in each of 
four Census Bureau race categories (namely, white, 
black, Asian and Pacific Islander, and American Indian 
or Alaska Native); 2000 proportion Hispanic; 2000 
population density (computed as 2000 population/1990 
square miles); and 2000 population.10 The eighth and
ninth are the 1990 proportion of population with a
college degree and 1989 median household money





9 We ignore undervotes (no apparent vote recorded on the ballot),
overvotes (votes for more than one presidential candidate on a single
ballot), and other spoiled ballots. For discussions of undervotes and
overvotes in PBC see Engelhardt and McCabe 2001a, 2001b.

10 The 2000 Census data were built from Census 2000 Redistricting
Data (Public Law 94-171) Summary File, Matrices PL1, PL2, PL3,
and PL4 (U.S. Census Bureau, 2001, “FactFinder Tables,” accessed
April 7, 2001, at http://factfinder.census.gov). 




income. We do not use the number of voters registered 
as Reform Party members because voter registration is 
nonpartisan in many states. 

We cannot use all eleven variables (plus a constant) 
in our models at once, because we estimate the model 
separately for each state (except for four states that we 
pool because each has only a few counties; see below), 
and with that many variables a state would need to 
have about 40 to 60 counties to produce reliable 
estimates. But 21 states (including the District of 
Columbia) have fewer than 50 counties. Therefore, we 
include the two vote proportion variables in all the 
models and use principal components of the demo- 
graphic data. To maximize the efficiency of the infor- 
mation gained from the demographic variables, we 
compute principal components of the set of residuals 
obtained by regressing each demographic variable on 
the previous election proportion variables and the 
constant.11 The regressions on the election variables 
and the computation of principal components are done 
separately for each state. 

For the results of the county-level analysis that we 
report in detail here, we use only the first principal 
component of the Census data. As we explain below, 
using more principal components does not substan- 
tively change the results that bear on PBC. Hence, the 
expected vote for Buchanan in county i in the given 
state is based on a linear predictor defined by: 


$x_i'\beta = \beta_0 + \beta_1 x_{1i} + \beta_2 x_{2i} + \beta_3 x_{3i}$,

where $x_{1i}$ is the 1996 proportion of votes received by
the Republican candidate, $x_{2i}$ is the 1996 proportion of
votes received by the Reform candidate, $x_{3i}$ is the
principal component, and $\beta_0$, $\beta_1$, $\beta_2$, and $\beta_3$ are constant
coefficients.
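
As a rough sketch of how such a linear predictor yields an expected vote count, the code below maps the predictor to a probability and multiplies by the number of votes cast. The logistic link is an assumption on our part (it is the usual choice for binomial regression; the model actually used is specified in Appendix A), and the coefficient values are invented for illustration.

```python
import math


def linear_predictor(beta, x1, x2, x3):
    """beta_0 + beta_1*x1 + beta_2*x2 + beta_3*x3 for one county."""
    b0, b1, b2, b3 = beta
    return b0 + b1 * x1 + b2 * x2 + b3 * x3


def expected_votes(total_votes, eta):
    """Expected count, ASSUMING a logistic link between predictor and probability."""
    p = 1.0 / (1.0 + math.exp(-eta))
    return total_votes * p


# Hypothetical coefficients and county values (not estimates from the article).
beta = (-5.0, 1.0, 2.0, 0.1)
eta = linear_predictor(beta, x1=0.4, x2=0.1, x3=0.0)
votes = expected_votes(total_votes=100_000, eta=eta)
print(f"linear predictor = {eta:.2f}, expected votes = {votes:.0f}")
```

Because the predictor enters through a probability, the expected count always lies between zero and the number of votes cast, whatever the coefficient values.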

We compare PBC to the 3,053 counties in the 
United States for which we can robustly estimate the 
overdispersed binomial model and hence compute 
studentized residuals. Because they have too few coun- 
ties to analyze separately, we pool the data for Con- 
necticut, Delaware, Hawaii, and Rhode Island (which 
have, respectively, eight, three, four, and five counties), 
using dummy variables to give each state a different 
intercept but requiring the other coefficients to be the 
same for all four states. The dependent variable is the 
count $y_i$ of votes for Buchanan in each county i in the
2000 election. The studentized residuals are compara- 
ble across states.4 

PBC has the largest residual14 among the 3,053
counties in our analysis. This result can be seen in 


11 We standardize the residuals of the demographic variables to have 
variance equal to 1.0 before computing the principal components. 
12 With only one county, the District of Columbia cannot be ana- 
lyzed. Our collection of counties (or equivalents) includes all states 
except Michigan, where Buchanan was not on the ballot and could 
receive only write-in votes. For Alaska we use the 25 county-
equivalents defined by the U.S. Census Bureau for reporting 1990 
Census data. 

13 We use the final, certified vote counts for each state. See Appendix
B for data sources. 

14 Throughout the rest of this article the word residual always refers 
to the studentized residual, as defined in Appendix A, equation A-1, 
unless otherwise indicated. 
Figure 2, which presents boxplots that display the
distribution of the residuals for each state.15 The
residual for PBC not only has the largest positive value 
but also is the largest in absolute magnitude. Buchanan 
received vastly more votes in PBC than predicted by 
the county’s electoral history and demographic profile. 

Appendix Table B-1 lists the residuals, expected
Buchanan vote proportion, actual Buchanan vote proportion,
and number of valid presidential ballots for all
counties for which the residual is greater than 4.0 or 
less than —4.0, namely, all the outliers. There are 68 
positive outliers and eight negative outliers. The dif- 
ference between the number of positive and the num- 
ber of negative outliers reflects the overall positive 
skew of the residuals that is visible for most states in 
Figure 2. Outliers occur in 31 of the 49 states covered 
by the analysis. 

The outlier status of some counties is readily ex- 
plained. Jasper County, South Carolina, which has the 
second largest residual in our analysis, did not receive 
much national media attention because the outcome 
was immaterial to the 2000 presidential contest. Bush 
defeated Gore in South Carolina by 220,376 votes, but 
only 6,469 presidential ballots were cast in Jasper 
County. Nonetheless, there were serious problems with 
a voting machine in the county’s Tillman precinct, 
where Gore and Bush each received one vote, 
Buchanan 239 votes, and Nader 111 votes. The prob- 
lems in the precinct affected vote totals for other 
offices. Indeed, “the State Board of Canvassers unani- 
mously said [...] that problems in the county council 
election were so numerous that a new election should 
be held.”16 

PBC is not geographically contiguous to any other 
outlier, but many of the counties listed in Table B-1 are 
contiguous to another outlier. Table 1 displays the sets 
of counties from Table B-1 that are geographically 
contiguous. Sixteen of the twenty-five largest positive 
outliers belong to such a cluster. Two clusters include 
counties from two states (West Virginia and Ohio, 
Kansas and Missouri), and another includes counties 
from three states (South Dakota, Iowa, and Nebraska). 
Because these span state borders, it is highly unlikely 
that the exceptional support for Buchanan reflects 
problems of ballot format or electoral administration. 
Most likely the reason is special success in mobilizing 
voters for Buchanan in those areas. 

Plausible explanations also can be produced for 
some of the remaining outliers, but we do not empha- 
size these because the residuals for all of those counties 
are much smaller than that for PBC, and some outliers 
are not stable over variations in the model specifica- 
tions. In contrast, the size and relative position of the 
residuals are stable for counties such as PBC or Jasper, 
for which voting irregularities were reported, or Han- 
cock County in West Virginia and Pottawatomie 


15 Whiskers extend to the nearest value not beyond 1.5 times the 
interquartile range. The residuals for the pooled states (CT, DE, HI,
and RI) are omitted. 

16 See the December 28, 2000, issue of the Beaufort Gazette and 
Associated Press 2000a for allegations regarding the Tillman precinct.
County in Kansas, which are in contiguous clusters with
unusually large numbers of Buchanan votes. PBC has 
the largest residual and Jasper County the second 
largest, whether we use one or two principal compo- 
nents to represent the demographic variables. With 
three principal components, PBC has the second larg- 
est residual (35.5) and Jasper the third largest (20.9). 
With no principal components (i.e., only past vote 
proportions), Jasper has the largest residual (25.6) and 
PBC the second largest (21.5). 

No other county in Florida comes close to PBC in 
terms of excessive votes for Buchanan in the 2000 
election. The only other outlier in the state is Pinellas 
County (see Table B-1). The parameter estimates for 
Florida give a point estimate of 438 for the number of 
votes expected for Buchanan in PBC, which implies an 
excess of 2,973 accidental votes in his certified tally of 
3,411 votes. 

Although PBC was an outlier in 2000, it is possible 
that support for the Reform Party is typically unusually 
high there. Because no butterfly ballot was used in 
1996, an unexpectedly high number of Reform votes in 
the county that year compared to other Florida coun- 
ties might support an alternative explanation for the 
2000 result. Another possibility is that the 1996 Reform 
vote was exceptionally low in PBC, so the anomaly of
the 2000 vote could be exaggerated. In that case, the 
2000 vote would appear to be excessive even if the
county’s Reform vote were simply returning to nor- 
malcy. The 1996 data support neither of these possi- 
bilities. 

Using the overdispersed binomial model to analyze 
the votes received across the counties of Florida by 
Ross Perot, the Reform candidate in the 1996 presi- 
dential election, we find that PBC was not an outlier in 
1996. We model the number of votes cast for Perot out 
of all votes cast either for Perot, Democrat Bill Clinton, 
or Republican Bob Dole. The regressors are defined to 
be earlier versions of the county-level variables we used 
to analyze the 2000 vote data: the proportion of votes 
officially received by the Republican candidate in the
1992 presidential election; the proportion of votes 
officially received by the Reform candidate in the 1992 
presidential election; and earlier demographic data.17 
We use our robust estimators. 

In 1996 no county in Florida has a residual even 
remotely as large as the one for PBC in 2000. In 1996 
only St. Lucie County has a residual of absolute 
magnitude greater than 4.0 (—4.92). The largest posi- 
tive residual is for Holmes County (2.30). PBC has the 





17 The race, Hispanic ethnicity, and population variables are taken 
from the 1990 Census. 
TABLE 1. Contiguous Counties among Those with the Largest Studentized
Residuals for the Buchanan Vote
[Table columns: State, County Name; surviving entry: Hocking]

seventh most negative residual (−1.86). PBC was not
an outlier in 1996. 

The 2000 vote for Buchanan in PBC was extremely 
unusual, clearly among the most unusual in the entire
country. The vast difference between the residual
for PBC and the residuals for the other counties 
confirms that a large anomaly occurred relative to the 
vote predicted for Buchanan based on PBC’s voting
history and demographic profile. 


A NATURAL EXPERIMENT: FLORIDA’S 
ELECTION-DAY AND ABSENTEE VOTERS 
IN 2000 


Buchanan’s vote total in PBC in the 2000 presidential 
election was anomalously large, but how can we be sure 
that the cause was the butterfly ballot? Because that 
format was not used for absentee ballots, the election 
gives us a natural experiment: One group of PBC 
voters (election day) used a butterfly ballot but a 
second group (absentee) did not. If Buchanan’s vote 
total in the county reflects true support among the 
voters, then this support should be present in both 
pools of ballots. But if the butterfly ballot is responsible 
for Buchanan’s vote then his support should come 
disproportionately from votes on election day. 

A limitation of this natural experiment is that the 
mechanism that allocates voters to either the election- 
day pool or the absentee pool is not random assign- 
ment (Achen 1986). Voters self-select to be in the 
absentee group.18 Some, such as military personnel
stationed overseas, must cast absentee ballots. Absen- 
tee voters may not be representative of voters in 
general, but there are good reasons to believe that the 
influences on their voting behavior are similar across 
the counties. PBC’s absentee voters are probably sim- 
ilar to those in at least some of the other counties, and 
we would expect their level of support for Buchanan to 
be similar as well. Therefore, if we take the difference 
between the proportion of those voting for Buchanan 
on election day and the proportion voting for him on 
absentee ballots, PBC’s difference should cluster with 
the differences for other counties, unless something 
like the butterfly ballot has caused it to be distinctly 
different. 

This difference analysis shows disproportionate sup- 
port for Buchanan among election-day voters in PBC. 
Figure 3 plots the difference—election-day proportion 
minus absentee proportion—for all 67 counties of 
Florida by the number of presidential ballots cast in 
each county.19 One can see that PBC has one of the
largest differences in the state, although four counties
have larger differences, and a few others have differ- 
ences close in value to that of PBC. But in all those 
counties the voting population is much smaller than in 
PBC, where 433,186 ballots were cast for president. In 
Calhoun, the largest county with a difference greater 
than PBC’s, there were 5,174 ballots cast for president. 

The significance of the population size disparity is 
that even if voting processes are identical in all the 
counties, there will be greater variability in the differ- 


18 According to 2000 Florida Statutes, title IX, chapter 101.62, a 
registered voter need not give any reason for requesting an absentee 
ballot. 

19 The data are based on certified numbers from the Florida Depart- 
ment of State and precinct-level returns provided by the 67 Florida 
counties. We used the precinct data to calculate the absentee returns. 
20 The difference in PBC is 0.00634 (433,186 presidential ballots),
exceeded by Liberty (0.0177, 2,410 ballots), Calhoun (0.00959, 5,174
ballots), Hamilton (0.00731, 3,964 ballots), and Dixie (0.00706, 4,666
ballots). Union and Baker counties have the next largest differences
after PBC, in both cases 0.00620. The numbers of presidential ballots
cast in those counties were, respectively, 3,826 and 8,154.

ence in proportions in counties where smaller numbers
of ballots are cast. With identical processes in all 
counties, the standard deviation of the difference in 
proportions will vary approximately in proportion to 
the reciprocal of the square root of the total number of 
ballots. If in all counties the proportions of election- 
day and absentee ballots cast for Buchanan do not 
systematically differ, then the observed differences 
between the proportions should be contained within 
bounds defined by the reciprocal of the square root of 
the number of ballots.21 In Figure 3 the dashed lines
are drawn at the values $\pm 1/\sqrt{m}$, where m denotes the
number of ballots. One can see that only PBC falls
outside those bounds. The difference between 
Buchanan’s election-day and absentee proportions in 
PBC appears to be much greater than one would 


21 If m is the (large) total number of ballots, q is the proportion of
election-day ballots, and $\pi$ is the probability of voting for Buchanan
for both types of ballots, then the overdispersed binomial model with
dispersion parameter $\sigma^2$ implies that the standard deviation of
the difference between the observed proportions is approximately
$\sigma[(1/m)(1/q + 1/(1 - q))\,\pi(1 - \pi)]^{1/2}$. Because $\pi \approx .003$ for
Buchanan throughout Florida, containment within the bounds will
fail only if $\sigma$ is large or q is near 1.0, or if the probabilities for the
two ballot types are not equal.

expect by chance. The differences for the other coun-
ties appear to fall within the range one would expect 
given only random deviations from equality in the 
processes that generate election-day and absentee 
votes. 
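The band check described above can be sketched in a few lines of Python. This is only an illustration, not the authors' code: the county figures are the ones quoted in the text and in footnote 20, while the function names (`band_half_width`, `outside_band`, `sd_difference`) are hypothetical. The last function transcribes the standard deviation formula from footnote 21.

```python
import math

# (county, election-day minus absentee Buchanan proportion, presidential ballots),
# as quoted in the text and footnote 20 for Florida in 2000.
COUNTIES = [
    ("Palm Beach", 0.00634, 433_186),
    ("Liberty", 0.0177, 2_410),
    ("Calhoun", 0.00959, 5_174),
    ("Hamilton", 0.00731, 3_964),
    ("Dixie", 0.00706, 4_666),
    ("Union", 0.00620, 3_826),
    ("Baker", 0.00620, 8_154),
]


def band_half_width(m):
    """Reference bound: reciprocal of the square root of the ballot count."""
    return 1.0 / math.sqrt(m)


def outside_band(counties):
    """Return the names of counties whose absolute difference exceeds 1/sqrt(m)."""
    return [name for name, diff, m in counties if abs(diff) > band_half_width(m)]


def sd_difference(m, q, pi, sigma):
    """Approximate SD of the difference in proportions (footnote 21 formula).

    m: total ballots, q: election-day share of ballots,
    pi: common probability of a Buchanan vote, sigma: dispersion (SD scale).
    """
    return sigma * math.sqrt((1.0 / m) * (1.0 / q + 1.0 / (1.0 - q)) * pi * (1.0 - pi))


flagged = outside_band(COUNTIES)
print(flagged)  # ['Palm Beach']
```

Among the seven counties listed, only Palm Beach exceeds its bound, even though four smaller counties have larger raw differences.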

The distorting effects of vastly different population 
sizes across counties may be fully corrected by explicitly 
setting up the comparison between election-day and 
absentee ballots as a test of the hypothesis that the 
proportion of votes for Buchanan among all the votes 
cast for president is equal for the two ballot formats. 
The simplest test is to determine whether Buchanan’s
proportion of the election-day ballots differs
significantly from his proportion of the absentee
ballots. We can do this using a straightforward
modification of a difference of proportions test
that appears in standard textbooks (Wonnacott and 
Wonnacott 1990, 275, eq. 8-29). To define the proportions,
let $A_i$ denote the total number of absentee votes
cast for president in county i, let $a_i$ denote the number
of absentee votes cast for Buchanan, let $B_i$ denote the
total number of election-day votes cast for president,
and let $b_i$ denote the number cast for Buchanan. We
again use an overdispersed binomial model for the
totals $a_i$ and $b_i$, with statewide dispersion parameter $\sigma^2$.
Because $A_i$ and $B_i$ are large, the equality hypothesis
implies that the following z-score is normal with mean
zero and unit variance:

$$z_i = \frac{\sigma^{-1}\,(b_i/B_i - a_i/A_i)}{\left[\dfrac{(b_i/B_i)(1 - b_i/B_i)}{B_i} + \dfrac{(a_i/A_i)(1 - a_i/A_i)}{A_i}\right]^{1/2}}$$

If $z_i$ is significantly greater than zero, then in proportional
terms election-day voters cast ballots for
Buchanan much more often than did absentee voters.
If $z_i$ is significantly less than zero, then support for
Buchanan was disproportionately great among absentee
voters. To compute $z_i$ we need an estimate of $\sigma$. We
use the estimate of the scale parameter that we obtained
for the 2000 Florida data when we estimated the
county-level overdispersed binomial model. That estimate
is $\hat{\sigma} = 3.81$.

PBC has $z_i = 6.3$, a value more than six standard
deviations away from the value of zero that is expected
under the equality hypothesis. The next largest positive
value among the remaining counties of Florida is 1.65,
and 58 counties have a z-score of magnitude less than
1.0. Only one negative value, a z-score of −1.1 for
Duval County, is more than one standard deviation
away from zero.

In PBC, election-day votes went to Buchanan vastly
more often, in proportional terms, than did absentee
votes. The results of the election-day versus absentee
ballot natural experiment strongly support the conclusion
that Buchanan’s anomalous support was caused by
the butterfly ballot.

Underlying the dramatic test statistic value for PBC
is the fact that county voters supported Buchanan on
election day at a rate (.0085) approximately four times
that of absentee voters (.0022). If election-day voters
had cast ballots for Buchanan at the rate that absentee
voters did, he would have received 387,356 × 0.0022 =
854 election-day votes. In fact he received 3,310 election-day
votes. By this method one might gauge the
number of accidental votes for Buchanan because of
the butterfly ballot as follows: 3,310 − 854 = 2,456
votes.


TABLE 2. Votes for Reform Candidates by Proportions Voting for U.S. Senate Candidates, for
Palm Beach County Precincts

                Reform                      Senate:         Senate:
Scope           Candidate   Intercept       Nelson (D)      Deckard (Ref)     σ̂
All precincts   Buchanan    −6.17 (0.15)    2.08 (0.21)     −12.74 (14.79)    1.22
District 35     Buchanan    −7.48 (0.51)    3.85 (0.71)     13.13 (23.89)     1.26
District 35     Lowe        −1.98 (0.34)    −1.88 (0.51)    18.06 (14.46)     1.54
District 16     Buchanan    −7.00 (0.29)    3.32 (0.48)     3.95 (15.09)      1.15
District 16     McGuire     −3.37 (0.30)    −1.00 (0.50)    25.96 (9.10)      1.52

Note: Entries are tanh estimates of coefficients of the overdispersed binomial regression model using precinct-level data from the 2000 election
(standard errors are in parentheses). The last column reports the LQD dispersion estimate σ̂. Number of precincts: all precincts, 516; District 35, 106;
District 16, 149.


Vol. 95, No. 4
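The difference-of-proportions z-score and the excess-vote arithmetic for PBC described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' code. The election-day figures, the county totals, and the dispersion estimate 3.81 are quoted in the text. The absentee counts are not quoted directly, so the sketch assumes they equal the certified totals minus the election-day figures. The function names are hypothetical.

```python
import math


def z_score(b, B, a, A, sigma):
    """Difference-of-proportions z-score, scaled by the dispersion estimate."""
    p_day = b / B  # Buchanan share of election-day ballots
    p_abs = a / A  # Buchanan share of absentee ballots
    se = math.sqrt(p_day * (1 - p_day) / B + p_abs * (1 - p_abs) / A)
    return (p_day - p_abs) / (sigma * se)


def excess_election_day_votes(b, B, a, A):
    """Election-day votes beyond what the absentee rate would predict."""
    expected = round(B * a / A)  # votes expected at the absentee rate
    return b - expected


# Figures quoted in the text for Palm Beach County.
TOTAL_BALLOTS = 433_186   # presidential ballots cast in PBC
TOTAL_BUCHANAN = 3_411    # certified Buchanan tally
B = 387_356               # election-day presidential ballots
b = 3_310                 # election-day Buchanan votes
SIGMA_HAT = 3.81          # statewide dispersion estimate

# Assumption: absentee figures obtained by subtraction from the totals.
A = TOTAL_BALLOTS - B     # 45,830 absentee ballots
a = TOTAL_BUCHANAN - b    # 101 absentee Buchanan votes

z_pbc = z_score(b, B, a, A, SIGMA_HAT)
excess = excess_election_day_votes(b, B, a, A)
print(round(z_pbc, 1), excess)  # 6.3 2456
```

Under that subtraction assumption the absentee rate is about .0022 before rounding, and applying the unrounded rate to the 387,356 election-day ballots gives the 854 expected votes quoted in the text.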





PRECINCT-LEVEL ANALYSIS OF PALM 
BEACH COUNTY RETURNS 


Who made the mistakes on the butterfly ballot? Was it 
voters who favored Bush or those who wanted to vote 
for Gore? At an intuitive level, mistakes seem less 
likely for Bush voters, who had to match the first 
candidate with the first punch hole, than for Gore 
voters, who had to match the second candidate with the 
third punch hole. Furthermore, it was a large number 
of Democratic voters who complained that the ballot 
caused them to vote for Buchanan by mistake. None- 
theless, it is important to examine the possibility that 
Buchanan received votes intended for both candidates. 

To assess the asymmetry of voting error—whether 
Democratic voters mistakenly chose Buchanan at a 
substantially greater rate than Republican voters—we 
estimate an overdispersed binomial model for precinct- 
level election returns across PBC. We use two regres- 
sors (plus the constant): the proportions of the vote in 
each election-day precinct for two U.S. Senate candi- 
dates, namely, Democrat Bill Nelson and Reform 
candidate Joel Deckard. We use our robust estimators. 
If the butterfly ballot did not cause systematically 
biased voting errors, then support for Nelson (D) in a
precinct should be negatively associated with the 
Buchanan vote, and support for Deckard (Ref) would 
be positively associated with that vote. If the butterfly 
ballot did cause asymmetric voting errors, then support 
for Nelson (D) should be positively associated with the 
Buchanan vote. 

The first line in Table 2 presents the coefficient 
estimates for all PBC election-day precincts. Support 
for Nelson (D) is positively associated with support for 
Buchanan. This pattern is expected if the butterfly 
ballot caused many Democrats to vote for Buchanan 
and supports the claim that his votes tended to come 
from mistaken Gore supporters. 

The pattern does not occur in votes for other offices 
that included a Reform candidate but did not use the
butterfly ballot. In 2000 in PBC, Sherree Lowe (Ref) 
ran for the State Senate in District 35 and John 
McGuire (Ref) ran for the U.S. House in District 16. 
The issue positions stated on their and Deckard’s web 
pages suggest that all three of the Florida Reform 




The Butterfly Did It: The Aberrant Vote for Buchanan in Palm Beach County, Florida 


candidates were in sympathy with the Buchanan fac- 
tion. For instance, all were pro-life and looked askance 
at free trade. Deckard went so far as to express 
concerns about legal immigration. 

We robustly estimate two models for election-day 
precincts in PBC in each district. In both districts we 
analyze the votes received by Buchanan. For District 35 
we also analyze the votes received by Lowe (Ref), and 
for District 16 we analyze the votes for McGuire (Ref). 
Because the butterfly ballot was used only for the 
presidential vote, we expect support for Nelson (D) to 
be negatively associated with support both for Lowe 
(Ref) and for McGuire (Ref). 

The coefficient estimates in Table 2 match those 
expectations, while in both districts support for Nelson 
(D) remains positively associated with the Buchanan 
vote. These results support the claim that the butterfly 
ballot caused systematic, biased voter errors that cost 
Gore more votes than Bush. Democratically inclined 
precincts (as measured by the Nelson [D] vote propor- 
tion) have fewer votes for Reform candidates in gen- 
eral (i.e., Lowe and McGuire) but have more votes for 
Buchanan. The difference is the butterfly ballot. 

These findings refute one possible explanation for 
the positive association between Nelson’s vote share 
and Buchanan votes, which is that Reform Party mem- 
bers in PBC tend to live among Democrats. Such an 
explanation is contradicted by the negative association 
between the proportion of votes for Nelson (D) and 
the votes for Lowe (Ref) and for McGuire (Ref). 

We also can refute another class of alternative 
explanations for Buchanan’s vote total in PBC: It may 
have been caused by a group of anomalous precincts 
within the county. In other words, anomalous results 
concentrated within a few precincts would suggest that 
excess votes were the result of localized phenomena 
rather than the butterfly ballot which was used uni- 
formly throughout the county. For example, malfunc- 
tioning vote machines—such as the one in Tillman 
precinct in Jasper County, South Carolina—could have 
recorded extra votes for Buchanan in a few precincts. 
Merzer and the Miami Herald (2001, 78—80) document 
the existence of malfunctioning machines in numerous 
PBC precincts: 96 of 462 ballots from tests conducted 
immediately before polls opened showed failures. Or 
there may have been intentional fraud in a few pre- 
cincts. Finally, pockets of intense election-day Reform 
mobilization could have delivered the extra votes. 

Such explanations, however, are quite difficult to 
reconcile with our precinct-level findings. Given our 
use of a robust estimator, the coefficient estimates in 
Table 2 would not be affected by a few precinct-level 
anomalies produced by irregular voting processes.23
Moreover, a localized mobilization effort should affect 
outcomes in multiple races, but the peculiar relation- 


22 Candidate issue positions were posted at http://www iain
com/issues.htm (Deckard), http://www
(McGuire), http://www.sherreclowe.com/ (no longer available) and
http://election.dos.state.fl.us/cgi tmLexe?account=31167&
elecid=20001107-GEN (Lowe) (all accessed April 7, 2001).

23 In the analysis of all 515 precincts, reported in Table 2, ten were
outliers.

TABLE 3. Vote for Buchanan by U.S. Senate
Vote in Palm Beach County, for Individual
Ballots by Ballot Type

Ballot Type     Intercept        Nelson (D)       Deckard (Ref)
Election-day    −5.18 (0.034)    0.61 (0.040)     2.41 (0.138)
Absentee        −6.11 (0.156)    −0.21 (0.236)    3.68 (0.400)

Note: Number of ballots of each type: election-day, 381,449; absentee, 36,412.


ship between Nelson (D) vote share and Reform vote 
is present only in the presidential race, which used the 
butterfly ballot.24 


COMPARISON USING INDIVIDUAL PALM 
BEACH COUNTY BALLOTS 


We extend our analysis to ballot-level data from PBC, 
which enable us to compare an individual’s presidential 
vote choice with the individual’s choices for other 
offices. The data include both election-day and absentee
ballots but exclude returns for 25 precincts that
were overwritten by test data when PBC tested its vote 
tabulating machines. The number of Buchanan votes 
lost from those precincts (2.1%) is proportional to the 
number of precincts lost. We use the ballot-level data 
to validate the precinct-level regression results. We 
also produce another estimate of the size of the 
butterfly ballot effect. 

With ballot-level data the binomial regression model 
reduces to the familiar binary logistic regression mod- 
el. The dependent variable is whether the voter cast a 
vote for Buchanan or voted for another candidate. In 
addition to the constant, the regressors are two dummy 
variables that respectively indicate whether the ballot 
records a vote for Nelson (D) or a vote for Deckard
(Ref).

The estimates reported in Table 3 for election-day 
ballots show a positive and significant coefficient on 
voting for Nelson (D), which matches the positive 
coefficient on the precinct-level proportion of votes for 
Nelson (D) in Table 2. Among election-day ballots, 
voting for Nelson (D) is positively associated with 
voting for Buchanan. For absentee ballots, however, 
the Nelson coefficient is negative and insignificant. The 
confidence intervals of the Nelson coefficients for the 
two ballot formats do not overlap. Therefore, we reject 
the hypothesis that the effect of voting for Nelson (D) 
is the same regardless of the ballot format. 
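The interval comparison described above can be checked directly from the Table 3 entries. The sketch below is illustrative only. It assumes the parenthesized values are standard errors and uses conventional 95% Wald intervals (critical value 1.96), a level the text does not state. The second Table 3 row is taken to be the absentee row, as the surrounding discussion implies. The helper names are hypothetical.

```python
import math

# Table 3 entries as (coefficient, assumed standard error).
TABLE3 = {
    "election_day": {"intercept": (-5.18, 0.034), "nelson": (0.61, 0.040), "deckard": (2.41, 0.138)},
    "absentee": {"intercept": (-6.11, 0.156), "nelson": (-0.21, 0.236), "deckard": (3.68, 0.400)},
}


def wald_interval(estimate, se, crit=1.96):
    """Symmetric Wald interval; crit=1.96 is an assumed 95% level."""
    return (estimate - crit * se, estimate + crit * se)


def overlap(first, second):
    """True when two closed intervals share at least one point."""
    return first[0] <= second[1] and second[0] <= first[1]


def logistic(eta):
    """Inverse logit link."""
    return 1.0 / (1.0 + math.exp(-eta))


def implied_proportion(ballot_type, senate_vote):
    """Buchanan probability implied for ballots with the given Senate vote."""
    coefs = TABLE3[ballot_type]
    return logistic(coefs["intercept"][0] + coefs[senate_vote][0])


day_ci = wald_interval(*TABLE3["election_day"]["nelson"])
abs_ci = wald_interval(*TABLE3["absentee"]["nelson"])
print(day_ci, abs_ci, overlap(day_ci, abs_ci))
print(implied_proportion("election_day", "nelson"), implied_proportion("absentee", "nelson"))
```

Because the two dummy regressors are mutually exclusive, the implied probabilities for Nelson and Deckard voters should approximately match the raw proportions reported in Table 4, up to rounding of the published coefficients.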


24 In addition, Elms and Brady (2001) show that the excessive
Buchanan vote is spread throughout the county, and there is a
precipitous drop in the Buchanan vote proportion in precincts in
adjoining counties.


3 The ballot data were acquired from the PBC Supervisors of 
Elections. 

25 In the absence of grouping there cannot be overdispersion (Mc- 
Cullagh and Nelder 1989, 125) and high breakdown estimators do 
not exist (Christmann 1994).

TABLE 4. Proportion Voting for Buchanan by
U.S. Senate Vote Choice and Ballot Type in
Palm Beach County

                      Election-Day Ballots      Absentee Ballots
Senate Candidate      Proportion   N            Proportion   N
Bill Nelson (D)       0.0102       228,455      0.0017       17,779
Joel Deckard (Ref)    0.0590       1,000        0.0808       99

Note: Entries are the proportion of ballots with a vote for Buchanan out
of the N ballots of each type voted for each Senate candidate, using
ballot data from the 2000 election. Ballots with spoiled presidential
votes (undervotes or overvotes) are omitted.


Table 4 shows the proportion of votes in PBC going 
to Buchanan among all ballots that record U.S. Senate 
votes for either Nelson (D) or Deckard (Ref). The 
proportions show that PBC voters who support the 
Democratic Senate candidate are significantly more 
likely to vote for Buchanan on the butterfly ballot than 
are their counterparts who use the absentee ballot— 
indeed, six times more likely. Fewer than two in one
thousand absentee voters in PBC who vote for Nelson 
(D) also vote for Buchanan, but among election-day 
Nelson voters the figure is ten in one thousand. If we 
treat the absentee proportion as the proportion of 
votes truly intended to go to Buchanan, then about 8.5 
of every 1,000 Nelson voters in PBC—about 2,300 
voters—appear to have voted mistakenly for 
Buchanan.27 Because most (89.6%) absentee Nelson
voters voted for Gore, we can further conclude that at 
least 2,000 of the 2,300 would have been Gore votes. 
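The arithmetic behind these estimates can be retraced with the figures quoted in Table 4, in the text, and in the accompanying footnote. The sketch below is only a worked check; the variable names are hypothetical.

```python
# Figures quoted in Table 4 and the surrounding text.
P_DAY_NELSON = 0.0102                 # Buchanan share among election-day Nelson voters
P_ABS_NELSON = 0.0017                 # Buchanan share among absentee Nelson voters
NELSON_ELECTION_DAY_VOTES = 269_835   # Nelson's county-wide election-day total
GORE_SHARE_ABSENTEE_NELSON = 0.896    # share of absentee Nelson voters who chose Gore

# Election-day Nelson voters were about six times as likely to vote for Buchanan.
ratio = P_DAY_NELSON / P_ABS_NELSON

# Treat the absentee proportion as the "intended" Buchanan rate.
excess_rate = P_DAY_NELSON - P_ABS_NELSON
mistaken_votes = round(NELSON_ELECTION_DAY_VOTES * excess_rate)

# Portion of those votes that would plausibly have gone to Gore.
likely_gore_votes = mistaken_votes * GORE_SHARE_ABSENTEE_NELSON

print(ratio, mistaken_votes, likely_gore_votes)
```

The product of the excess rate and Nelson's election-day total rounds to 2,294, and applying the 89.6% Gore share leaves a figure above 2,000, in line with the statements in the text.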

In contrast, Table 4 shows that individuals who vote 
for Deckard (Ref) are more likely to vote for 
Buchanan on the absentee ballot. Deckard voters who 
support Buchanan should not be affected by the but- 
terfly ballot, and the difference between election-day 
and absentee Buchanan vote proportions is small. 

The ballot data add to the evidence that the butterfly 
ballot caused systematic voting errors in PBC that cost 
Gore votes. In particular, these data help rule out the 
possibility that Buchanan’s exceptional support in the 
county was a result of populist appeals he made or 
policy positions he took that Democrats found attrac- 
tive. The ballot data show that the appeals would 
somehow have to have been effective for Democrats 
who voted on election day but not Democrats who used 
an absentee ballot. 

We gain analytical precision with the ballot data, but 
the ballot-level analysis complements rather than re- 
places the county and precinct analyses presented 
above. Ballot-level data are rarely retained or made 
available after an election, so it is not generally possible 
to compare these results across states or counties. 
Without such comparisons, the ballot-level results must 
be considered with some caution. Moreover, ballot 
data from 2% of the precincts in PBC are not available,
but no data are missing for our precinct-level analysis.
Also, unlike the county and precinct analysis, the
analysis of individual ballots cannot use robust estimation
techniques. In the absence of robust (high breakdown
point) results, it would have been possible to
claim that the aberrations we found may be limited to
a few idiosyncratic precincts and not characteristic of
PBC as a whole. But the aberrations prevail throughout
the county.

27 This number is calculated by 269,835 × 0.0085 = 2,294. The
number 269,835 is the total Nelson (D) received in the entire county
on election day, including the precincts missing from the ballot data.


CONCLUSION 


We have examined the source of the anomalous sup- 
port for Buchanan in PBC by focusing on allegations 
that the county’s use of a butterfly ballot caused 
systematic voting errors that boosted the number of 
votes for Buchanan.28 Robust estimation of overdispersed
binomial regression models showed that, with
respect to the Reform vote in 2000, PBC is the largest 
outlier among all counties in the United States we were 
able to examine. We also showed that PBC was not a 
Reform vote outlier in 1996, a presidential year in 
which the county did not use a butterfly ballot. In some 
counties around the country we found clear auxiliary 
evidence of problems with ballots, voting machines, or 
election administration. In still others there are strong 
indications that Buchanan received an exceptional 
number of votes because he had exceptional political 
support in those places. There is no reason to believe 
that he had such mobilized support in PBC. 

Having confirmed that in 2000 PBC was an outlier, 
we sought to verify whether the butterfly ballot was the 
cause. A comparison of election-day versus absentee 
ballot results across all Florida counties shows that 
Buchanan’s success in PBC did not extend to absentee 
voters, who did not use the butterfly ballot. We exam- 
ined the claim that Democratic presidential candidate 
Al Gore in particular was harmed by the butterfly 
ballot. We found that Buchanan’s support in PBC 
tended to come from more Democratic precincts and 
from those who voted for the Democratic candidate for 
the U.S. Senate, which supports the claim that mis- 
taken votes for Buchanan tended to come from Gore 
supporters. 

Was the butterfly ballot pivotal in the 2000 presiden- 
tial race? The evidence is very strong that it was. Had 
PBC used a ballot format in the presidential race that 
did not lead to systematic biased voting errors, our 
findings suggest that, other things equal, Al Gore 
would have won a majority of the officially certified 
votes in Florida. 

Our analysis complements the efforts of media 
groups to inspect ballots throughout Florida in order to 
assess what result would have been produced by com- 
pleting the recount that the U.S. Supreme Court 
terminated, or by conducting a count using uniform 
standards throughout the state. As of this writing only 
the results of a statewide inspection conducted by the 


28 A related allegation is that the PBC ballot led to excessive
overvoting in the presidential race (Merzer and Miami Herald 2001).
The subject of overvoting is beyond the scope of this article (recall
footnote 9; also see Bridges 2001).

Miami Herald have been reported.29 Our analysis answers
a counterfactual question about voter intentions
that such investigations cannot resolve. The inspections 
may clarify the number of voters who marked their 
ballot in support of the various candidates, but the
inspections cannot tell us how many voters marked
their ballot for a candidate they did not intend to
choose.

Citing the results from various scenarios in which
votes were counted using one of several reasonable 
uniform standards, the Herald concludes “After study 
and analysis of 111,261 overvotes and 64,826 under- 
votes, [...] the outcome still depends on the standard 
used to gauge undervotes. Gore wins narrowly under 
two undervote standards, by margins of 332 and 242 
votes; Bush wins narrowly under two other undervote 
standards, by 407 and 152 votes (Merzer 2001a; see 
also Merzer 2001b). Evidently, the number of votes 
that were intended for Gore but that went to Buchanan 
because of the butterfly ballot is large enough to have 
changed the election outcome given any of several
reasonable standards that might have been used to
count the votes in Florida. 

Although we focus here on the butterfly ballot in
PBC, our methods could be used on a regular basis as 
part of an ongoing effort to identify election anomalies 
and improve the administration of elections by elimi- 
nating such anomalies. Our robust estimation and 
outlier detection methods offer an accurate and pow- 
erful technology for detecting irregular vote outcomes. 
But determining why a particular irregularity occurs 
requires a strategy of triangulation, such as the one we 
pursue here. Different models and different types of 
data need to be marshaled to eliminate plausible 
alternative explanations. In the case of PBC and the
2000 presidential election, such a strategy leads to the 
conclusion that “the butterfly did it.” 


APPENDIX A: STATISTICAL MODEL AND 
ESTIMATION METHODS 


The Overdispersed Binomial Regression 
Model 


The overdispersed binomial regression model is defined as
follows. Let i indicate one of n geographic areas, i = 1, ..., n.
Let $y_i$ denote the count of votes for Buchanan in area i, and
let $m_i$ denote the total number of ballots cast for all presidential
candidates in area i. Given $m_i$ and a probability value
$\pi_i$, the expected value of $y_i$ is

$$E(y_i \mid m_i, \pi_i) = m_i \pi_i,$$

and the variance of $y_i$ is

$$E[(y_i - m_i \pi_i)^2 \mid m_i, \pi_i, \sigma^2] = \sigma^2 m_i \pi_i (1 - \pi_i),$$

with $\sigma^2 > 0$ (McCullagh and Nelder 1989, 125, eq. 4.20). If
$\sigma^2 = 1$, then the variance is the same as the variance of a
standard binomial random variable. If $\sigma^2 > 1$, then there is
overdispersion relative to a purely binomial model. The
probability value $\pi_i$ is a logistic function of a linear predictor
denoted $x_i\beta$, where $x_i$ is a vector of k regressors, and $\beta$ is an


29 See Miami Herald 2001 and Merzer and Miami Herald 2001.

unknown constant vector of coefficient parameters. The
probability $\pi_i$ is defined by:

$$\pi_i = \frac{1}{1 + \exp(-x_i\beta)}.$$
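As a rough illustration of the model's first two moments, the sketch below computes the logistic probability, the mean, and the overdispersed variance for one area. It is not the authors' estimation code. The regressor values and coefficients in the usage lines are hypothetical, and the scaled residual shown is a simple Pearson-type quantity rather than the studentized residual used in the analysis.

```python
import math


def logistic_probability(x, beta):
    """pi_i = 1 / (1 + exp(-x_i beta)) for one area's regressor vector."""
    eta = sum(x_j * b_j for x_j, b_j in zip(x, beta))
    return 1.0 / (1.0 + math.exp(-eta))


def overdispersed_moments(m, pi, sigma2):
    """Mean m*pi and variance sigma^2 * m * pi * (1 - pi) of the vote count."""
    mean = m * pi
    variance = sigma2 * m * pi * (1.0 - pi)
    return mean, variance


def scaled_residual(y, m, pi, sigma2):
    """Pearson-type residual: (observed - expected) / model standard deviation."""
    mean, variance = overdispersed_moments(m, pi, sigma2)
    return (y - mean) / math.sqrt(variance)


# Hypothetical area: a constant plus two regressors, with made-up coefficients.
x = (1.0, 0.45, 0.10)
beta = (-6.0, 1.0, 2.0)
pi = logistic_probability(x, beta)
print(overdispersed_moments(20_000, pi, sigma2=4.0))
```

Setting `sigma2=1.0` recovers the ordinary binomial variance, which makes the role of the dispersion parameter easy to see by comparison.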

We do not treat the overdispersed binomial model as fully 
characterizing a likelihood function for the data. We treat the 
model as what McCullagh and Nelder (1989) describe as a 

“quasi-likelihood.” As they explain (pp. 323-48), under fairly 
mild conditions estimates of parameters of the mean that are 
based on a quasi-likelihood have desirable properties, such as 
asymptotic unbiasedness, asymptotic optimality, and asymp- 
totic normality. In the current analysis, the quasi-likelihood 
approach means that we assume only that the mean and 
variance formulas we define are good descriptions of the 
data. In fact, by using the robust estimation methods defined 
below, we assume somewhat less than that and still obtain 
estimates with good statistical properties. We require only 
that the mean and variance formulas are good descriptions 
for most of the observed vote counts in each collection of 
data for which we estimate parameters. In the analysis of 
county-level data, this means that the model is good for most 
of the counties in each state, and in the analysis of data for 
PBC precincts, for most of the precincts in PBC. 

The idea that the model is good for most of the data does 
not mean that the model fully characterizes the process that 
generated most of the vote counts. Obviously, the process is 
vastly more complicated than our spare model specifications 
could possibly represent in full. Rather, the idea is that our 
estimator of the model’s parameters converges asymptoti- 
cally to a unique value for each parameter. In the theory of 
robust estimation this idea is made precise by the assumption 
that a probability model exists relative to which the estimator
is Fisher consistent (Hampel et al. 1986, 82). If applied to all 
the data, that property would be for all practical purposes the 
same as the unique identifiability and convergence properties 
that White (1994) demonstrates are necessary for what he 
calls a “quasi maximum likelihood estimator” (QMLE) to 
have good statistical properties.30 The robust estimation
methods we use have good statistical properties even when 
the property holds only for a majority of the observations. 




Robust Estimation 


Breakdown Points. The key property of the robust estimators
we use is that they have a high breakdown point. In a
finite sample, the breakdown point of an estimator is the
smallest proportion of the observations that must be replaced
by arbitrary values in order to force the estimator to produce
values arbitrarily far from the parameter values that generated
the original data (Donoho and Huber 1983). The
general concept of breakdown point (e.g., the asymptotic
breakdown point) has the same connotation, although
achieving full generality is technically demanding
(Hampel 1971). To illustrate the breakdown point idea, we
consider again the case of estimating the average for a set of 
numbers. For concreteness, suppose that the numbers origi- 
nate from an unbiased sample of size n from a process that 
has mean zero and finite variance. As an estimator of the true 
mean, the sample mean has a breakdown point of 1/n, 
because only one of the n data points needs to be replaced to 


30 The McCullagh and Nelder (1989) concept of quasi-likelihood,
which has to do with using a model that one believes is correct only
for the first two moments of the data (the mean and covariance
matrix), is not the same as the QMLE of White (1994), which has to
do with the asymptotic properties of a model that is misspecified.

force the sample mean to take a value arbitrarily far from the
true mean. If one data point is replaced with another value 
far from zero but all the other data remain unchanged, then 
the value of the sample mean moves toward the distorted 
data value; if the distorted data value is moved indefinitely far 
from zero, the sample mean moves indefinitely far from zero. 
Asymptotically, as the sample size n increases to infinity, the 
sample mean has a breakdown point of zero, because $1/n \to 0$.
The same is true for any least-squares estimator or, indeed,
for any estimator that always puts positive weight on all the 
observed data. In contrast, as an estimator of the true mean 
in our example, the sample median has a breakdown point of 
1/2. In order to move the sample median arbitrarily far from 
the true mean of zero, at least half the data points have to be 
replaced with values arbitrarily far from zero. 
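The contrast between the two breakdown points can be seen in a small numerical experiment. The sketch below is purely illustrative: the seven data values and the `contaminate` helper are made up for the example, and it relies only on Python's standard `statistics` module.

```python
import statistics


def contaminate(data, k, value):
    """Replace the first k observations with an arbitrary value."""
    return [value] * k + list(data[k:])


# Hypothetical clean sample centered on zero.
clean = [-2.0, -1.0, -0.5, 0.0, 0.5, 1.0, 2.0]
bad_value = 1e9

one_bad = contaminate(clean, 1, bad_value)     # 1 of 7 points replaced
three_bad = contaminate(clean, 3, bad_value)   # fewer than half replaced
four_bad = contaminate(clean, 4, bad_value)    # more than half replaced

# A single wild value drags the mean arbitrarily far from zero.
print(statistics.mean(one_bad))
# The median stays within the range of the clean data until a
# majority of the points have been replaced.
print(statistics.median(one_bad), statistics.median(three_bad), statistics.median(four_bad))
```

With one or three points replaced, the median remains among the original values. Only when four of the seven points are replaced does it jump to the contaminating value, which is the finite-sample counterpart of the 1/2 breakdown point.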

The highest possible asymptotic breakdown point for an 
estimator of a regression model’s mean parameters is 1/2. We 
use estimators that achieve that maximum for all possible 
ways of distorting the data. We do not use the well-known
minimum absolute deviations estimator (also known as the
$L_1$ estimator) because it has a breakdown point of only 1/n
relative to distortions of the regressors (Rousseeuw and
Leroy 1987, 10-2). A robust estimator called least median of
squares (LMS) (Rousseeuw 1984) does achieve the maximum
breakdown point and is widely used. Western (1995)
discusses high breakdown estimation of linear models using 
LMS and suggests an approach to robust estimation of 
generalized linear models. Christmann (1994) discusses ap- 
plication of LMS to a grouped binomial model (albeit 
without overdispersion). But LMS converges more slowly 
and is less efficient than the estimators we use. 

The high breakdown point of the estimators we use means 
that the estimates of model parameters remain stable even 
when unusual voting processes occur in several of the n 
geographic areas for which we are estimating the model. 
When we use the estimates to compute studentized residuals 
for counties, a large anomaly in one county will not mask 
comparable or perhaps somewhat smaller anomalies that 
occur in other counties. Hampel et al. (1986, 67) discuss the 
relationship between the breakdown point and masking. 
Without a high breakdown point estimator we would under- 
estimate the frequency of highly anomalous election results. 
When we directly interpret the parameters of precinct-level 
models, large departures from the model in several precincts 
will not make the interpretations untrustworthy. 

The robust methods we use also perform well in the 
absence of anomalies. If there are no anomalies, the robust 
estimator is consistent and is almost as efficient as an 
estimator, such as simple iteratively reweighted least squares, 
that ignores the possibility of anomalous observations. 


Two Robust Estimators. To obtain robust estimates of the
parameters of the overdispersed binomial model, we combine
two different estimators. We use one to estimate the
scale, $\sigma = \sqrt{\sigma^2}$, and the vector of coefficients, $\beta$, and then we
use a second to obtain a much more efficient estimate of $\beta$.
The second estimator depends on the first one’s estimate of
the scale, $\hat\sigma$. The first estimator is called the least quartile
difference (LQD) estimator (Croux, Rousseeuw, and Hossjer
1994; Rousseeuw and Croux 1993), and the second is called
the hyperbolic tangent (tanh) estimator (Hampel, Rousseeuw,
and Ronchetti 1981).

Both estimators minimize functions of particular forms of
residuals. Given a vector of estimates $\hat\beta$, let $\hat\pi_i = [1 + \exp(-x_i'\hat\beta)]^{-1}$
denote the estimated probability of voting for
Buchanan in geographic area i. The residual that we use in
the LQD estimator is:

\[ r_i^* = \frac{y_i - m_i\hat\pi_i}{\sqrt{m_i\hat\pi_i(1 - \hat\pi_i)}} \]

Ordinarily, with an overdispersed binomial model in the
absence of outliers, a good moment estimator for $\sigma^2$ may be
defined in terms of $r_i^*$ (McCullagh and Nelder 1989, 126-7,
eq. 4.23). The distributional assumption we make is that for
most of the observations i, computing $r_i^*$ with $\hat\beta = \beta$ would
produce a set of independent normal random variables each
having variance $\sigma^2$. If the distributional assumption held for
all the observations, including in the asymptotic limit as $n \to \infty$,
then the LQD estimate, $\hat\sigma$, would be consistent for the
scale $\sigma$. If some observations (up to half the data) do not
satisfy the distributional assumption, then asymptotically the
difference between $\hat\sigma$ and $\sigma$ remains bounded. The LQD
estimator is defined by choosing $\hat\beta$ to minimize approximately
the first quartile of the set of absolute differences
$\{|r_i^* - r_j^*| : i < j\}$. Further details are given below.

Given a scale estimate $\hat\sigma$, the tanh estimator for $\beta$ is based
on the residual:

\[ r_i = \frac{y_i - m_i\hat\pi_i}{\hat\sigma\sqrt{m_i\hat\pi_i(1 - \hat\pi_i)}} \]


The tanh estimator uses a function $\psi(r_i)$ to downweight
observations that have large residuals. The weight applied to
each observation is defined by:

\[ w_i = \begin{cases} \psi(r_i)/r_i, & \text{for } r_i \neq 0 \\ 1, & \text{for } r_i = 0 \end{cases} \]


The definition of $\psi$ has a complicated functional form that we
state in equation A-2. Here we characterize the weight values
that $\psi$ implies. The weights change qualitatively at thresholds
defined by two constants, p = 1.8 and c = 4.0 (we further
explain the constants below). Observations that have residuals
in the range $-p \le r_i \le p$ have $w_i = 1$: They are not
downweighted. Observations that have residuals in the range
$p < |r_i| \le c$ have weights that gradually diminish to zero as $|r_i|$
approaches c. Observations that have residuals of magnitude
greater than c, meaning $c < |r_i|$, have $w_i = 0$. The tanh
estimator completely rejects such observations so that they
have no effect on the tanh estimate $\hat\beta$.

We define an outlier to be any observation that has $w_i = 0$.

The usual method used to estimate an overdispersed
binomial regression model is an iteratively reweighted least-squares
algorithm, which is equivalent to maximum likelihood
estimation of the coefficients of a standard binomial
regression model with the dispersion $\sigma^2$ estimated subsequently
(McCullagh and Nelder 1989, 114-28). To implement
the tanh estimator we modify that algorithm by weighting
observation i by $w_i$, with $w_i$ computed using the coefficient
estimates from the previous iteration and the LQD estimate
$\hat\sigma$. The estimate of $\sigma$ remains unchanged throughout the
estimation. Numerical convergence is required for both the $\hat\beta$
values and the weights. To start the iterations we use the
LQD estimates for the initial coefficients and use the LQD
values $(r_i^* - \mathrm{med}_j\, r_j^*)/\hat\sigma$ for an initial set of residuals $r_i$
($\mathrm{med}_j\, r_j^*$ denotes the median of the $r_j^*$ values, j = 1, ..., n).

To estimate the asymptotic variance of the coefficient
estimates and hence standard errors, we use the sandwich
estimator $\mathrm{avar}(\hat\beta) = \hat\sigma^2 J^{-1} I J^{-1}$, where I is the outer product
of the score, and J is the Hessian for a standard binomial
likelihood evaluated at $\hat\beta$, weighting each observation by $w_i$.
Justification for this estimate, based on White (1994), appears
below.


Studentized Residuals Definition. To define the studentized
residuals that we use to compare counties, we adjust the
residuals $r_i$ for variation associated with each county’s regressors.
We use the usual adjustment for the influence (or
leverage) each observation $y_i$ has on the estimated expected
vote counts $m_i\hat\pi_i$, modified to take into account the weighting
that occurs with the tanh estimator via $w_i$. The adjustment is
a function of quantities $h_i$, which are defined as follows.
Using W to denote a diagonal matrix that has diagonal entries
$W_{ii} = w_i$, V to denote a diagonal matrix with
$V_{ii} = [m_i\hat\pi_i(1 - \hat\pi_i)]^{1/2}$, and X to denote the n × k matrix of the regressors
(row i of X is $x_i'$), define the matrix $H = VX(X'VWVX)^{-1}X'V$.
If no observations are downweighted, so
that $w_i = 1$ for all i = 1, ..., n, then H is the matrix defined
in equation 12.3 of McCullagh and Nelder (1989, 397). In
that case, each diagonal element of H (i.e., $H_{ii}$) measures the
influence of $y_i$ on $m_i\hat\pi_i$. When observation i is downweighted
($w_i < 1$), the influence interpretation still applies when $w_i > 0$.
If $w_i = 0$, then $y_i$ has no effect on $m_i\hat\pi_i$ and $-H_{ii}$ measures
error variation associated with forecasting observation i.
Hence we define $h_i = H_{ii}$ if $w_i > 0$ and $h_i = -H_{ii}$ if $w_i = 0$,
and the studentized residual is:

\[ \tilde r_i = r_i / \sqrt{1 - h_i}. \tag{A-1} \]


The adjustment by $(1 - h_i)^{-1/2}$ makes the variance of the
residuals $\tilde r_i$ constant for all observations that have $w_i > 0$, i =
1, ..., n. For the outliers ($w_i = 0$) we do not make any
claim about what distribution the residuals may have, but the
adjustment should at least reduce one source of variation
among them.
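
The sign convention for $h_i$ in equation A-1 can be illustrated with a short sketch. It assumes that the diagonal element $H_{ii}$ has already been computed; the function name and the numerical values are hypothetical.

```python
import math

def studentized_residual(r, H_ii, w):
    """Return r / sqrt(1 - h), with h = H_ii if w > 0 and h = -H_ii if w == 0."""
    h = H_ii if w > 0 else -H_ii
    return r / math.sqrt(1.0 - h)

# Hypothetical values: one retained county and one rejected (outlier) county.
print(studentized_residual(r=2.0, H_ii=0.19, w=1.0))   # divided by sqrt(0.81)
print(studentized_residual(r=6.0, H_ii=0.21, w=0.0))   # divided by sqrt(1.21)
```

For a retained observation the adjustment enlarges the residual, because part of its variation was absorbed by the fit; for a rejected observation it shrinks the residual, because forecasting error adds variation.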

Because in the county-level analysis we estimate the pa- 
rameters of the model separately for each state (except for 
four states that have few counties), the studentized residuals 
for counties in a state that are not outliers all have the same 
variance and so are directly comparable. Across states the 
variances differ slightly because of the following technical 
variations in the sampling distributions. The variance for 
each state is approximately the variance of a t-distribution 
with degrees of freedom equal to the number of counties in 
the state that are not outliers. We do not attempt to adjust for 
that source of variation across states because it is negligible 
compared to the variation caused by the seriously anomalous 
processes that occurred in some counties, turning them into 
outliers. In no case would one expect to observe a residual of 
magnitude greater than 4.0—which would make the county 
an outlier—for a nonanomalous county. 


Robust Estimation Method Details


The LQD Estimator and Its Properties. One may understand
the LQD estimator intuitively as an extension of the idea of
using the interquartile range to estimate the dispersion of a
set of data. The LQD estimator focuses on the $\binom{h_k}{2}$ order
statistic of the set $\{|r_i^* - r_j^*| : i < j\}$ of absolute differences,
where $h_k = [(n + k)/2]$ and $\{|r_i^* - r_j^*| : i < j\}$ has $\binom{n}{2}$ elements.
Following Croux, Rousseeuw, and Hossjer (1994) we use the
notation

\[ Q_n^* = \{|r_i^* - r_j^*| : i < j\}_{\binom{h_k}{2} : \binom{n}{2}} \]

to denote that order statistic. For large n and k small relative
to n, $\binom{h_k}{2}/\binom{n}{2} \approx 1/4$, so that $Q_n^*$ is approximately the first
quartile of the absolute differences. To implement LQD we
choose estimates $\hat\beta$ to minimize $Q_n^*$. Let $\hat\beta_{LQD}$ designate the
estimated coefficient vector, and let $\hat Q_n^*$ designate the corresponding
minimized value of $Q_n^*$. The LQD scale estimate is

\[ \hat\sigma = \frac{\hat Q_n^*}{\sqrt{2}\,\Phi^{-1}(5/8)} \]

where $\Phi^{-1}$ is the quantile function for the standard normal
distribution. The normality assumption we make about $r_i^*$ justifies
the factor $[\sqrt{2}\,\Phi^{-1}(5/8)]^{-1}$ (Rousseeuw and Croux 1993).
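
The order statistic and the scale factor can be sketched as follows. This is an illustration only: it evaluates the LQD objective for one fixed set of residuals and does not perform the minimization over coefficient vectors. The function names and the residual values are hypothetical.

```python
from itertools import combinations
from math import comb, sqrt
from statistics import NormalDist

def lqd_order_statistic(residuals, k):
    """Return the C(h, 2)-th smallest |r_i - r_j| over i < j, with h = floor((n + k) / 2)."""
    n = len(residuals)
    h = (n + k) // 2
    diffs = sorted(abs(a - b) for a, b in combinations(residuals, 2))
    return diffs[comb(h, 2) - 1]          # order statistics are 1-indexed

def lqd_scale(residuals, k):
    """Rescale the order statistic by 1 / (sqrt(2) * Phi^{-1}(5/8))."""
    q = lqd_order_statistic(residuals, k)
    return q / (sqrt(2.0) * NormalDist().inv_cdf(5.0 / 8.0))

# Hypothetical residuals for one trial coefficient vector; the last value is a gross outlier.
residuals = [-1.2, -0.7, -0.3, 0.0, 0.1, 0.4, 0.9, 1.5, 40.0]
print(lqd_scale(residuals, k=2))
```

Because only the smaller pairwise differences enter the order statistic, the single gross outlier in the example leaves the scale value close to the spread of the remaining residuals.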


The LQD objective function is difficult to optimize. Be- 
cause high breakdown point estimators are not smooth 
functions of the data, optimization typically does not rely on 
derivative information but instead depends on combinatorial 
algorithms (Stromberg 1993). We use the global optimizer 
GENetic Optimization Using Derivatives (GENOUD) 
(Sekhon and Mebane 1998), which combines global evolutionary
algorithm methods with a local, quasi-Newton
method to solve difficult unconstrained optimization problems.31
Sekhon and Mebane discuss the theoretical basis for
expecting GENOUD to find the global optimum when the 
objective function, as in the LQD case, is not a smooth 
function of the data. They also present Monte Carlo exper- 
iments and examples that demonstrate the algorithm’s effec- 
tiveness. The evolutionary algorithm component of GE- 
NOUD excels at finding the neighborhood of the optimum 
even when the objective function is highly irregular. Local 
derivative information, when it exists, is useful for efficiently 
going from a neighborhood of the optimum to the optimum. 
Even when the derivatives provide no useful information, the 
evolutionary algorithm component of GENOUD is able to
find the global optimum. Optimization is more efficient, 
however, when the derivatives are informative. In the LQD 
case, derivatives provide useful local information. Therefore, 
the quasi-Newton component of GENOUD significantly 
improves the efficiency of the overall optimization algorithm. 
But the global optimization properties of GENOUD come 
from its evolutionary algorithm component. 

For a linear regression model, Croux, Rousseeuw, and
Hossjer (1994) show that the LQD estimator converges at a
rate of $n^{-1/2}$ and has a Gaussian efficiency of 67.1% for all the
coefficients except the intercept, which in a linear regression
model LQD does not estimate.32 Gaussian efficiency refers to
the efficiency of the estimator when the disturbance is an
identically and independently distributed Gaussian random
variable with conditional mean zero (which implies that the
model is correctly specified for all observations). LMS, in
contrast to LQD, converges at the slower rate of $n^{-1/3}$ and
has a Gaussian efficiency of 37.0% (Rousseeuw and Croux
1993, 1279). LQD also provides a good estimate of the
dispersion when the disturbance has an asymmetric distribu- 
tion, because LQD does not estimate the scale by measuring 
a symmetric spread of the residuals around a central location 
value (Rousseeuw and Croux 1993). 

To use LQD to estimate $\sigma^2$ we need to define residuals
that are reasonably well described by a reference model of
normality with zero mean and variance $\sigma^2$. If the overdispersed
binomial model correctly describes the mean and
variance for all the data, then given any consistent estimate
for $\beta$, the residuals $r_i^*$ are approximately normal with the
desired mean and variance.

The approximate normality of $r_i^*$ depends on conditions
such as independence across i and sufficiently large values for
$m_i\pi_i(1 - \pi_i)$. In view of the small proportion of the votes that
Buchanan received, it is important to consider for how small





31 See http; harvard.eda/rgenoud/ for an R version of the
GENOUD software and http:/sekhon_fas harvard.edu/genoud/ for
more information.

32 In the linear model the constant cancels in the differences $(r_i^* - r_j^*)$.
In the binomial model such cancellation
does not occur because of the nonlinear factor $[\hat\pi_i(1 - \hat\pi_i)]^{-1/2}$ in
$r_i^*$. Hence, the LQD estimator of the overdispersed binomial model
has information about the constant.

a value of $m_i\pi_i$ the normality model is plausible. Notice that
$X^2 = \sum_{i=1}^{n} (r_i^*)^2$ is the Pearson chi-squared statistic. Larntz
(1978, 255-6) enumerates the exact distribution of $X^2$ for
binomial data and concludes that, even for sample size n as
small as 10 or 15, inferences reached by using the asymptotic
chi-squared distribution are reasonable when all expected
values $m_i\pi_i$ and $m_i(1 - \pi_i)$ are greater than 1.0. We take
such results as supporting our use of the reference normal
model for the county data. No state has more than a few
counties with $m_i$ so small that the expected vote for Buchanan
is less than 1.0.

The case is less clear for the precinct data. Because the
expected proportion of the vote for Buchanan in PBC is
about 0.001 (see Table B-1), a typical precinct would have to
have $m_i$ = 1,000 voters to have one expected vote for
Buchanan. Among the 515 election-day precincts in PBC, 172
have $m_i$ > 1,000, but 56 have $m_i$ < 100. Koehler and Larntz
(1980, 337) show (for Poisson variables) that the Poisson
information kernel declines rapidly to zero as the
value goes from one to zero. Koehler and Larntz (p. 338)
observe that, consequently, the asymptotic (in n) mean of the
likelihood-ratio chi-squared statistic “can be much smaller
than the chi-squared mean when many expected frequencies
are smaller than one-half”; McCullagh (1986) demonstrates
one important respect in which the chi-squared approximation
survives better for the Pearson statistic than for the
likelihood-ratio statistic as $m_i$ decreases to one. That may
suggest that our estimates of $\sigma$ are biased somewhat downward
for the precinct data.


The Tanh Estimator and Its Properties. To estimate $\beta$, a tanh
estimator that uses the LQD scale estimate is more efficient
than the LQD estimator alone. In addition to achieving the
maximum breakdown point, tanh estimators minimize the
asymptotic variance of the estimates for a given upper bound
d on how sensitive the variance is to a change in the
distribution of the data. Hampel et al. (1986, 125-36) explain
the concept of the change-of-variance function that makes
the foregoing concept of sensitivity precise. Given a good
estimate of the scale, tanh estimators are by definition the
most efficient possible estimators of location that have a finite
rejection point, which means that some observations may
receive zero weight, and the key robustness property of
bounded response to arbitrary in parts of the data
(Hampel et al. 1986, 166). Hampel, Rousseeuw, and
Ronchetti (1981) prove existence and optimality properties
of tanh estimators (see also Hampel et al. 1986, 160-8).
Hampel et al. (1986, 328) conjecture that the tanh estimator
of location extends directly to a tanh estimator for linear
regression. We are applying the estimator to a nonlinear
regression model.

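The tanh estimator is a redescending M-estimator of
location that is based on the function

\[ \psi(u) = \begin{cases} u, & \text{for } 0 \le |u| \le p \\ (A(d-1))^{1/2}\tanh\left[\tfrac{1}{2}\left((d-1)B^2/A\right)^{1/2}(c - |u|)\right]\mathrm{sign}(u), & \text{for } p \le |u| \le c \\ 0, & \text{for } c \le |u| \end{cases} \tag{A-2} \]
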
where p, c, d, A, and B are constants satisfying 0 < p < c and
other conditions (Hampel et al. 1981, 645). For c > |u|, the
sign of $\psi(u)$ equals the sign of u. For u increasing from p to
c, $\psi(u)$ descends from its maximum value $\psi(p) = p$ to
$\psi(c) = 0$, and for u increasing from −c to −p, $\psi(u)$ descends
from $\psi(-c) = 0$ to its minimum value $\psi(-p) = -p$.
Hampel, Rousseeuw, and Ronchetti (1981, 645) and Hampel
et al. (1986, 162) visually depict the shape of $\psi$. Optimality
requires that along the curves between $\psi(p)$ and $\psi(c)$ and
between $\psi(-c)$ and $\psi(-p)$, the value of $\psi$ is such that there
is a constant ratio between the sensitivity of the variance to a
change in the data and the asymptotic variance. Indeed, the
value of the ratio is d. Given a choice of d and of the
truncation point c, minimizing the asymptotic variance of the
estimator while satisfying such conditions implies unique
values for p, A, and B of (A-2) (Hampel et al. 1986, 162-4). We
use c = 4.0 and d = 5.0, which imply values p = 1.8, A =
0.86, and B = 0.91 (Hampel, Rousseeuw, and Ronchetti
1981, 645, Table 2).33
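
The shape of the weights implied by equation A-2 can be tabulated with a short sketch. It assumes the rounded constants quoted in the text (p = 1.8, c = 4.0, d = 5.0, A = 0.86, B = 0.91); the function names are illustrative and this is not the code used for the estimates.

```python
import math

# Constants as rounded in the text: c = 4.0 and d = 5.0 imply p = 1.8, A = 0.86, B = 0.91.
P, C, D, A, B = 1.8, 4.0, 5.0, 0.86, 0.91

def tanh_psi(u):
    """Piecewise psi function of equation A-2."""
    size = abs(u)
    if size <= P:
        return u
    if size >= C:
        return 0.0
    amplitude = math.sqrt(A * (D - 1.0))
    rate = 0.5 * math.sqrt((D - 1.0) * B * B / A)
    return math.copysign(amplitude * math.tanh(rate * (C - size)), u)

def tanh_weight(r):
    """Weight psi(r)/r, taken to be 1 at r = 0, where psi(r) = r."""
    return 1.0 if r == 0 else tanh_psi(r) / r

for r in (0.0, 1.0, 1.8, 2.5, 3.5, 4.0, 6.0):
    print(r, round(tanh_weight(r), 3))
```

Because the constants are rounded to two digits, the two branches of `tanh_psi` meet only approximately at |u| = p; with more digits for p, A, and B the function would be continuous there.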

Given a scale estimate $\hat\sigma$ and a vector of trial estimates $\hat\beta$,
we compute the residuals $r_i$ and then weights $w_i$. Observation
i is weighted by $w_i$ in what is otherwise the usual iteratively
reweighted least-squares algorithm to estimate $\beta$. Full iteration
makes this weighted estimator equivalent to the tanh
M-estimator (Hampel et al. 1986, 116). Because redescending
M-estimators such as the tanh estimator have multiple solutions,
starting values affect the results.


Asymptotic Covariance Matrix of Tanh Coefficient Estimates.
The sandwich estimator $\mathrm{avar}(\hat\beta)$ is valid for the robust
estimator insofar as the conditions for Theorem 6.4 of White 
(1994, 92) hold. The necessary assumptions are 2.1 (complete 
probability space), 2.3 (measurability, compact parameter 
space, and continuity), 3.1 (uniform convergence), 3.2’ (inte- 
rior identifiably unique maximizers), 3.6 (continuous differ- 
entiability), 3.7(a) (uniform convergence of score vector), 3.8 
(uniform convergence of Hessian), 3.9 (negative definite 
Hessian), and 6.1 (score obeys central limit theorem with 
positive definite covariance matrix). Hampel et al. (1986, 82) 
directly assume 2.1 and 2.3. The assumption of Fisher
consistency (p. 83) implies 3.1. Given a suitable model 
parameterization, their Theorem 5 (pp. 160-2) implies 3.6 
and 3.7(a) for all observations that are not outliers, and 
optimization by Newton’s method from suitable starting 
values (compare p. 152) implies 3.2’, 3.8, 3.9, and 6.1. 


APPENDIX B: COUNTY-LEVEL ELECTION 
RETURNS DATA SOURCES AND TABLE OF 
OUTLIERS 


For AL, AR, AZ, CO, CT, GA, IA, ID, IL, IN, KS, KY, LA, 
MA, MD, MN, MO, MS, MT, ND, NE, NH, NJ, NM, NV, 
NY, OH, OR, PA, RI, SC, SD, TN, TX, UT, VA, WA, and 
WI, the data are certified (or “official”) election results from 
each state’s Secretary of State (or comparable office), or 
results reported on the Secretary of State’s website, originally 
collected by David Leip (with updates as of December 16,
2000) and posted at http://www.uselectionatlas.org (accessed
January 10 or 21, 2001). For NY we obtained data separately
for the City of New York. For ME (December 18, 2000)
and WY (December 20, 2000) we accessed data from
http://www.uselectionatlas.org on the indicated dates.

For the remaining states except AK we obtained data from 
official websites, as follows: CA, California Secretary of State, 





33 Hampel, Rousseeuw, and Ronchetti (1981) use k for the tuning
parameter that we have denoted by d. The same information
about the parameters appears in Hampel et al. (1986, 163,
Table 2) with notation r and k used for the parameters we have
denoted by c and d.

34 http:/Avww.vote.nycny. te/2000General/allg2000
pdf (accessed February 21, 2001)

TABLE B-1. U.S. Counties with the Largest Positive and Negative Studentized Residuals from
2000 Vote for Buchanan


Studentized   Vote Proportion     Number
Residual      Expected  Actual    of Ballots


36.14 0.0010 0.0079 433,186 
28.26 0.0013 0.0379 
19.53 0.0096 0.0528 
18.16 0.0048 0.0289 
15.88 0.0037 0.0080 
15.57 0.0046 0.0286 
14.56 0.0066 
14.23 0.0049
13.67 0.0010 
12.52 0.0018 
12.25 0.0010 
11.67 0.0009 
11.49 0.0038 
10.35 0.0054 
10.25 0.0031 
9.89 0.0057 
9.89 0.0100 
9.87 0.0048 
9.68 0.0012 
9.41 0.0034 
9.30 0.0005 
8.95 0.0052 
8.55 0.0014 
8.50 0.0052 
8.20 0.0043 
0.0074 
7.60 0.0049 
7.48 0.0034 
0.0010 
7.33 0.0048 
7.22 0.0041 
0.0011 
0.0034 
8.95 0.0050 
6.95 0.0063 
8.59 




8.39 
6.35 


5.99 
5.84 


State 
FL 
SC 
KS 
WV 
NV 
WY 
OH 
OH 
MN 
NE 
WI 
MN 
IA
WV 
WV 
AZ 
MT 
KS 
MO 
KS 
MO 
AR 
MO 
AR 
IA 
OH 
CO
CO
FL 
NE 
WI 
NM 
SD 
NY 
NE 
MD 
KS 
CO
VA 
NE 
WI 
TX 


Dubuque 
Allegheny 5.48 
Boone 


Whitfield 5.35
5.34 


5.09 
4.96 
4.85 
4.84 


4.60 
4.55 
4.54 
4.48 


4.22
TABLE B-1 (Continued)


MD 
AL 
GA 
OH 
IA
TX 
GA 
WY 
WY 
LA 
OR 
AL 
LA 
LA 
Note: 




Election 2000%; DE, Secretary of State, Department of 
Elections; FL, Florida Department of State, Division of 
Election, “Data Download Utility”37; HI, State of Hawaii
Office of Elections; NC, North Carolina State Board of
Elections, Official Results39; OK, Election Results and Statistics,
2000, Oklahoma State Election Board; VT, 2000
Vermont Election Results, State of Vermont, Office of the
Secretary of State, Elections and Campaign Finance Division41;
WY, 2000 Official Election Results, Wyoming Secretary
of State, Election Administration.42

For AK, we use the 25 county equivalents that were in 
effect during the 1996 election, leaving Denali Borough 
within Yukon-Koyukuk Census Area, and Yakutat City and 
Borough within Skagway-Hoonah-Angoon Census Area. We 
aggregate precinct vote data to create the county-equivalent 
units. Precinct-level results for 2000 are in pdf files dated 
December 5, 2000,43 and for 1996 they are in text files dated
November 27, 1996.44 To map the data into county equiva- 
lents we used voting district and state legislative district data 
provided by the Census Bureau. 


35 http://vote2000.s8.ca.gov/Returns/pres/59-htm (accessed January
21, 2001).
36 http: /wrww.state.de.us/election/index.htm (accessed June 26,
2001).
37 http://enight.dos.statefLus/report.asp?Date= 001107 (accessed
January 21, 2001).
38 http./Avww.state_hius/electiona/ (accessed July 24, 2001).
39 http://www .sboo-state.nc.us/y2000elect/stateresults.htm (accessed
January 22, 2001).
40 http:/www.state.ok.us/~elections/00result-html (accessed April 4,
2001).
41 -//vermont-elections.org/2000geresultshtm (accessed January
22, 2001).
42 http://soswy state.wy.us/election/2000/results/g-usp.htm (accessed
June 19, 2001).

“s http//factfinder.census gov/serviet/DTGeoSearchByListServiet?ds_ 
name=DEC_2000 PL U&stato=dt&_lang=en&_ts=11623041369 
(accessed June 19, 2001).

Robert Penn Warren’s All the King’s Men is widely
held to be the greatest American political novel
(e.g., Blotner 1966, 221, 225-6; Davidson 1967,
411; Whalen-Bridge 1998, 172, 175). Nevertheless, it
has not been the focus of sustained explication in the
political science literature.1 The book has received a
great deal of critical examination, but the literary
interpretations may explain why political scientists
have not looked to the novel for political insight. In
short, literary scholars tend to discount the political
character of All the King’s Men as being either too
simple or perhaps too insidious to yield significant
political insight, in spite of ample evidence that the
novel may have a political teaching. In fact, John Burt
(1988) persuasively argues that all of Warren’s work
indicates great interest in political problems arising out
of America’s struggles to realize the promise of democracy.
In this essay, I take issue with the weak analysis of
the political character of All the King’s Men and show
why it should be studied carefully by political scientists
who wish to understand American democracy.
Several recent studies argue that literary works are
particularly well suited to explore the effects of the
American regime (in the broad sense) on our political
and private lives.2 The classical approach characterizes
political regimes by the people who hold power, their


King’s Men only briefly: “Neither work [Advise and Consent and The
Last Hurrah]


popular origins as well as the ambiguously moral effects of American
politics, however, the way Robert Penn Warren’s All the King’s Men
does through the musings of the narrator, Jack Burden.”

2 On the reading of literature as a guide to understanding political
life broadly interpreted, see Blotner 1966; Brown 1977; Dannhauser
1995; Howe 1957; Whalen-Bridge 1998; and Zuckert 1990, 1995.


methods of using that power, and the ends for which 
power is exercised. Plato, Aristotle, and other classical 
thinkers tried to demonstrate how the shape of partic- 
ular regimes has a decisive effect on the character of 
citizens (see Cantor 1995, 192-5; Zuckert 1981, 684, 
700-2). In many respects, this type of analysis may 
appear congruent with the modern approach to litera- 
ture advocated by the New Historicists, but there is a 
significant difference. The classical political thinkers 
insisted that some people could free themselves from 
the prejudices of the political society around them and 
write self-consciously about both its good and bad 
features (Cantor 1995, 192-3). Zuckert (1981, 706) 
argues that this rare insight may be best communicated 
through literary presentations that provide “the means 
of relating . . . characters and character development to 
political reality.” I will show that Warren’s novel is best 
understood as an illustration of how a particular type of 
democratic regime (closer to our own than we would 
like to admit) shapes certain human beings. Insofar as 
our political activity shapes our character and to the 
extent this shaping may be explained and understood, 
however limited our ability to gain objectivity, we must 
consider the formative consequences of our political 
actions. 

When we examine All the King’s Men from the broad 
perspective of the classical approach to the study of the 
character of the regime, all the disparate parts of 
Warren’s novel can be explained as part of a consistent 
whole that is both a thoroughly political story and a 
great work of American political thought. I argue that 
the purpose of the novel is twofold: to illustrate a set of 
political dangers inherent in certain commonly held 
views about the character of American democracy and 
to show how these dangers are manifested both in the 
practice of our politics and in the character of our 
citizens. Warren presents his unnamed southern state 
as a particular democratic regime, and a careful anal- 
ysis of his presentation shows how that regime is 
related to our own political situation broadly under- 
stood. 

I begin by explaining how this approach to the novel 
may answer some of the outstanding questions in the 
scholarly literature on the work. I then analyze Willie 
Stark’s speeches to reveal the character of his politics 
and the circumstances that appear to make necessary 
his turn toward morally problematic political methods.
Next, I demonstrate how Stark’s conversation with
Adam Stanton reveals the connection between his
political theory and the view of the human condition on
which his political activity is based. I then discuss the
specific influence of Stark’s politics on the character of
the narrator, Jack Burden, whose moral and intellectual
crises permeate the work. Finally, I lay out the
broad consequences of this connection between Stark’s
politics and Burden’s intellectual and moral transformation.


IS ALL THE KING’S MEN A POLITICAL 
NOVEL? 


All the King’s Men is generally thought to be about “the 
corrupting nature of power” and how power’s inevita- 
ble effects transform “a well-intentioned, idealistic 
back-country lawyer unable to resist the greed for 
power and lust for politics” into “an American dema- 
gogue.”3 The idealistic lawyer whose rise and fall is said 
to illustrate the awful truth of Lord Acton’s celebrated 
axiom is Willie Stark, the governor of an unnamed 
southern state. According to this view, the key action of 
the novel is the terrible corruption that destroys the 
character of Stark. The novel must be understood as 
political in the sense that it argues against the idea that 
moral decency can ever coexist with political power; 
therefore, the practice of politics and the pursuit of 
powerful positions are inherently corrupting activities 
that ought to be avoided by moral persons. Ealy’s 
(1999, 6) interpretation is consistent with this view: 
“[Stark] began his political career by claiming that a 
hick must depend on himself and no one else, and 
offered himself as the hick spokesman for the rest of
the hicks. Along the way he forgot about his political
roots, and became engaged in political infighting to
promote his own power, not to fulfill a hick political
agenda.”4 There are good reasons to think that this is
not the decisive political teaching of the novel.

Two features of the novel undermine any attempt to 
read it as a simple morality tale about the evils of 
corrupting power. First, Warren’s portrayal of Stark is 
far from consistently admonitory. Many characters in 
the book speak favorably of Stark even after his turn to 
a form of demagogic tyranny, which has led some 
commentators to suggest that Warren is an apologist 
for the kind of authoritarianism attributed to Huey 
Long (1893-1935), the inflammatory governor and 
senator from Louisiana. It is argued that Burden, who 
makes many of the most i ents in favor of 
a sympathetic view of Stark, should be understood as a 


“thinly disguised authorial spokesman” (Baumbach 


3 The quotations are from the back cover of the Harvest paperback
edition of All the King’s Men (Warren [1946] 1974). All page
references to the novel correspond to this edition.

4 For other of this common interpretation, see Cleopatra
1985, esp. 108-9; Kaplan [1965] 1987, esp. 10; Whalen-Bridge 1998,
174-5. For a comprehensive survey of reviews when the novel was
released, almost all of which took this view, see Heilman [1947] 1977.
This interpretation appears to have informed the 1949 Oscar-winning
movie, which completely transformed Stark into a demagogue
obsessed with gaining power for its own sake.

[1965] 1987, 20). If Burden is Warren and approves of
Stark, then Warren must approve of Stark. Moreover,
if Stark is Long and Burden/Warren approves of
Stark/Long, then Warren must approve of Long. Quod
erat demonstrandum.

Reactions to this reading of the novel range from
mild scolding to the branding of Warren as a neofascist 
who advocates an American type of democratic Hitler- 
ism (Baumbach [1965] 1987, 20; Heilman [1947] 1977, 
22-4; Wilson 2000). Warren was so sensitive to these 
charges that he repeatedly insisted Stark was not Long 
and his view of Stark was not wholly favorable. He even 
went so far as to explain how the vicissitudes of the 
academic job market forced him to accept a position at 
Louisiana State University in 1936, as though to have 
done so by choice would have constituted a de facto 
love of Long, who was the university's patron (Warren 
[1964] 1965, 75-9; [1953] 1977, 97). The idea that the 
novel is Warren’s literary apologia for Long also ap- 
peared in the political science literature around the 
time of the novel’s release (see Dauer 1948, 334 n. 30; 
Irish 1952, 133).5 

Second, we must consider why the grand and terri- 
fying political tale that occupies the foreground of the 
text is presented through the eyes of Burden, a 
thoughtful narrator whose reflections on and reaction 
to Stark’s career appear to be central to the novel’s 
purpose. We see through Burden’s eyes the historical 
trajectory that brought Stark to power and eventually 
his end, but we are inundated with the story of Bur- 
den’s past, including a chapter on his research of his 
doctoral dissertation and another on the unhappy story 
of his first love affair. In these chapters, Stark and his 
political activity are pushed into the background, and 
Burden’s own struggles with self-knowledge take cen- 
ter stage. These seeming digressions from the story of 
Stark’s corruption by power puzzle many interpreters. 
Some of them wish Burden away as a nuisance who 
dilutes the political message, distracts us from Stark’s 
true character, or deprives the novel of its consistency 
(Heilman [1947] 1977, 19-20, 24). 

A sophisticated interpretation along these lines is 
advanced by Richard King ([1980] 1987), who depicts 
the entire book as moving away from a compelling 
story about politics and into an incoherent tale of 
personal psychology. He contends that what begins as 
Burden telling of “a poor boy who rises to power, his 
attempts to break the ‘interests,’ and the personal, 
political, and moral costs involved” would have made 
“a fine and rather unique southern (and American) 
novel,” but the oedipal story of Burden’s search for his 
father confuses any real consideration of Stark’s activ- 


5 For discussions about how closely Stark is modeled on Long, see 
Graham 1970, 205; Gray [1972] 1987, 94-8; Johnson 1980; Payne 
1968. Warren’s comments on the subject can be found in “All the 
King’s Men: The Matrix of Experience” ([1964] 1965) and in his 
preface to the 1953 edition ([1953] 1977). Note the key statement: 
“Certainly, it was the career of Long and the atmosphere in 
Louisiana that suggested the play that was to become the novel. But 
suggestion does not mean identity” (Warren [1953] 1977, 97). Some 
early reviewers thought the congruity should have been perfect and 
lamented that the novel was “not biographically accurate” (Heilman 
[1947] 1977, 21, emphasis in original). 
ity. King argues that “the two stories are only tenuously 
connected” by a linking device that “is not entirely 
convincing.” He concludes: “Thus what promised to be 
a profound political novel ends by being swallowed up 
by a private quest for identity” as “Warren seems to 
lose interest in the political issue altogether” (pp. 
153-4). Critics who concur with King find the work 
internally inconsistent and poorly suited for teaching us 
anything about politics (e.g., Baumbach [1965] 1987). 

This hardly constitutes an exhaustive survey of the 
scholarly literature on All the King’s Men, but it shows 
that questions about the role of Burden’s character, 
Burden’s praise of Stark, and the connection between 
Burden’s intellectual and moral crises and Stark’s 
political activity lie at the root of the confusion about 
the political character of the novel. When we recognize 
why the story of Burden, as Burden himself insists (pp. 
157, 435), is inseparable from the story of Stark’s 
political career, we can see that Warren illustrates how 
certain elements of our particular political regime are 
“formative,” which is to say that they “shape the way of 
life of citizens living under them” (Cantor 1995, 192; 
also see Zuckert 1981, 702). I will argue that the work 
demonstrates how Burden’s time with and study of 
Stark shapes his view of the world and why this 
particular shaping is symptomatic of a shaping that 
occurs in more subtle ways throughout American po- 
litical culture. In order to accomplish this, I will next 
explain the nature of Stark’s politics before turning to 
a discussion of Burden’s transformation. 


THE RHETORIC AND SUBSTANCE OF 
WILLIE STARK’S POLITICS 


The first step in establishing the political teaching of 
the novel is to understand the true character of Stark’s 
politics. We must distinguish between what is admira- 
ble in Stark’s politics and what is not, and we must 
understand why these elements are necessarily com- 
bined. The common view of Stark as a demagogue 
overlooks the obvious fact that his administration is 
preceded and followed by ones that are equally corrupt 
and clearly less interested in the public good. Stark’s 
political goals—an equitable tax plan, streamlined gov- 
ernment, good roads, good schools, better health care, 
and a free public hospital—all appear to be laudable. 
The only people who oppose them are the wealthy, 
who do not wish to pay for them for narrow self- 
interested reasons, or members of the corrupt political 
machines that Stark displaced (pp. 78, 123-5, 233, 
392-4). If there is a political order that can better 
address the problems facing the state, it is only hinted 
at in the novel (p. 436; discussion below). 

All the King’s Men is a line from the old nursery 
rhyme “Humpty Dumpty.” If Stark is the “King” in the 
title, then it may be interesting to note that the rhyme 
does not treat the king as a tyrant.6 It never suggests 
that the king’s goal of rescuing the fallen egg-man is 


6 It seems obvious that Stark is the “king,” because the novel 
chronicles the lives and actions of “his men” (Jack Burden, Sugar 
Boy, Sadie Burke, et al.). Contrast Ruoff [1957] 1977, esp. 86. 


anything but laudable, although he is unable to accom- 
plish it.7 All the military might of a kingdom is de- 
ployed to save one creature, but Humpty-Dumpty is 
doomed. The rhyme suggests that there are limits to 
what political power can accomplish. I believe the 
novel’s message is consistent with that of the rhyme. 
This is ultimately a story about what Stark cannot do. 
Paradoxically, the common reading of the novel holds 
that Stark accumulates too much power, but my inter- 
pretation suggests that he has too little. He fails to 
accomplish much that he had hoped to do, and his 
character, as well as the character of the people around 
him, is monstrously transformed by his attempts to 
overcome this weakness. 

It cannot be denied that the common characteriza- 
tion of Stark as a demagogue has a kernel of truth. He 
employs passionate oratory, at least in part, to secure 
his own political power. But a careful analysis of the 
evolution of his political rhetoric and its culmination in 
his most revealing speech, on the night after the 
legislature fails to impeach him, reveals what separates 
him from the most usual type of demagogue. Given the 
common opinion that Stark is a demagogue, it is surpris- 
ing that no commentator has undertaken a systematic 
reading of his speeches. Some critics assume that Stark 
has no other “true voice” save his ranting cries for 
hammers and meat-axes (e.g., Baumbach [1965] 1987, 
26-7), but a careful study of his rhetoric shows that too 
much attention to the violent imagery of some of his 
speeches leads critics to overlook the consistent mes- 
sage that underlies all his speeches and that informs all 
his political actions. 

Warren reports only four of Stark’s speeches explic- 
itly: at the courthouse in Mason City (pp. 9-11), his 
surprising political transformation at Upton (pp. 90-4), 
just before the impeachment crisis (pp. 145-7), and 
the night after the legislature fails to impeach him (pp. 
147-55, 261-2). Each is a masterpiece of the orator’s 
art and is designed to work upon the hopes and fears of 
the audience, an audience that may have more in 
common with ourselves than we may care to admit. 
Each is anchored by a special connection between the 
speaker and the people that is based on empathy and 
fellow feeling. Each uses the sanctity of the common 
man’s feelings and needs to justify savage political 
tactics as the necessary means to bring the people’s 
wishes to fruition. 


The Origins of Stark’s Politics 


We cannot, however, really understand the speeches 
that Warren shows us except in the light of the oration 
he only reports secondhand, Stark’s stump speech from 
his first and unsuccessful run for governor. In 1926, Joe 
Harrison, the leader of one of the two major factions in 
the state, arranges to have Stark run for governor. The 
purpose is to split the so-called hick vote so that 


7 There may be some ambiguity about who has “had a great fall” in 
this formulation of the title, but I assert that the “egg” is the state, 
which is loosely based on Louisiana but never explicitly named. See 
Warren [1957] 1965, 75-81. Contrast Ruoff [1957] 1977, esp. 85. 
Harrison’s urban constituency will carry him to victory 
in a three-way race. Stark is not aware that he is being 
set up and launches a statewide tour to deliver his stock 
speech. It is “awful” (pp. 71, 75), packed full of figures 
and statistics that bore and confuse his audiences (p. 
71), and completely lacking in humor or popular 
appeal (p. 73). Stark practices this speech for hours and 
tries to refine each line into a “Gettysburg Address” (p. 
70). Like Lincoln’s famous oration, it appears to have 
been a remarkably high-minded speech that turned on 
the connection between grand principles and particular 
policies. Stark hopes to persuade the people to adopt 
the policies, and the candidate who promotes them, by 
an appeal to their belief in and commitment to those 
grand principles. It is a complete and utter failure. 

This first speech, with all its defects, contains most of 
the platform that will characterize Willie’s later suc- 
cesses. Stark advocates a new and more progressive tax 
system, a higher rate of return to the state on oil leases, 
a better road system, and improved schools (pp. 69, 
70-1, 78, 90, 136). All Stark’s goals, except the plan for 
the great hospital, are present at this early stage in his 
career. Even the core idea that leads to the hospital— 
medical needs should be met regardless of ability to 
pay—is suggested by the mention of a public health bill 
that was passed before Stark ever promised to build the 
hospital (pp. 136, 139). The difference between the 
unsuccessful “Cousin Willie” and the all-powerful 
“Boss” is all about means; Stark’s ends never change.8 

When Stark expresses incredulity that people pay no 
attention to his explanation of issues that affect them 
personally, Burden tries to give Stark advice on the 
character of his audience and what type of speech 
might be effective: 


Hell, make ’em cry, make ’em laugh, make ’em think 
you're their weak erring pal, or make ’em think you're God 
almighty. Or make ’em mad. Even mad at you. Just stir ’em 
up, it doesn’t matter how or why, and they'll love you and 
come back for more. Pinch ’em in the soft place. They 
aren't alive, most of ’em, and haven’t been alive in twenty 
years ... so it’s up to you to give ’em something to stir ’em 
up and make ’em feel alive again. ... But for Sweet Jesus’ 
sake don’t try to improve their minds (p. 72). 


As Burden explains it, the problem is not in Stark’s 
figures or arguments but in his failure to set his speech 
at the level of his audience. Burden argues that the 
people are not interested in “improving their minds” 
and can only be moved to act by appeals to their 
passions. Our contemporary political campaigns are 
plagued by the inability of candidates to attract voters’ 
attention with their most balanced or well-reasoned 
arguments. We may lament the fact, but there is no 
escaping the reality that complex policies, like Stark’s 
tax plan, appear to be most successfully advocated 
when they are condensed into something like Burden’s 
advice to “tell ’em you’re gonna soak the fat boys, and 
forget the rest of the tax stuff.” 

Burden later explains that Stark “flattered human 





8 “The most engaging thing about Willie is his candor about the 
immorality of his means. The most difficult thing about him is the 
sincerity of his devotion to his ends” (Burt 1988, 142). 
nature. He assumed that other people were as be- 
mused by the grandeur and as blinded by the light of 
the post to which he aspired, and that they would only 
listen to argument that was grand and bright” (p. 69). 
Stark wants to give speeches worthy of a governor and 
to prove his own worthiness by displaying his mastery 
of the policies that he will implement and his devotion 
to the highest principles of democratic self-governance. 
Burden ascribes Willie’s failure to move the people to 
the gulf between his dignity and their complete disre- 
gard for dignity. So long as the people do not have any 
sense of their own dignity, they will be incapable of 
acting nobly. An appeal to principle has no traction 
among people who are disillusioned about principles 
and unwilling to sacrifice for them. At Gettysburg, 
Lincoln succeeded in the difficult task of ennobling his 
audience and raising them to the level of the speech 
that he wished to give, but Stark never succeeds at 
elevating his audience. In order to speak to them on 
their level, his view of human nature has to be lowered. 
It is lowered at Upton. 

At this point, Stark thinks that higher offices call for 
a higher dignity (p. 69). Therefore, he has to be shown 
that the politics of governors and legislatures are as 
corrupt as the machine politics in his home county. 
Before Sadie Burke inadvertently reveals that his can- 
didacy is a decoy, Stark is about to drop out of the race, 
convinced that he is simply incapable of making the 
people recognize what they need within the means 
appropriate for seeking an elevated office such as 
governor (pp. 80-1). When Sadie lets the truth slip, she 
lays bare the nature of gubernatorial politics. Stark 
infers that all means are appropriate in a race for 
governor and rejects the idea, which he had found in 
textbooks, that the great story of American political 
history is a story of political titans speaking and acting 
purely before a decent and responsible people (pp. 
67-9). Later, Stark states confidently that there is no 
way for a governor to do anything if he insists on 
keeping his dignity (p. 38). His rejection of the idea 
that human beings can be moved by dignity marks a key 
transition in his approach to politics. It also marks a 
key turn in Stark’s private morals: He rejects any 
concern for his own dignity and indulges the desire to 
drown his sorrows after discovering he has been used. 
Previously distinguished by his complete abstinence 
from alcohol (pp. 16-8), Stark gets drunk, and the next 
day he must get drunk again in order to make his 
speaking engagement. 

Stark begins his speech at Upton by utterly dismiss- 
ing the details about tax reform, poverty relief, or road 
construction that had characterized his earlier speech. 
He proclaims to the assembled mass of country folk 
that “you are the state” and “you know what you need” 
(p. 90). In Stark’s new politics, it is not necessary to go 
into policy details; it is simply necessary to assure the 
people that the policies will serve their needs. Willie 
establishes this by telling them the story of how he was 


9 My argument that Upton is a crucial turning point ought to be 
contrasted with the view that Stark’s speeches reveal a slow but 
steady change in his view of the world (e.g., Ealy 1999, 2). 
persuaded to run for governor and Harrison’s plan to 
split the country vote. His fundamental message is that 
he shares their background and knows their struggles: 
“I am a hick and a red-neck just like you” (pp. 91, 
94-5). He uses the fact that he has been used to 
establish his credentials as being just like the people to 
whom he speaks, and he assures them that their 
problems are attributable to the fact that they also are 
being misused and oppressed by the political machines, 
the city people, and the rich. 

The new speech is delivered without a script, and the 
passions to which it appeals are those of fear and 
indignation. Stark explains that he will not run for 
governor this time, but he sets out to destroy Harrison 
by alerting all “fellow rednecks” to the disgraceful way 
that the machine has treated them. Stark also puts 
MacMurfee “on notice” that unless things get better 
for the poor country folks, he will be held responsible 
at the next election (p. 93). The special connection with 
the people that Willie establishes in the Upton speech 
remains the bona fides of his trustworthiness through- 
out his successful political career. The “not a speech” 
oration in Mason City is entirely about his common life 
with the poor people (pp. 9-11), the staged pictures of 
the governor with his family are meant to establish this 
connection (pp. 155-6, 327-8), and his rhetorical 
defense against his impeachment is built on it (pp. 
145-7). 

By tapping into the power of the people through 
passionate rhetoric based on an intimate understand- 
ing of the people’s deepest hopes and fears, Stark 
opens a new avenue to political power in the state. 
Previous approaches to politics had hinged on personal 
connections among elites. We dimly see in the allusions 
to the old government of Governor Stanton and Judge 
Irwin that there was a period in which the state was 
dominated by a certain alliance of gentlemen (pp. 
124-5). That alliance ceased to hold power sometime 
around World War I and was replaced by political 
machines in which personal connections were built on 
graft and mutual self-interest rather than any shared 
sense of honor (pp. 65-6). Like Sadie Burke’s first 
sponsor, Senator Sen-Sen Puckett, each high-ranking 
member of these machines cut his own deal and 
dictated the terms of his continued cooperation (pp. 
73-4). 

Stark effectively dismantles the Harrison outfit in 
1926, but he does not simply seize control of one of the 
machines in the wake of Harrison’s fall. His political 
methods after Upton change the state’s politics in 
fundamental ways. Burden describes the landslide elec- 
tion of 1930: “And there wasn’t any Democratic party. 
There was just Willie” (p. 97). Stark has his close 
subordinates—Sadie Burke, Hugh Miller, Sugar Boy, 
and Jack Burden—but the heart of the organization is 
Stark himself. His control is maintained by a combina- 
tion of terror, regulated and limited graft, and a file of 
signed but undated resignation letters (pp. 132-3). 
Willie introduces personal politics and replaces the old 
politics of party or of faction with the politics of 
individual leadership. The main qualification appears 
to be the command of a powerful and passionate 
oratory that is capable of moving the people and is 
based on his ability to claim a special kinship or 
connection to them. The connection between Stark and 
the people is the one that matters, rather than connec- 
tions among different elites, and we will see that the 
terms of this connection have some unsettling conse- 
quences. 


Stark’s Public Philosophy and Its Political 
Consequences 


This revolution in the politics of the state cannot be 
accomplished without resistance. After Harrison is 
destroyed, the MacMurfee machine still exercises some 
power (pp. 97, 136, 148, 152). Although it is confused 
by Stark and uncertain about how to handle him, its 
leaders are still active in politics and anxious to thwart 
the new rival. Furthermore, even though Stark wins the 
governorship in 1930, his political operation leaves in 
place much of the institutional power of his opponents. 
Because Stark’s power is intimately attached to his 
person, Willie does not have a natural set of allies to 
run for the legislature, and MacMurfee’s friends con- 
tinue to dominate that branch of government until at 
least 1934. 

Progressive political leaders in the United States, 
going back to Woodrow Wilson, have recognized that 
personal rhetorical politics are constantly thwarted by 
the separation of powers. Wilson’s attempts to make 
the president “in fact, and not just in name, the head of 
government” as well as Franklin Roosevelt’s infamous 
court-packing scheme were reactions to the vetoes that 
conservatives could exercise against progressive re- 
forms. Similarly, Stark discovers that his power is 
limited by his ability to control the other branches of 
government. He packs the state’s high court with 
judges inclined to agree with his expansive reading of 
executive powers (pp. 124, 136). He simply does things 
and tries to use the precedent of having done them to 
make them appear constitutional (pp. 136-7), and he is 
continually struggling with the problems raised by 
MacMurfee’s power in the legislature. He repeatedly 
speaks of the legislature as something that needs to be 
“busted” (pp. 7, 10, inter alia). The ultimate test of 
Willie’s ability to control the legislature by these means 
occurs in the attempt to impeach him near the end of 
his first term as governor. 

One of the most peculiar aspects of the atemporal 
and periodic narrative structure of the novel is that the 
account of the 1934 “impeachment speech” is broken 
up into two places within the book (pp. 147-55 and 
261-3). In the first account, the content of the speech 
is treated as irrelevant. Burden, who claims to be like 
God looking in on History, knows that the speech will 
appear to be relevant because the outside observer will 
think that the crowd gathered around the capitol 
influenced the legislature and that the crowd was 
moved by the speech. Burden, however, claims to know 
that the speech and the crowd were not the cause for 
acquittal; a secret campaign of political blackmail 
against individual legislators has already ensured that 
Stark will remain in office (pp. 148-50). At this point, 
we see the crowd and hear it roar, but we do not hear 
the speech. 

The second account is related by Burden only after 
he reports a private conversation between Stark and 
Adam Stanton on the “nature of Goodness” that takes 
place in 1937. Burden carefully and explicitly connects 
the impeachment speech with this conversation be- 
cause only then does he realize that the speech is the 
key to understanding all of Stark’s political actions. In 
it Willie’s principles appear more starkly than in any 
other place. In fact, in this speech we discover that his 
plans are truly grounded in a certain commonly held 
vision of American democracy. I will explain what is 
revealed in the speech and its political import and then 
will discuss the conversation with Stanton, which points 
to less obvious implications of Stark’s regime. 

The speech is presented as a dialogue between Stark, 
standing at the top of the capitol steps, and the crowd, 
spread out across the lawn in front of him. This 
dialogue seems remarkably one-sided because the 
crowd has only two lines. Before Stark begins, the 
crowd chants “Willie—Willie—we want Willie.” After 
he starts to speak, the crowd answers his assertions 
with loud roars of approval. Stark opens by declaring 
the ruin of his enemies, and the crowd roars. He then 
asks if they like what he has done, and they roar again. 
Each roar is controlled by Stark, who silences the 
crowd by the motion of his hand. He announces: “I tell 
you what I am going to do. I am going to build a 
hospital. The biggest and finest that money can buy.... 
To heal sickness. To ease pain. Free. Not as a charity. 
But as a right. It is your right. Do you hear? It is your 
right” (p. 261)! He proceeds to call for rights to 
education, food, good roads, fair and progressive tax- 
ation, and “that you shall not be deprived of hope” (p. 
261). We can only understand Stark in terms of these 
promises and his justification for them. To begin with 
the nature of the promises, he claims that people have 
a right to have all human efforts expended in making 
their lives better and more comfortable. By treating 
each of these promises as rights, Stark establishes an 
obligation on the part of the government to the people. 
The government must provide the policies that will 
ensure these rights. They are not optional. 

Stark then launches into a graphic description of the 
violence he will inflict against the man who stands in his 
way. The roar of the crowd intensifies as Stark de- 
scribes the blows he will strike against his foes, culmi- 
nating in his characteristic call for a “meat-ax.” He 
does not hide his intention to destroy his foes. The 
individual acts of bribery and extortion that Burden 
describes in the earlier account (pp. 145-7) may take 
place out of sight, but even if they were public, we may 
presume, Stark’s supporters would stand with him. As 
his language becomes more violent, the crowd’s roars 
become louder. The crowd’s excitement must be ex- 
plained in light of the last two sentences of the speech, 
both of which are delivered in a more level and 
controlled voice. 

In the penultimate sentence, Stark declares: “Your 
will is my strength” (p. 262). He claims that his power 
comes from the people. He does the people’s will, and 
that is what makes him powerful. All his political power 
would evaporate if he could not animate popular 
support for his positions. His blackmail and bribery are 
effective only because they are backed by the people’s 
desire to stand by what Willie Stark says and to punish 
any legislator or other politician who stands against 
him. Stark screams for the meat-ax and swears to 
destroy the man who stands in his way, but he identifies 
himself as the arm of the people, delivering the blows 
that they would deliver on their own behalf if only they 
could. 

In this speech, Stark appears as the agent of the 
people, advocating for their rights and destroying 
anyone who stands in the way of the realization of 
those rights. Although American government has al- 
ways treated the securing of rights for the people as the 
purpose of government, this represents a new way of 
looking at both the rights themselves and the role of 
political leaders in securing them. Rights are, accord- 
ing to the political philosophy that Stark enunciates in 
this speech, changing things. Different rights are rec- 
ognized at different times. Stark portrays himself as the 
man who interprets rights and determines which rights 
are worthy of protection at a given moment, and the 
people’s request for the protection of their rights is 
expressed by the call of “We want Willie!” Stark is both 
the interpreter who determines which rights are to be 
protected and the guarantor of those rights. 

Thus far, Stark’s principles (as opposed to his means 
of achieving them) are entirely consistent with those of 
Franklin Roosevelt and other progressives who sought 
to increase the role of government by redefining the 
purposes for which governments are instituted. They 
accepted the assertion in the Declaration of Indepen- 
dence that governments exist to protect rights, but they 
redefined the rights that were to be protected and thus 
authorized a great expansion in the powers of govern- 
ment to define and secure those rights. In his famous 
Commonwealth Club Address, Franklin Roosevelt 
proclaimed: 

The Declaration of Independence discusses the problem 
of government in terms of a contract.... Under such a 
contract rulers were accorded power, and the people 
consented to that power on consideration that they be 
accorded certain rights. The task of statesmanship has 
always been the redefinition of these rights in terms of a 
changing and growing social order. New conditions impose 
new requirements on government (Roosevelt 1938, 753). 

Stark accepts, or shares, this reasoning and casts 
himself as a statesman of the type that Roosevelt 
discusses. No previous administration would have rec- 
ognized the rights to health care, education, food, or 
progressive taxation that Stark enunciates (p. 124). His 
reconfiguration of the meaning of rights, which paral- 
lels Roosevelt's, clearly creates a sense of entitlement, 
but we cannot assume, as does Adam Stanton when he 
calls Stark’s discussion of rights his “bribe” (p. 261), 
that this is simply rhetorical fluff to garner popular 
support. As Burden points out, Stark is going to be 
acquitted regardless of the content of the speech. Stark 
may gain popular support by enunciating these views, 
but he does not need that support at this particular 
moment. Furthermore, Stark’s entire political career 
suggests that he truly believes what he says here. 

If rights do change over time, we must identify the 
standard by which rights are measured and their 
changes are marked. For Stark, that standard lies in the 
needs of the people. He concludes with the simple but 
crucial sentence: “Your need is my justice.” The stan- 
dard by which Stark judges his political ends is consis- 
tently based on the principle that the needs of the 
people constitute the essence of justice. He began the 
oration at Upton by saying: “I have a speech here. It is 
a speech about what this state needs. But there’s no use 
telling you what this state needs. You are the state. 
You know what you need” (p. 90). Even before he 
recognizes the need for this transformative speech, he 
objects to Burden’s critique of his “Gettysburg Ad- 
dress” speech by saying: “You see what this state 
needs” (p. 72). Stark’s standard is consistent through- 
out the work. His understanding of this standard is 
loaded with crucial implications, and we need to inves- 
tigate it carefully in the specific context in which 
Warren places it in the novel. 


THE BROAD IMPLICATIONS OF STARK’S 
POLITICS: HIS ACCOUNT OF THE GOOD 


Stark discusses the nature of “Goodness” with Adam 
Stanton. This conversation reveals the underlying as- 
sumptions about human nature and the purposes of 
political society that are implicit in Stark’s politics. As 
noted above, Burden only reports the content of the 
impeachment speech in flashback at this point in the 
narrative. He demonstrates his interest in this subject 
by questioning Stark about what he said on each 
occasion. Burden, by his own admission, rarely asks 
Willie why he does what he does, so we must infer that 
he finds these statements especially perplexing and 
especially important. By connecting what Stark says to 
Stanton with what he said earlier in the impeachment 
speech, Burden shows how Stark’s standard of justice is 
based on a certain view of the human condition. This 
view, in turn, is crucial to explaining the connection 
between Stark’s politics and Burden’s intellectual and 
moral travails. 

Stanton is a puritan of sorts. He is emphatically 
interested in seeing that the right things are done (pp. 
236-7, 248). When Hubert Coffey tries to bribe him, 
Stanton strikes the would-be briber (pp. 320-1). This 
sudden act of violent revulsion and anger against a man 
whose actions implicate Stanton in wrong-doing is the 
first glimpse of the impulse that animates his assassi- 
nation of Stark. Like all puritans, Stanton takes a 
proprietary interest in virtue and takes offense at those 
whom he thinks misuse virtue for the wrong ends or 
misappropriate the name of virtue for less than virtu- 
ous acts. When he hears Stark’s impeachment speech, 
he sneers and snarls: “‘Justice! He used that word’” 
(p. 262). Stanton does not consider the crucial
definition that Stark gave the word because, in his mind, even
the mention of justice by a political strongman who 
uses bribery, extortion, and the threat of force is an 
affront. Stanton wants to keep his virtue unsullied, and 
this obviously creates a significant obstacle to Stark’s 
plan to make Stanton the director of his hospital. 

Burden, who is assigned the job of persuading Stan- 
ton to accept the position, concludes that the only way 
to succeed is to “change the picture of the world in his 
head.” Burden reveals his evidence that Judge Irwin 
accepted a bribe while serving as attorney general and 
that Adam’s father, Governor Stanton, helped cover up 
the bribe to protect his friend (p. 247). This revelation 
causes Stanton to curse his father, whom he had 
adored (pp. 253-4), but it also erases his old way of 
understanding the past. His utter rejection of any 
connection to Stark’s politics was only possible if he 
assumed that previous politics, his father’s politics, 
were purer than those of the present administration. 
With that illusion destroyed, Stanton accepts the posi- 
tion. Stanton, like Stark, changes only when he is 
convinced that possibilities he had assumed were open 
are foreclosed by new evidence. Both men are forced to 
confront facts that undermine their view of the world. 
Stark accepts the irrelevance of dignity and rejects any 
consideration of it in his political activity. Stanton 
accepts that there are no honorable politics and agrees 
to the previously unacceptable connection with Stark, 
but he insists that he will keep politics at arm’s length. 

Stark’s meeting with Stanton is, in many regards, like 
his earlier meeting with his first attorney general, Hugh 
Miller, who resigned after Stark saved the state auditor 
from impeachment for a crime of which the man was 
guilty (pp. 135-40). Stark explains himself to the 
“puritans” with whom he works because he thinks he 
has more in common with them than with any others. 
The problem is that his political tactics convince people 
like Adam Stanton and Hugh Miller that they must 
oppose him, which deprives Willie of those whom he 
considers his most valuable political allies. He is
compelled to explain why he thinks his positions are
compatible with their puritanism, but he is never 
entirely successful. 

Warren is very deliberate in showing us these con- 
versations that take place in private and that need not 
take place at all, and he has Burden draw specific 
connections between them and Stark’s public speeches. 
A political leader like Stark can get Stanton to serve as 
director of the medical center or get the legislature to 
acquit him on the impeachment charges without speak- 
ing frankly, as he does in this conversation or that 
speech. Nevertheless, we cannot fairly judge Willie 
Stark if we do not have these pieces of the puzzle (see 
Burt 1988, 142). 

When Stark arrives at Stanton’s apartment with 
Burden, he asks Stanton what he thinks of the hospital. 
Stanton replies that the hospital will “do the people of 
the state some good” and “get you some votes” (p. 
256). Stark claims that there are “other ways to get 
votes.” He then tells Stanton the story of Hugh Miller. 
Stark compares Miller to a man who wants to make 
bricks without getting muddy (compare to pp. 136-7) 
or to eat steak without having slaughterhouses (p. 256). 
He claims that Miller did not recognize that not only 
can you “not have everything” but also “you can have 
mighty little.” Miller thought he could inherit
“goodness” the way he inherited money and his name, but
Stark insists that if there is going to be any “goodness,” 
you have to “make” it. He then claims that “goodness” 
must be made out of “badness” because there is 
nothing else out of which to make it (p. 257). This is the 
essence of Stark’s justification for his political tactics 
(p. 137). The argument that we must make the “Good” 
or the “Just,” which are defined as meeting the needs 
of the people, by the use of the bad is a variant on the 
familiar argument that the ends justify the means. All 
means are appropriate and allowable if they
accomplish the good. Later in the novel, Burden identifies
this as the “theory of the moral neutrality of history,” 
which holds that “process as process is neither morally 
good nor morally bad. We may judge results but not 
process” (p. 393). Burden explicitly associates the 
theory with Machiavelli, “the cold-faced Florentine, 
who is the founding father of our modern world.”10

Stanton, after a silence, asks Stark: “If, as you say,
there is only bad to start with, and the good must be 
made from the bad, then how do you ever know what 
the good is? How do you even recognize the good” (p. 
257)? The question requires Stark to define “Good 
with a capital G.” He answers that Good is something 
that men make up. Starting with a comical account of 
human evolution from a prepolitical state of nature, 
Stark argues that man is descended from the lower 
forms of animals who had no sense of good or evil. As 
they evolved, they created rules and associated those 
rules with goodness to protect themselves and to 
ensure the survival of the societies on which they came 
to depend. Goodness is, says Stark, created by human 
beings for the purpose of “doing business” (pp. 257-8). 
According to this reasoning, goodness is a changing 
value, constantly adapted to meet our current necessi- 
ties. Stark insists that human beings construct defini- 
tions of goodness in order to protect and preserve 
society as a precondition to their physical well-being, 
comfort, and prosperity. 

In his generally excellent treatment of the problems 
associated with Stark’s political activity, Burt misses 
this crucial point. He argues that Stanton’s question 
about how we identify the good when we make it out of 
bad is decisive and concludes that Stark is dodging the 
issue when he launches into his account of the evolu- 
tion of human law and society (Burt 1988, 150-2). This 
account, however, is Stark’s answer. History/evolution 
can be the source of a standard of good that is both 
always evolving and transcendent. According to the 
standard embodied in the statement that “your need is 
my justice,” the particular needs of the people do 
change, but the fact of their neediness does not. Only 
society's needs can define justice, and when the needs 
of society change, the prevailing definition of justice 
must change to accommodate the new needs of society. 
There is no limit on what behaviors society can tolerate 


10 “The end justifies the means” is the common paraphrase of: “In
the actions of all men, and especially of princes, where there is no
court to appeal to, one looks to the end. So let a prince win and 
maintain his state: the means will always be judged honorable and 
will be praised by everyone” (Machiavelli 1998, 71). 
so long as they contribute to the meeting of those
needs.11 This is a more important step in understanding
Stark than the recognition that he agrees with Machia- 
velli about the ends justifying the means, because this 
reveals the criteria by which ends are selected. Burden 
will later struggle mightily with the consequences of 
Stark’s argument that the good can be grounded in the 
history of human neediness. 

A widely circulated picture of Stark carries the 
quotation: “My study is the heart of the people” (p. 
6).12 By discovering the needs of the people, Stark 
discovers the purpose for which he governs. He is truly 
interested in providing the people with what they need 
(contrast Ealy 1999, 6), but his confidence that he (and 
perhaps he alone) truly knows their needs makes him 
terribly dangerous because he is freed from all the 
traditional restraints on men in power. He can, with a 
good conscience, do whatever it takes to assure their 
comfort and security (see Burt 1988, 141). Stark has a 
cause that he is convinced is completely just. Although 
it has the force of a moral imperative, Stark’s justice 
knows no rules and respects no boundaries; it does not 
have any fixed meaning, and Stark himself is the judge 
of its temporal interpretation. With such a definition of 
justice, Stark can explain away any rule he needs to 
break. He is, to cite the impeachment speech, “living in 
the will and the right” of the people; therefore, all his 
actions carry their sanction. To stand against Stark is 
not to stand against an ambitious man seeking his own 
glory or fame. It is to stand against the good of the 
people, their wishes and their needs. 

Warren reveals what appears to be a necessary 
connection between passionate oratory, political Ma- 
chiavellianism, and concern for the common good that 
may be particularly endemic to certain types of democ- 
racies. In fact, if Stark can be said to play hardball 
harder than the machines of MacMurfee or Harrison, 
we can trace his ruthlessness not to some defect in his 
character but to the impulse that drives him to do good 
for the people of the state. The old machines just 
wanted to maintain their power and generate some 
graft to line the pockets of their members. They were 
corrupt but not very ambitious. Stark, in contrast, feels 
compelled to overcome all the self-interest, personal 
ambition, inertia, and human conniving that stand in 
the way of his overhaul of the state. His plans for a 
greater society in which human pain is eased and 
human needs are met, combined with the realities of
democratic politics, require the use of demagogic
rhetoric, systematic corruption, and ruthless political
extortion. Stark is not simply a reformist politician who goes
bad. His transformation forces us to consider certain 
dangers that lurk in our modern conceptions of Amer- 
ican democracy. 

Each person tends to express his or her perceived 
needs as rights that must be absolutely fulfilled. By 


11 Compare Roger Barrus’s (1994, esp. 25-6) analysis of questions
asked to presidential candidates in the 1992 town hall debate. 

12 Warren's rendering of Stark’s motto makes significant changes to
that attributed to Huey Long: “I know the hearts of the people 
because I have not colored my own” (Bloom 1987, 4-5). 
making this connection explicit in public speeches, 
Stark further legitimizes this view and convinces the 
people that their perceived needs must be met by 
government, regardless of the costs. If meeting the 
needs of the people becomes the very definition of just 
government, would-be governors are required to make 
tremendous exertions in the pursuit of the means by 
which they can fulfill those needs. When these needs 
are accorded the status of “inalienable” rights, there is 
no moral boundary that may not be crossed in fulfilling 
them. 

Taken together, however, the people’s perceptions 
of needs are infinitely mutable and often self-contra- 
dictory, and there is no easy mechanism by which “the 
people” can be brought together to agree on what they 
need. This problem is exacerbated by the fact that 
human beings are generally disinclined to look for any 
vision of the good that lies beyond their self-interest. 
They always tend to confuse their good with the good. 
The American framers clearly recognized this problem 
but hoped it could be mitigated in a republic that relied 
on bargaining and compromise to determine the direc- 
tion and goals of public policy (e.g., Madison, Hamil- 
ton, and Jay [1788] 1990, 45-6). Each citizen would act 
according to his or her own best interest, but the 
common good would be approximated by the compro- 
mises they would reach. This formula, however, is 
dependent upon the ability of individual representa- 
tives to make the compromises that best approximate 
the interests and sense of justice expressed by their 
constituents. When political organizations develop dis- 
tinct interests, independent of those whom they repre- 
sent, it becomes possible that the compromises will be 
dictated by the political ends of those parties rather 
than the good of constituents. We see such a crisis in 
Warren’s novel in the account of politics under the 
machines and the corporate domination of the state 
during their ascendancy. 

Machine politics detached nearly all political power 
in the state from any connection to the needs or desires 
of the common people. The people in Warren’s novel 
are disheartened by the utter failure of representative 
government to approximate the good of the people, 
and this perception is the necessary precondition for 
the rise of Willie Stark (see p. 72). But Stark’s attempt 
to return the state to the consideration of and attention 
to the people’s needs is itself problematic. Stark rep- 
resents a type of democracy that is not driven by a 
compromise of interests; it claims to accomplish the 
amalgamation of all the various self-interests in society, 
or at least all the various self-interests of those persons 
who can be said to constitute “the people,” into one 
common interest. 

This unified vision of the common good of the 
common people is enunciated through a new and 
powerful political rhetoric that allows Stark to create a 
unity that is more apparent than real. He gives the 
people a vision of their common good that they all 
think they share. This vision is misleading for a 


13 Woodrow Wilson expresses the aspiration for such an orator and
such a rhetoric in this way: “A nation is led by a man who ... speaks


number of reasons. To name only one, it is based on a 
fiction that the “common people” are “the state” (p. 
90) and that anyone who is not one of the common 
people, meaning anyone who does not recognize 
Stark’s sense of their plight and his competence to 
define and pursue their interest, is by definition not a 
constitutive part of the state. In Stark’s democracy, 
“We want Willie” becomes the sole expression of the 
people’s will. To attain his vision of this infinitely 
mutable “Good,” Stark is willing to employ incredibly 
powerful means, granted to him as the arm of the 
strength of the people. He will destroy anyone who 
stands in his way and becomes as terrible as a tyrant in 
pursuit of his democratic ends. 


BURDEN’S PHILOSOPHICAL TRAVAILS 


The second story in All the King’s Men, Jack Burden’s
quest to understand the world around him, is a source 
of some major problems in interpreting the “political 
meaning” of the novel and has led some commentators 
to conclude that the book is not ultimately all that 
political (see King [1980] 1987). To demonstrate the 
integrity of the novel as a discussion of politics, I must 
explain how Burden’s narrative, including the digres- 
sions on his childhood and love life that seem utterly 
out of place, fits in a political novel. I argue that 
thinking of Stark’s politics in the broad sense, as a 
formative regime that shapes individual characters, 
may explain why Burden’s private struggles take on 
their particular character. I have analyzed the origins 
and nature of Stark’s approach to government and the 
view of the human condition on which his politics are 
based. In this section, I will discuss how Burden’s moral 
and intellectual crises are intimately related to the 
character of Stark’s politics. 

Just as the title points the way toward a more 
complete understanding of the novel’s political teach- 
ing, so Warren provides an epigraph that aids in 
understanding the investigations of Burden: Mentre che 
la speranza ha fior del verde. The line is drawn from 
Dante’s Purgatorio, and the entire stanza is translated: 
“Despite the Church’s curse, there is no one/ so lost 
that the eternal love cannot/ return—as long as hope 
holds something green” (Alighieri 1984, pp. 133-5). 
The epigraph is the last phrase and holds out the 
possibility of “eternal love” or redemption, which 
strikes a sharp contrast to the title’s political subject 
and pessimistic implications. 

It may be in keeping with the subtitle’s reference to 
the possibility of “eternal love returning” that Burden’s 
story begins and ends with a love affair. As a young 
not the rumors of the streets, but a new principle for a new age. A
man in whose ears the voices of the nation do not sound like the
accidental and discordant voices that come from the voice of a mob, 
but concurrent and concordant like the united voices of a chorus, 


14 Curiously, Burden refers to Dante by name once in the text, p. 236. 
The subject is pride, envy, and flattery, and the comment is directed 
toward Adam Stanton. 
man, Burden fell in love with Adam Stanton’s sister, 
Anne, and they planned to marry. One of the two 
major issues that Burden associates with their separa- 
tion is his seeming lack of ambition. He thought that 
Anne wanted him to go to law school. When he
complained of his dislike of the law, she insisted that he 
did not need to study law and that she would “eat beans 
with him” (p. 301). Burden intentionally failed himself 
out of law school and got into a rather scandalous mess 
(pp. 302-3). The couple split up. Burden later thinks 
that she only spoke of law school or work because she 
needed to see some sign that he was going to do 
something with his life (pp. 310-1). Neither Burden 
nor Anne Stanton fared well in love affairs afterward: 
He had a failed marriage and many meaningless liai- 
sons, and she had a number of failed engagements. 

Anne did not know what she wanted Burden to do, 
but she needed some indication that he intended to do 
something, that he wanted to participate actively in the 
world. Burden, however, seemed unable to commit to 
any endeavor. Other than graduate school, he never 
worked at anything with even a little conviction until 
his job with Willie Stark. Burden was unable to force 
himself to achieve success in the ways the society 
around him measured it, and he resisted the opportu- 
nity to gain well-paying and high-profile positions at 
several points in his life (e.g., pp. 118-20, 126-7,
305-6). He actively avoided being accused of doing 
anything for money (pp. 191-2). 

Burden does spend a good deal of his life trying to 
understand things. He conceives of the purpose of man 
as knowledge (p. 9). He explicitly adopts several dif- 
ferent answers to fundamental human questions: What 
is the nature of history? What are good and evil? What 
is the best way of life for a human being? Throughout 
his story, Burden attempts to act in a manner consis- 
tent with his latest thoughts on these thorny questions. 
He seems interested in living an examined and consis- 
tent life and claims he always has envied those who 
seem to have a “secret knowledge” (pp. 312-3). Bur- 
den’s philosophical wanderings are his effort to gain 
the most important secret knowledge, the truth about 
the world and the people in it. 

Burden describes himself as a “brass-bound Idealist” 
(p. 30). He claims that he “owed his success in life to 
that principle” and that “if you are an Idealist, it does 
not matter what you do or what goes on around you 
because it isn’t real anyway” (p. 30). His idealism, 
which he says he learned in college, is first and 
foremost a view of the world that insulates him from 
the events and people around him. This idealism, like 
that view of the “Ideas” that is often attributed to 
Plato, relies upon a fundamental division of all exis- 
tence into the high and important things that are not 
seen or appreciated by most people and the lower and 
unimportant things that occupy the attention of most 
people but do not truly matter in any enduring sense. 
While an idealist, Burden views the “flux of things” that 
constitutes the activity of the world as something 
divorced from the fundamental underlying reality and 
treats the world with a disdain that makes it inconceiv- 
able he could invest himself in it (p. 189). The world 
and its events are to him “simply an accumulation of 
items, odds and ends of things like the broken and 
misused and dust-shrouded things gathered in a garret” 
(p. 189).15 Given his indifference to the activity of the 
world, it is easy to see why Burden does not take much 
interest in practical career goals or training to be a 
lawyer. He cannot do anything as long as he is con- 
vinced that nothing matters. 

The defect of Burden’s idealism is first revealed in 
his research for his doctoral dissertation. Working 
from a collection of old letters and journals, Burden 
tries to write about the life and times of his grand- 
uncle, Cass Mastern. A young man reared in poverty in 
the antebellum South, Mastern was saved by his pros- 
perous older brother, who sent him to Transylvania 
College in Kentucky. While there, he became involved 
with the wife of his friend, Duncan Trice. Trice com- 
mitted suicide, and the series of personal tragedies that 
followed from his death dictated the rest of Cass 
Mastern’s life. Until his death in a field hospital during 
the Civil War, Mastern tried to expiate his crimes. He 
freed all his slaves, practiced law, and defended blacks, 
and, although he marched in the Confederate ranks, did
not fire a shot.

Burden never completed his dissertation. Although 
he learned the “facts of Cass Mastern’s world,” he 
could not understand the man or his decisions. Burden 
did not even have to demonstrate such knowledge to 
earn the Ph.D. His advisors thought the work was 
perfectly acceptable (p. 100), but Burden did not think 
he really grasped the facts if he could not explain the 
character of the man (p. 188). Eventually, he aban- 
doned the project. His understanding of “History” and 
“Idealism” could not account for Cass Mastern, whose 
very existence hinged on the idea of human responsi- 
bility. His every move from the death of Trice until his 
own death was an act of penance for his actions, but 
Burden could not understand this sense of culpability. 
Stymied, Burden sank into a period of prolonged 
inactivity that he named “the Great Sleep” (p. 190). 

During his three “Great Sleeps,” Burden approaches 
complete inactivity. He only gets up in the morning so 
he can go back to bed at night (pp. 105, 107, 189, 
306-7). During these periods, he considers the possi- 
bility that all human life, all human desire, is part of a
meaningless game with no grounding in objective real- 
ity (p. 99). This fascination with sleep, inactivity, and 
mistrust of the objective nature of facts marks a 
worldview consistent with Burden’s idealism and his 
corresponding notion of perfection as the complete 
absence of motion, life, and vitality (p. 99). Paradoxi- 
cally, however, each period of “Great Sleep” is a 
prelude to action. The first immediately precedes his 
decision to abandon graduate work and his virtually 
finished dissertation (p. 189), the second leads to 
leaving the comfortable life he had with his wife Lois 
(pp. 306-7), and the third, in fall and winter 1930-31,


15 Burden’s “garret” may be intentionally similar to Socrates’ account
of the vision of the philosopher upon returning to the cave (Republic,
516e, ff). 
follows his departure from the Chronicle and ends 
when he goes to work for Willie Stark. 

Stark appeared to be a political stooge who could 
change nothing in the state, but when he gave his 
speech at Upton, he raised new possibilities in Bur- 
den’s mind. Warren includes a brief but telling anec- 
dote from later that evening. When Burden makes a 
complex reference to some liquor, Sadie Burke, a blunt 
woman who does not speak in metaphors, responds: “I
forgot, you’re the fellow who went to college.” Burden 
notes: “Yes, I was the fellow who had gone so gram- 
matically to college, where I had not learned, I decided, 
all there was to know” (p. 94). His idealism cannot 
explain everything, and Stark represents something 
Burden does not understand. Before Upton, Burden’s 
detachment from the irrelevant epiphenomena of the 
world makes it perfectly reasonable to drink his way 
along, writing insincere political columns for the 
Chronicle, but he cannot maintain that way of life after 
Stark emerges as a political force and a philosophical 
possibility. He quits the Chronicle when he cannot 
persuade himself to accept the company line on the 
gubernatorial election of 1930 (pp. 98-9) and, after the 
last “Great Sleep,” is called to the governor’s office and
hired as a sort of special assistant. 

The dissonance between the “perfect” idealism of 
Burden’s reflections during “the Great Sleep” and his 
critique of inactivity as antithetical to life becomes 
most apparent in his work for Stark (pp. 99-100, 202).
He begins a curious double life of action and thought. 
He characterizes his job as an extension of his work as 
a “historical researcher” and specializes in finding out 
the information Stark needs to deal with his various 
political adversaries. He is, by all accounts, very good at 
it (pp. 157, 191, 228). Burden is the one character in the 
novel who is most torn by the conflict between “the 
man of ideas,” most fully embodied in Adam Stanton, 
and “the man of action,” most fully embodied in Willie 
Stark, which he characterizes as “the terrible division 
of their age” (p. 436; see Baumbach [1965] 1987, 20-1). 
Burden not only has elements of both but also radical- 
izes each principle, thinking most abstractly about 
mankind’s insignificance and performing the most pur- 
poseful actions to further Stark’s political ends. In 
going to work with Stark and doing what he does, even 
believing in what he does, Burden becomes a man of 
action in a way that seems inconsistent with the ideal- 
ism he claims to espouse. 

Burden openly claims that he does not work for the 
money but seems incapable of explaining why he 
continues to work for Stark, even when assigned the 
most difficult or morally dubious duties (pp. 126, 
191-2). He is not a sycophant who wants to be near the 
powerful, as does Tiny Duffy; he is not sexually at- 
tracted to Stark, as are the numerous women; and he is 
not easily bemused by great speech-making, as is 
Sugar-Boy (compare pp. 9 and 421). Burden is genu- 
inely interested in Stark’s political projects and en- 
thralled by his ability to accomplish previously unthink- 
able political projects. As Willie’s most trusted political 
assistant, Burden is genuinely intent on getting some- 
thing done, even though his avowed philosophy pre- 


cludes the ability of men to change anything of impor- 
tance. Almost all of Burden’s explicit praises of Stark 
are tied to his fascination with the ability to accomplish 
things, and we should attribute them not to some 
“thinly disguised” praise of Huey Long by Warren but 
to Burden’s intense desire to participate meaningfully 
in the action of the world (pp. 124-6, 202, 208, 393-4).

The dissonance between idealism and political activ- 
ity comes to a head when Burden discovers that Anne 
Stanton and Willie Stark are having an affair (pp. 
267-9). Burden’s reaction reveals the connection be- 
tween his personal crises and the view of human nature 
that informs Stark’s politics. Following the impeach- 
ment speech and Burden’s defense of Stark’s methods 
as the necessary means to fulfill the people’s needs, 
Anne approaches Willie for help with a children’s 
center and falls in love with him. She admits being 
attracted to him as a man who can get things done, 
which must be understood as a rebuke to Burden, 
whom she all but accused of being incapable of any 
action (pp. 324-6). Upon discovering the affair, Bur- 
den promptly leaves town and drives to California, 
replaying his life in his mind. He compares this driving 
and reliving to “drowning” in “the West” and sinking 
“to the very bottom of the West to lie in the motionless 
ooze of History” (pp. 271-2). 

As Burden reviews his life, he realizes that he has not 
been the idealist he had thought, immune from the 
world and detached from its workings. He struggles 
with the realization that he was once in love with a 
woman and that love, and his failure at it, has governed 
his decisions in life. He realizes that his political 
activity and his defense of Stark as a man of action have
delivered the woman he loved into the hands of his 
employer. In trying to discover how to live with these 
failures, he has what he calls “a dream.” 


I fled west from the fact [of Stark’s affair with Anne], and 
in the West, at the end of History, the Last Man on that 
Last Coast, on my hotel bed, I had discovered the dream. 
That dream was the dream that all life is but the dark 
heave of blood and the twitch of the nerve ... which is the
dream of our age. At first, it is always a nightmare and 
horrible, but in the end it may be, in a special way, rather 
bracing and tonic (311).16


This “dream” is the dream of science that sees human 
beings as physical entities with no spiritual component, 
no special role in the universe, and no moral respon- 
sibility. In Burden’s previous thought, he was certain 
that ideas were all that mattered, and physical events 
were utterly meaningless relics. In his new approach to 
the world, there are no higher ideas, and the physical 


16 Burden’s description of his dream is full of images from Nietzsche;
see Nietzsche [1874, 1947] 1957, 5, and also [1885] 1954, 129-30 and
231-3. In thinking about Burden as a historical researcher and his
participation in Stark’s political activity, consider Nietzsche's ([1874,
1947] 1957, 49) statement: “The historical sense makes its servants
passive and retrospective. Only in moments of forgetfulness, when
that sense is dormant, does the man who is sick of the historical fever
ever act; though he only analyzes his deed again after it is over (which
prevents it from having any future consequences), and finally puts it
on the dissecting table for the purposes of history.” Jack Burden’s
political activity, indeed, his entire life, is finally considered and
dissected on the trip to California.
events are the only events. Burden passes from radical
idealism to radical materialism.

The dream of radical materialism is useful to Burden 
because, at that crucial moment, it allows him to 
explain away his love for Anne Stanton. If human life is 
simply the twitch of a nerve, then love is nothing but a 
certain biological reaction to sexual stimuli, and it has 
no relation to the quality of the persons involved. 
According to this scientific view of mankind, all per- 
sons are simple automatons and essentially identical 
(pp. 309-10). If we are all just “machines” of blood, 
chemicals, and nerves making biologically predictable 
responses to stimuli, then there is nothing special about 
anyone, nothing to make one person lovable and 
another detestable, nothing to make one great and 
another pathetic. This dream of the world is radically 
egalitarian. 

Some commentators consider the transformation in 
Burden’s character to be artificial and inadequately 
explained. It may appear that Burden changes 180 
degrees in his philosophical orientation in the course of 
a single trip to California and as the result of a single 
personal catastrophe in a life with many catastrophes 
(Baumbach [1965] 1987, 36; King [1980] 1987, 150ff). 
Warren’s account, however, actually reveals that the 
change has been coming for some time. We cannot lose 
sight of the connection between this view of the world 
and the peculiarly political lessons that Burden has 
learned in his connection to Stark. 

Stark’s view of human nature holds that all men are 
“corrupt” in a very particular way. Human beings are 
corrupt because they are needy, and they blindly grope 
toward the things they need. They are incapable of 
doing otherwise and under no obligation, religious or 
moral, to do otherwise. Perhaps the clearest reflection 
on the sordid nature of human neediness comes when 
Willie and Jack discuss slopping hogs (pp. 30-1, also 
see p. 393): “‘And,’ he said, ‘by God I’m still doing it. 
Pouring swill.’ ‘Well,’ I said, ‘swill is what they live on, 
isn’t it?’” Anyone who has seen pigs attack a newly 
filled trough will recognize how powerful an image of 
human neediness this is. In Stark’s political thought, 
politics are the tools by which needy individuals em- 
power themselves to overcome their neediness, and 
justice itself is nothing but a construct that helps needy 
humans define those rules that they think are most 
conducive to meeting their physical needs. These rules, 
the laws, are almost always behind the current needs of 
society because most human beings are short-sighted, 
but a few far-sighted persons with the proper perspec- 
tive recognize that all human institutions are historical 
and temporary; they can look beyond the current rules 
and laws to see what is needed and do what is necessary 
before others realize it is necessary. For those persons, 
any activity, no matter how heinously it may seem to 
offend the current rules of society, is justifiable if it 
meets the needs for which government is created. 

Burden hears Stark explain this understanding of 
political life on at least two occasions (pp. 136-7, 
257-8) and praises him as the “genius” who can 
these principles and act on them (pp. 393-4). In 
California, Burden accepts the broader implications of 
these principles, learned from Stark and endorsed by 
his participation in Stark’s politics. If human neediness 
is grounded wholly in our character as physical beings 
and dignity is nothing, then all our so-called higher 
attributes are constructs that we use to justify our 
pursuit of our needs and have no basis in ultimate 
reality. Love, like law, justice, and nobility, is only a 
construct used to legitimate our blind groping for 
material satisfaction. 

Armed with this new understanding, Burden claims 
that he has finally found “confidence” by discovering 
the “secret knowledge” of the world and its workings. 
He claims that there is no reason not to have confi- 
dence when you have had “the dream” because you 
know that everything is physical, while those around 
you are deluded by beliefs in human character, moral 
responsibility, and love. In New Mexico, Burden picks 
up an old man who has an involuntary facial tic and 
dubs his new philosophical orientation the “Great 
Twitch” because we are all dominated and controlled 
by compulsory physical responses of which most of us 
are unaware (pp. 313-4). 

Burden glories in his “mystic vision” that he is “at 
one with the Great Twitch” (p. 314). He looks down on 
those around him, whom he considers benighted be- 
cause they do not know the secret of the Great Twitch 
and the nature of human life. He begins a new study of 
the world informed by his conviction that his political 
activity is constantly dictated by the meaningless bustle 
of twitches who are not aware of their own nature. The 
most striking illustration of his new study is his fasci- 
nation with watching Adam Stanton perform a lobec- 
tomy. He characterizes the operation as the scientific 
answer to the experience of Saul on the road to 
Damascus (p. 317; see Simmons [1971] 1977). If human 
beings are entirely physical and the results of a peculiar 
combination of genes, chemistry, and reactions, then a 
frontal lobectomy, the rearranging of the atoms in a 
man’s brain to change his personality, is the new 
version of religious conversion based on the “deity” of 
physical matter. Burden insists on “baptizing” the 
patient, “for he is born again and not of woman. I 
baptize thee in the name of the Big Twitch, the Little 
Twitch, and the Holy Ghost. Who, no doubt, is a 
Twitch, too” (p. 319). Stanton does not get the joke, 
but Burden notes: “That summer from the height of my 
Olympian wisdom, I seemed to find a great many things 
funny which now do not appear quite as funny” (p. 
319). 


THE CHARACTER OF BURDEN’S 
RECOVERY 


After he returns from California, Burden walks amid 
terrible and transformative events, wrapped in the 
insulation of his knowledge of the Great Twitch. He 
witnesses the bargaining, political bribery, and corrup- 
tion that dominate the process of building the hospital. 
As part of that process, he confronts Judge Montgom- 
ery Irwin, his childhood mentor and, as he later 
discovers, his true father, with the terrible secret of his 
past, a bribe that Irwin accepted when he was unable to 
pay off his debts in the years before World War I. Irwin 
kills himself, and Burden is aware that he is the one 
who led him to do it. Stark’s son, Tom, is paralyzed by 
a football injury. Stark reneges on his earlier deal to 
arrange for the building of the hospital by Gummy 
Larsen. Two days later, Adam Stanton, who has been 
told about his sister’s affair with Stark, confronts and 
shoots the governor in the great hall of the capitol. 
Burden participates in and views all these events with a 
certain indifference because he has objectified human 
beings as being only physical conglomerations of blood 
and nerves. 

The penultimate chapter concludes with Burden’s 
last meeting with Stark, who is dying in a hospital bed 
from the gunshot wound. Stark tells Burden that “you 
got to believe” that “things could have been different” 
(p. 400). Burden gives a half-hearted answer, as though 
humoring a sick but deluded man. The theory of the 
Great Twitch cannot accommodate the idea that things 
could have been different. In order to acknowledge 
that events may occur in more than one way, that the 
direction of events may be the conscious product of 
moral decisions made by people, or even that one way 
is better than any other way, Burden would have to 
reject radical materialism. Such possibilities would 
make individuals responsible for their actions and their 
effect on the course of future events. 

In the final chapter, which takes place entirely in the 
present with none of the extended flashbacks that 
dominate most of the novel, Burden rejects the Great 
Twitch by making a series of decisions based on his 
sense of moral responsibility. Ironically, and in keeping 
with the complexity of this novel that refuses to make 
morality easy or obvious, Burden shows that he is 
responsible for the outcomes of his actions by telling 
three lies. As a historical researcher, he insisted that he 
loved the truth and simply let the truth do its work in 
the world (pp. 127, 228). After he rejects the Great 
Twitch, Burden sees the need to create, or at least 
preserve, certain illusions (see Blair 1993). 

The first lie is told to Sugar Boy, Stark’s most 
devoted subordinate. Burden investigates a mysterious 
phone call that Adam Stanton reportedly received just 
before killing Stark. He discovers that Tiny Duffy was 
told by Sadie Burke about Stark’s affair with Anne. 
Duffy then called Stanton and told him he was chosen 
to direct the hospital so that his sister would sleep with 
the governor. This call was calculated to spur Stanton 
to kill Stark. After putting all this together, Burden 
meets Sugar Boy by chance in the public library (pp. 
418-23). He asks Sugar Boy what he would do if he 
discovered that Stanton was the pawn of another 
person who was truly responsible for Stark’s death. 
Sugar Boy states, for the first time not stuttering, “I’d 
kill the son of a bitch” (p. 420). He continues to insist 
that he would find and kill the man even after Burden 
points out that he would hang for it. 

At that moment, Burden sees Tiny Duffy in his mind, 
and Duffy winks at him. Duffy is a weak and amoral 
man whose understanding of political greatness is a 
poor imitation of Franklin Roosevelt’s appearance in 
suit, jewelry, and cigarette holder (pp. 213, 412-4). 
Stark considers his ability to make use of Duffy an 
index of his power (pp. 97-8), but the final irony is that 
he cannot control him. Stark makes Duffy lieutenant 
governor and puts him in charge of the “sweetenin’” 
that he considers a necessary lubricant for democratic 
politics (pp. 132, 393), but Duffy continually chafes 
under the restrictions Stark places on him and repeat- 
edly attempts to get the hospital contract thrown to 
Gummy Larsen for his own enrichment (pp. 213, 
231-3, 260, 319-21, 358-63, 386-7).17 Duffy, who 
thinks Stark’s treatment of him is simply arbitrary, tells 
Stanton about his sister in order to get Stark killed so 
he can become governor. 

In the library, Burden realizes that to use Sugar Boy 
to avenge Stark would be to use him as Duffy used 
Stanton. Burden is haunted by the realization that, 
according to the theory of the Great Twitch, there can 
be no difference between him and Duffy since there is 
no difference between any human being and another 
(p. 417). The idea of the Great Twitch becomes less 
comforting as Burden sees himself as an equal to Duffy, 
a cowardly political hack and murderer (pp. 309-11, 
420). He tells Sugar Boy that there is no other mur- 
derer after all. By lying, Burden assures himself that 
there is a difference between him and Duffy and also 
saves Sugar Boy from certain execution. 

Burden also lies to Lucy Stark (pp. 425-7) and to his 
mother (pp. 429-32). In each case, he agrees to leave 
in place certain illusions that make their lives more 
meaningful and less painful. In his mother’s case, he 
protects her from the terrible knowledge that he was 
responsible for the death of Judge Irwin, his true father 
and the only man she ever loved. He calls the lie “her 
wedding present.” He lets Lucy Stark continue to 
believe that Sibyl Frey’s baby was fathered by Tom 
Stark, even though he knows from his investigations 
that Sibyl had many boyfriends. When he says that the 
baby “favors” Tom, he gives Lucy a sense of p 
and hope. These lies are, quite possibly, his first noble 
actions (contrast King [1980] 1987, 150). 

Lucy tells Burden that she will name the baby after 
her husband “because... Willie was a great man” (p. 
426). She acknowledges that he did some bad things, 
although she never brings herself to say what they 
were; she “has to believe” he was a great man (pp. 
426-7). Burden thinks: “Yes, Lucy, you have to believe 
that. You have to believe that to live. I know that you 
must believe that... . For you see, Lucy I must believe 
that too. I must believe that Willie Stark was a great 
man” (p. 427). In allowing for Stark’s greatness, Bur- 
den implicitly denies the truth of the Great Twitch and 
allows that some people are better than others and 
some people worse. He reenters the discussion of 
virtue and vice, guilt and innocence, and begins to treat 
people in a way that recognizes moral responsibility 
while denying the view of egalitarianism that rests on 
turning all humans into interchangeable ciphers. 

Burden rejects the Great Twitch only after a period 


17 Duffy’s relationship to Stark illustrates Alexis de Tocqueville’s 
([1850] 1979, 220-1) discussion of the reciprocal corruption of the 
rulers and the ruled in democracies. 
of assessing what he has witnessed and done. He 
describes himself as a man who “had seen Lucy Stark 
and Sugar Boy and the Scholarly Attorney and Sadie 
Burke and Anne Stanton live and the ways of their 
living had nothing to do with the Great Twitch” (p. 
436). Burden rejects the Great Twitch because to 
believe in it requires one to assume that right, wrong, 
and moral responsibility do not exist, even though 
people act as though they do. The Great Twitch does 
not describe how people live or what they think and do. 
It is a philosophy so distant from the way that men and 
women live that it cannot be considered descriptive of 
their lives. Its “technical perfection” is hopelessly 
marred by its practical imprecision and horrible conse- 
quences (pp. 191, 353-4). Burden becomes convinced 
that belief in the Great Twitch is incompatible with life, 
and he rejects it as a doctrine that cannot be squared 
with the requirements for living (pp. 427, 435). 

Nietzsche argues that humans must, by an act of will, 
reject the consequences of science’s materialism, which 
threatens to transform them into “last men.” Burden 
becomes a last man, but he recovers himself by discov- 
ering that human beings are not simply determined by 
physical causes. He rejects the enervating egalitarian- 
ism that makes the last man appear to be inevitable. In 
contrast to Nietzsche, Warren treats Burden’s recovery 
as more attributable to his progress toward knowledge 
than to a simple act of will (contrast Krieger [1971] 
1987, 89-90). In fact, the novel shows that Stark’s 
politics, which attempt to overcome the defects of the 
world through sheer will, may encourage the view that 
will is all that separates arbitrary notions of good and 
bad. Without some referent outside the will, the prob- 
lems that Burden hoped to remedy themselves appear 
to be chosen arbitrarily and increasingly incapable of 
definition. The freedom of action that Stark promises 
opens up the prospect of making changes for the 
better, but it also provides Burden with a definition of 
better and worse that ultimately is untenable in both 
theory and practice (see discussion of Warren 1975 
below). 


ONE REGIME—ONE STORY 


In the end, we must explain why these two stories go 
together. Burden insists that they are, in fact, one story 
(pp. 157, 435). On the one hand, Willie Stark’s politics 
are the embodiment, perhaps the radicalization, of a 
certain type of democracy. Through Stark, the people 
are to define government’s ends by their needs and to 
achieve those ends by their tremendous strength. They 
could never concentrate that strength without Stark as 
their agent. On the night of the impeachment, “the 
people” are unified only insofar as they can chant, 
“Willie, Willie, Willie.” In order to concentrate their 
divergent interests, the people must assume the unity 
of those interests and allow Stark to handle the com- 
plex particulars for them. The essence of Stark’s polit- 
ical claim to rule, and therefore the essence of his 
policies, is his knowledge of the people and concern for 
their needs. 

On the other hand, we have Jack Burden, who is 
hard to understand. He even suggests in places that he 
does not know why he does the things that he does 
(e.g., p. 192). He wanders through a sort of philosoph- 
ical tour of late modernity, from a certain type of 
idealism, through a certain democratic Machiavellian- 
ism, to a terribly debilitating belief in material deter- 
minism that culminates in nihilism. In the end, Burden 
accepts responsibility for his own actions and deter- 
mines to live in the “agony of the will.” How can we 
link these two stories in a way that makes sense of them 
both? 

The key to the riddle lies in the moral consequences 
of Stark’s political action and the understanding of the 
world that the regime he creates gives to the people 
around him. We can begin with Stark’s political “edu- 
cation” because it brings to light much of the education 
he gives to others. When Willie discovers the corrup- 
tion at the bottom of his first campaign for governor, he 
gets drunk. The removal of the political taboo against 
dirty politics seemingly destroys the moral taboos 
against drunkenness, adultery, and violence. Without 
the restraint supplied by his previous adherence to a 
model of dignity, Stark recognizes that the very basis of 
his government, the legitimization of the desires of the 
people, provides no rationale by which he should 
restrain his own desires. 

Furthermore, so long as Stark believes that there is 
no other political alternative that can deliver what the 
people need, the first precondition of good government 
becomes the preservation of his own rule, and every 
attack upon his position threatens that precondition. 
Therefore, he thinks that he is compelled to use all 
means necessary to preserve his position when his 
personal indiscretions place his political standing in 
jeopardy (see Gray [1972] 1987, 96-97). Stark must 
overcome challenges from the ambitious who would 
contend with him for rule, the rich or self-interested 
who object to paying for his projects, all those who seek 
a public good defined in different terms than he would 
define it, the lovers of decency who object to his 
destructive methods, and the lovers of law who may 
share his goals but want them to be pursued through 
proper channels. Thus, Stark becomes obsessed with 
controlling men and women. He is forced into tyran- 
nical measures in his attempt to do so, but he is still 
ultimately doomed to failure. 

Stark’s tyrannical actions shape the character of both 
the people closest to him and the citizens more gener- 
ally (see pp. 327-8), but Warren is most concerned 
with Stark’s role in the corruption of Jack Burden. 
Burden’s struggles with the consequences of the Great 
Twitch view of human nature are ultimately tied to his 
recognition of the deep implications of Willie’s politi- 
cal ideas and activity. Burden follows Stark’s combina- 
tion of materialism and democracy to what appears to 
be a logical conclusion in nihilism and falls into the 
malaise of the human spirit that Nietzsche character- 
ized as the mark of the last man. Burden is fighting a 
battle with that malaise long before he joins Willie, and 
he thinks that Stark’s energetic devotion to action will 
save him from the sapping emptiness in which he lives. 

Instead, Burden discovers in Stark’s system of ever 
malleable principles the source of all his despair. 
Knowledge of the emptiness of life may briefly be 
“tonic and bracing” because it makes the knower aware 
of the joke and innocent of all crimes, but it can never 
enable a person to live truly for the future, and a 
concern for decent, democratic government is inextri- 
cably tied to recognizing the future consequences of 
political actions and accepting responsibility for those 
consequences. Burden comes to these realizations only 
after watching the agonizing drama of his friends and 
close relations and recognizing that their actions are 
not explained by the Great Twitch. In many respects, 
he realizes it far too late. 

If Burden is eventually able to overcome his adher- 
ence to the enervating principles that he discovers in 
Stark’s political rhetoric, albeit only after they exact a 
terrible toll, we might conclude by asking whether 
there is any political redemption that can accompany 
the personal redemption of Jack Burden. Warren’s 
presentation of democratic politics holds out little hope 
for a better arrangement of things in the state. The old 
politics of Governor Stanton can never provide an 
activist government that meets the needs of the people, 
but it is clearly more decent than the administrations of 
MacMurfee, Harrison, Duffy, or Stark.18 The first three 
were neither decent nor effective, and Stark’s effective- 
ness is purchased at a terrible cost. Before Upton, 
however, when Stark was decent, he was wholly 
ineffective. If effectiveness in government, defined as the 
ability to improve materially the lives of the people and 
to ensure their comfort and security, can only be gained 
by means that corrupt officeholders and citizens alike, 
then the outlook for progressive democracy is quite 
bleak. 

In two places near the end of the novel, Warren 
suggests, but only suggests, that there may be hope for 
a better politics. First, Stark’s last words to Jack 
Burden are: “It might have been all different Jack” (p. 
400). To discover how it might have been different, we 
must begin on the night after Tom Stark’s injury. As 
they wait for news from Tom’s surgeon, Stark tells his 
estranged wife that he is going to name the hospital for 
Tom rather than himself. Lucy is truly apolitical and 
thinks all human needs can be satisfied in private life 
(pp. 61-3, 155-6, 334-5). Her response to the sugges- 
tion illustrates her distaste for what she takes to be the 
ends of all political activity: 


Those things don’t matter. Oh, Willie, don’t you see? 
Those things don’t matter. Having somebody’s name cut 
on a piece of stone. Getting it in the paper. All those 
things. Oh, Willie, he was my baby boy, he was our baby 
boy, and those things don’t matter, they don’t even matter, 
don’t you see (p. 381)? 


18 Even Judge Irwin's crime resulted from his sense of himself as one 
of the gentlemen. He needed to preserve the symbols of a man of 
honor, his plantation and his house on the Landing, and he acted 
dishonorably to do so. Similarly, Governor Stanton was, in his own 
mind, no doubt being honorable when he chose to protect his friend 
from the consequences of his action. The novel suggests that even the 
government that is too honorable to be active is occasionally 
compelled to act dishonorably. 
She thinks Willie is in politics to gain fame. She is 
certain that the reason for building the hospital is to 
name it for himself or a loved one, and she insists that 
Stark ought to focus on the concerns of home and 
family. It is better to have one’s children than to lose 
them in the pursuit of great projects that may be 
named for them. 

Although Stark does not answer his wife, he undergoes 
a great transformation after Tom’s surgery. He 
breaks off his affairs with Sadie Burke and Anne 
Stanton (pp. 391, 410-1). He cancels the corrupt deal 
with Gummy Larsen to build the hospital in exchange 
for selling out MacMurfee (pp. 386-7). Yet, he does 
not accept Lucy’s position that the projects he accom- 
plishes through politics do not matter. He will build the 
hospital, and he is busily assuring passage of his tax 
reforms three days after Tom’s injury, when he is shot 
by Adam Stanton (pp. 394-7). He does not forswear 
political action for private decency. Because Stark is 
interested in more than fame, he cannot seriously 
consider withdrawing from politics. We must assume 
that he still finds the standard of justice in the needs of 
the people and that he still thinks government is 
necessary to overcome the neediness at the core of 
human existence. He does, however, appear to recover 
some of the concern for dignity that marked his earlier 
political career. His reformation, which never has a 
chance to succeed, must be understood as an attempt 
to live more decently while continuing to be politically 
effective. 

The second suggestion of a better politics is the 
passing reference to Hugh Miller in the novel’s summation. 
Burden claims: “It looks as though Hugh Miller 
will get back into politics, and when he does I’ll be 
along to hold his coat. I’ve had some valuable experi- 
ence in that line” (p. 436). This statement appears out 
of place to some commentators, who consider it wholly 
inconsistent with the rest of the novel (e.g., King [1980] 
1987, 152-3). Miller resigns from office when told that 
Stark is going to save State Auditor Byram White from 
impeachment (p. 135). Stark argues that if he allows his 
opponents in the legislature to dictate to him on this, 
they will recognize his weakness and destroy his legis- 
lative accomplishments. To preserve those accomplish- 
ments, he has to “take the damned government away 
from the behind guys and keep it away from ’em. 
Whatever way you can” (pp. 136-7). Miller never 
disputes this reasoning, but he insists that White must 
be punished for his felony. When he is not, Miller 
leaves the administration and remains absent until this 
reference at the very end of the novel. 

If Miller represents the honorable man who wants 
decent democratic government capable of meeting the 
needs of the people, then the novel suggests that his 
chances for success are small. Warren gives no evi- 
dence that Stark’s positions are wrong or that demo- 
cratic government does not require the condoning of 
certain dishonorable practices. Nevertheless, Miller 
appears unwilling to cross certain lines, even if this 
self-control concedes power to less scrupulous or less 
public-spirited adversaries. He represents an approach 
to democratic government that is willing to lose power 
in order to remain honorable. He prefers not to 
accomplish certain ends, even if he genuinely believes 
they are worthy, rather than to accomplish them in the 
wrong way. This conviction must rest on some set of 
principles that are not grounded solely in the needs of 
the people, are not infinitely malleable, and therefore 
might not engender the nihilism that Stark’s politics 
create in Jack Burden. There is no indication in the 
novel that the people will provide or enforce any 
standard of personal honor for their representatives or 
will react to dignified appeals to their reason. Furthermore, 
when Stark concedes that there is no dignity in 
democratic politics, he further legitimizes a view of 
human nature that makes all forms of dignity and 
principle suspect. Given these facts, it is not at all clear 
that Miller’s new foray into politics will be more 
successful than Stark’s pre-Upton campaign. 

All the King’s Men is not a very hopeful novel. We are 
confronted with the limitations of certain all too famil- 
iar forms of democratic government, and we are given 
only unsubstantiated hints that perhaps, maybe, a 
decent democratic statesman may emerge and be suc- 
cessful at fulfilling the people’s legitimate needs with- 
out corrupting those he employs and serves. The novel 
shows that the complex and contradictory needs of the 
people and the harsh realities of a system based on a 
combination of egalitarian political theory, an eco- 
nomic orientation toward comfort and physical satis- 
faction, and a system of electoral politics can corrupt 
even those persons most bent on doing good. Further- 
more, this corruption has an especially corrosive effect 
on society’s most thoughtful members, who are apt to 
see in the realities of modern, materialist, democratic 
politics the sources of nihilism. The only truly hopeful 
lesson of the novel lies in the personal conquest of 
despair by Jack Burden, whose redemption requires a 
tremendous effort as well as the blood of his friends 
and of his father. There is no reason to suspect that this 
effort can be expected of society as a whole. 

Burden is, in fact, the only character who overcomes 
the obstacles that stand in the way of such redemption 
without some fundamental delusion, whether provided 
by others or imposed on himself. Burden has to test his 
philosophical thoughts by directly confronting the re- 
quirements for activity in this world that are embodied 
in “the man of fact,” Willie Stark (p. 436). It is very 
important to understand Stark’s political activity and 
Burden’s reflections on it because Burden’s crises are 
those of a thoughtful man seeking to commit himself to 
action in the midst of our political problems. Warren 
identifies “the terrible division” of the man of ideas and 
the man of fact, “each incomplete without the other” 
and “doomed to destroy each other,” as the crisis of 
our age (see Gray [1972] 1987, 92-4, 100-1; Mizener 
[1967] 1987, 49). 

King ([1980] 1987, 154), however, characterizes Bur- 
den’s “redemption” as the culmination of “a private 
quest for identity” and denies that the end of the novel 
is to be properly understood as “political” in any 
meaningful sense. Indeed, Warren ([1953] 1977, 97) 
suggests that “the book, however, was never meant to 
be a book about politics.”19 We should be hesitant to 
accept this view of the matter because Burden’s crisis is 
precipitated by his study of political action and is the 
result of his intermittently suppressed but genuine 
longings for effective democratic reform (pp. 123-5, 
392-3). His return to politics with Hugh Miller is 
premised on his rejection of the nihilism that he 
confronted in his association with Stark’s political 
activity. The novel shows that Burden’s views, both 
political and moral, are closely tied to the view of the 
world that animates Stark’s political action and that is 
communicated through Stark’s speeches to the people 
of his state. The shape of Burden’s crisis, and the 
insight through which he overcomes that crisis, are 
intricately tied to the regime in which he lives. 

We might say that this work connects politics in the 
common sense of the word—the activities of running 
for office and conducting the affairs of government—to 
politics in a broader sense, that is, the many ways in 
which our understanding of the world is shaped by the 
underlying presuppositions of our social and political 
environments. Warren confirms this in the context of 
his comment that All the King’s Men was not meant to 
be “a book about politics.” In that passage, he claims 
that the original idea for Stark emerged from his 
ruminations on Talos in Spenser’s Faerie Queene, a 
character Warren describes as “the pitiless servant of 
the knight of justice.” Stark is often pitiless and ruth- 
less in his pursuit of justice as modern democracy is all 
too inclined to conceive of it, namely, as the needs of 
the people: “Your need is my justice” (p. 262). Warren 
([1953] 1977, 97) claims that “Talos is the kind of doom 
that democracy can bring upon itself” (see Blotner 
1966, 225). This doom may be realized if democracies 
shape the type of persons who find their perception of 
their own neediness to be the only determinant of 
policy and the only justification for all political action. 

The novel is political in a sense that both includes 
and transcends the immediately political narrative of 
Stark’s career. In Democracy and Poetry, Warren (1975, 
31) states: “Our poetry...has told us, directly or 
indirectly, consciously or unconsciously, that we are 
driving toward the destruction of the very assumption 
on which our nation is presumably founded.” This 
assumption, according to Warren (who purports to be 
paraphrasing Thomas Jefferson), is that democracy can 
aspire to be “a society in which free men—of independent 
self—would exercise their franchise in the light of 
reason” (p. 4). This assumption requires that each 
citizen be both free and aware of his moral responsi- 
bility for his own private and public acts. Warren 
argues that this is the only type of democracy worth 
having and that a democracy of “mass men” (or last 
men) is, in fact, a farce. 

All the King’s Men shows us the torturous path by 
which Jack Burden becomes “an independent self” 
capable of rational self-appraisal and moral responsibility. 
This difficult but necessary self-understanding is 
presented within a literary framing of the philosophical 
problem that Warren considers the greatest threat to 
the viability of modern democracies. He shows us the 
crisis of our age by revealing its specific features in our 
nation and our political culture. Those committed to 
the study of ideas about politics can hardly fail to heed 
such a discussion. 

19 This passage is a favorite of certain interpreters of Warren, who 
insist that his novels are clandestine arguments for a return to a more 
thorough-going acceptance of a certain Christian view of the world. 
See Ruoff [1957] 1977 and Shepherd [1970] 1987. 


CONCLUSION 


From the point of view of the classical approach to 
the regime, All the King’s Men is a great political novel 
on two levels. First, the political superstructure is a 
crisis in the political life of an American-style demo- 
cratic government. This crisis results from the rise to 
power of a demagogue. Willie Stark, however, does 
not simply seek power for its own sake. He resorts to 
demagoguery as a response to his inability, and the 
demonstrated inability of the political system as a 
whole, to meet the needs of “the people.” This type of 
popular regime creates the political problems that push 
Stark to demagoguery. 

Second, the story of Jack Burden demonstrates how 
Stark’s demagoguery itself transforms people around 
him by changing their view of the nature of the world 
and the requirements of good action or justice. Burden 
is shaped by Stark’s regime, but he learns, albeit with 
some difficulty and at great cost, to understand himself 
in a way that is more true to how human beings actually 
think of themselves than is the material reductionism 
that Stark seems compelled to adopt. In doing so, 
Burden gains the ability to act politically in a new way 
that takes its bearings from outside the worldview that 
modern democracies tend to create. Ironically, this new 
orientation may be an indispensable requirement for 
maintaining a healthy democratic system. This reading 
of the novel makes sense of all its elements and how 
they properly belong together. It shows the work to be 
a fit object for careful study by political scientists who 
seek to understand how our politics informs and shapes 
our characters. 
We report a series of studies of historical reasoning among professional observers of world politics. 
The correlational studies demonstrate that experts with strong theoretical commitments to a 
covering law and cognitive-stylistic preferences for explanatory closure are more likely to reject 
close-call counterfactuals that imply that “already explained” historical outcomes could easily have taken 
radically different forms. The experimental studies suggest that counterfactual reasoning is not totally 
theory-driven: Many experts are capable of surprising themselves when encouraged to imagine the 
implications of particular what-if scenarios. Yet, there is a downside to openness to historical contingency. 
[...] too much subjective probability to too many scenarios. We close by defining good judgment as a 
reflective-equilibrium process of balancing the conflicting causal intuitions primed by complementary factual 
and counterfactual posings of historical questions. 


[...] advanced the controversial thesis that history, 
properly understood, is a prime candidate for 
reduction to the laws of social science. Events can only 
be considered “explained” when they have been assim- 
ilated into syllogisms that meet three conditions: (1) 
causally efficacious covering laws serve as major pre- 
mises, (2) carefully abstracted antecedent conditions 
that prevail at given times and places serve as minor 
premises, and (3) the conclusions follow as the inexo- 
rable or at least likely result of the hypothesized laws 
operating on the specified antecedents. For better or 
for worse, and many historians believe the latter (Dray 
1989), this covering-law model captures the approach 
of many social scientists to the explanation of historical 
trends and patterns (Goldstone n.d.; Mokyr n.d.). This 
kind of disciplinary tension is familiar: between idio- 
graphic and nomothetic camps, between particularizers 
who complain that theorists neglect critical complexi- 
ties in their eagerness to assimilate history into their 
favorite explanatory templates, and generalizers who 
complain that particularizers are so immersed in idio- 
syncratic detail that they miss the big theoretical pic- 
ture. 

This article explores a recurring source of disagree- 
ment between generalizers and particularizers: the 
soundness of close-call counterfactual scenarios that 
imply that, with only minimal rewriting of antecedent 
conditions, history could have been rerouted down 
different, sometimes radically different, event paths. 
Close-call counterfactuals are often focal points of 
disagreement for two reasons. 

First, Fogel (1964) and Fearon (1991), among oth- 
ers, demonstrate that all causal inference from history 
ultimately rests on counterfactual claims about what 
would or might or could have happened in hypothetical 
worlds to which scholars have no direct empirical 
access. This is not to say that evidence is irrelevant. A 
variety of empirical and logical criteria can be used to 
differentiate more from less compelling counterfactual 
claims (see the essays in Tetlock and Belkin 1996). But 
disputes over the relative soundness of competing 
counterfactuals are often notoriously resistant to con- 
sensual resolution. Examples include the seemingly 
interminable debates over the avertability or inevitabil- 
ity of both sweeping historical transformations, such as 
the European rise to global hegemony (Landes 1997; 
Mokyr n.d.), and highly specific events, such as the 
onset of World War I or the end-game moves of the 
Cold War (Lebow 2001). 

Second, close-call counterfactuals assign critical 
roles to causes that covering-law theorists tend to 
disparage as trivial, impossible to categorize, and best 
relegated to error variance (see McCloskey 1991). The 
quest for parsimonious laws governing political pro- 
cesses becomes progressively more hopeless to the 
degree that the triumph of civilizations hinges on 
horseshoe-nail (or butterfly) effects in battles; the 
outbreak of world wars hinges on royal carriage drivers 
making a wrong turn in downtown Sarajevo or an 
emotionally unstable young man gaining admission to a 
Viennese art school; or the outcome of the Cuban 
missile crisis hinges on the interpersonal chemistry 
among presidential advisers. 

We work from the uncontroversial premise that 
there is an ineradicable element of subjectivity in these 
debates over historical causality. Accordingly, a strong 
case can be made for rigorously scrutinizing potential 
cognitive biases in how historical observers go about 
evaluating the relative plausibility of competing what-if 
scenarios. We report two sets of empirical studies, one 
correlational and the other experimental. The correla- 
tional research explores the degree to which profes- 
sional observers of world politics rely on abstract 
covering laws in assessing what was possible at specific 
times and places. These studies demonstrate that, 
across an array of contexts, the more credence observers 
place in relevant covering laws, and the stronger 
their cognitive-stylistic preferences for explanatory clo- 
sure, the more likely they are to be guided by those 
covering laws in judging what could have been, and the 
less likely they are to make judgments of historical 
contingency on a case-by-case basis. 

Whereas the correlational studies rely on naturally 
occurring variation in theoretical beliefs and cognitive- 
stylistic preferences to illustrate the deductive, top- 
down character of counterfactual reasoning and the 
power of covering-law preconceptions to shape the 
conclusions that experts draw about what was histori- 
cally possible, the experimental studies encourage ex- 
perts to perform, in effect, an unnatural cognitive act: 
to give more thought than they normally would to 
alternative paths that history could have followed. The 
goal is to test the notion that counterfactual reasoning 
also can take an inductive, bottom-up form. In other 
words, the mental processes of imagining specific coun- 
terfactual scenarios can induce us to change our mind 
and become more circumspect about the power of our 
favorite causal generalizations to delimit the range of 
historical possibilities. Experiment 1 asks foreign policy 
experts to consider various scenarios, supported by 
varying amounts of detail, that “undo” the peaceful 
resolution of the Cuban missile crisis. The net effect 
was to increase their perceptions of the potential for 
alternative, more violent, endings. Experiment 2 shows 
that the manipulations need not be heavy handed and 
that no new information need even be presented. 
Experts were asked to draw on their own historical 
knowledge in searching for possible pathways to more 
violent endings of the crisis, and then to unpack these 
possibilities into progressively more differentiated sub- 
sets; the net effect was inflation of the subjective 
probabilities of those alternative outcomes. Moreover, 
consistent with Tversky and Fox’s (1995) support the- 
ory, the more extensive the unpacking, the greater was 
the resulting inflation of subjective probabilities. Ex- 
periment 3 demonstrates that these effects are not 
peculiar to the relatively recent and brief Cuban missile 
crisis. It provides a conceptual replication in the con- 
text of the debate about the rise of the West, the 
historical transformation over several centuries of 
Western Europe from cultural backwater to global 
hegemon. 
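
The unpacking effect described above can be made concrete with a minimal sketch. It assumes, as support theory predicts, that the probabilities judged for the unpacked subsets sum to more than the probability judged for the packed hypothesis. All numbers and scenario labels below are hypothetical illustrations, not data from the experiments.

```python
# Illustrative sketch of probability inflation under unpacking.
# Every judgment below is a made-up number, not a result from the studies.

def total_judged_probability(judgments):
    """Sum the probabilities assigned to a set of exclusive scenarios."""
    return sum(judgments.values())

# Hypothetical judgment of the packed hypothesis.
packed = {"crisis ends violently": 0.20}

# Hypothetical judgments after the same hypothesis is unpacked into subsets.
unpacked = {
    "violent ending via air strike": 0.10,
    "violent ending via naval clash": 0.09,
    "violent ending via accident": 0.08,
    "violent ending via other paths": 0.06,
}

# Positive inflation means the unpacked judgments sum to more than the packed one.
inflation = total_judged_probability(unpacked) - total_judged_probability(packed)
print(f"packed = {total_judged_probability(packed):.2f}, "
      f"unpacked = {total_judged_probability(unpacked):.2f}, "
      f"inflation = {inflation:.2f}")
```

In this sketch, a finer partition with more subsets would simply add more entries to `unpacked`; whether the total keeps rising is an empirical question that the experiments address.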

Taken together, the results give real-world empirical 
content to a cognitive account of the construction of 
historical knowledge that specifies the mental pro- 
cesses by which observers apply causal generalizations 
to history and judge the acceptability of close-call 
counterfactuals that circumscribe those generaliza- 
tions. Historical observers may indeed often be prison- 
ers of their preconceptions (Tetlock 1999), but they 
hold the keys to their own release. They are not 
constrained to explain only what happened. We are 
endowed with the imaginative capacity to envision 
alternative paths that history could have taken. The 
more energy we direct to elaborating dissonant alter- 
natives, the weaker is the hold of our preconceptions 
(judging by the dwindling regression coefficients) on 
our judgments of contingency and possibility. But this 
liberation has a price. The more effort we devote to 
unpacking counterfactual scenarios, the more contra- 
dictory are our resulting judgments of history. Our 
findings suggest that historical observers of world pol- 
itics confront an inescapable trade-off between being 
closed-minded but logically coherent or open-minded 
but logically incoherent. 


THE CORRELATIONAL STUDIES: APPLYING 
ABSTRACT COVERING LAWS TO SPECIFIC 
HISTORICAL CASES 


Guiding Theory 


From a neopositivist perspective on historical explana- 
tion (see Hempel 1965), theorists are under absolutely 
no logical obligation to apply covering laws to any 
specific event. Covering laws in social science are best 
viewed both as probabilistic and as applicable to sets of 
events, not individual occurrences. Logical necessity to 
the side, however, there may well be considerable 
psychological pressure on observers to achieve closure 
in their historical explanations. 

As Herbert Simon (1996) and colleagues have 
shown, even highly trained experts function like bound- 
edly rational satisficers who try to keep the number of 
variables and amount of information that must be 
integrated to a reasonable minimum. Also, a growing 
psychological literature on cognitive style reveals 
marked individual differences in the motivation to 
achieve simplicity and closure in the characterization 
and explanation of events (Kruglanski and Webster 
1996; Suedfeld and Tetlock 2001). Experimental and 
field research finds, for example, that people who score 
high on self-report measures of the need for closure 
tend to rely on prior beliefs in solving unfamiliar 
problems and in evaluating dissonant arguments, pre- 
fer parsimonious interpretations of evidence that in- 
voke as few causal constructs as possible, and prefer 
deterministic accounts that downplay probabilistic 
qualifiers. This body of work suggests that professional 
observers of world politics with a strong need for 
closure should find close-call counterfactuals vexing to 
the degree these imply that minor tinkering with 
antecedent conditions could have undone major out- 
comes that the observers felt they had already comfort- 
ably assimilated into favorite covering-law schemes. 

Exactly how do experts resist theoretically subversive 
counterfactuals? From a strictly logical perspective, 
there are three possible lines of defense (Lewis 1973). 
From a psychological perspective, especially one 
grounded in cognitive consistency theory, there are 
good reasons for expecting experts to mobilize all three 
whenever experts feel core theoretical commitments 
are under siege (see Jervis 1976 on belief-system 
overkill). The lines of defense are as follows. 

First, challenge the mutability of the antecedent. For 
example, insist that it is hard to imagine Kennedy 
heeding his more hawkish advisers during the Cuban 
missile crisis. This defense often seems contrived, 
however, even to those who value closure. All but the 
most implacably Laplacean determinists recognize 
that it is exceedingly implausible to insist that each and 
every microscopic detail surrounding key events had to 
take the form it did. 

Second, challenge the connecting principles that 
bridge antecedent and consequent. Counterfactual ar- 
guments are inherently elliptical; it is impractical to 
spell out all the background assumptions that must be 
satisfied to sustain even the simplest antecedent-con- 
sequent linkages. Consider the claim that if Kennedy 
had heeded his hawkish advisers during the missile 
crisis, and attacked Soviet bases in Cuba, World War 
III would have ensued. Deterrence theorists can 
counter that war was not inevitable because the Soviets 
recognized their strategic inferiority vis-à-vis the 
United States. They would have grudgingly acquiesced 
to the fait accompli and forbidden their ground forces 
in Cuba to use nuclear-tipped Luna missiles against the 
American armada. 

Third, concede that the antecedents may be mutable 
and that connecting principles may be sound, but still 
insist that history would have been thrown off track 
only temporarily, and the proposed changes would 
have had little long-term significance. This defense is 
designated a second-order counterfactual because it 
undoes the undoing of history implied by the original 
scenario. For example, even if Genghis Khan had lived 
and the Mongols had mauled Western Europe, the 
resilient Europeans would have bounced back quickly. 


Hypotheses 


The preceding argument lays the conceptual ground- 
work for the two hypotheses tested in each of the three 
correlational studies. The covering-law hypothesis pre- 
dicts a main effect: The more confidence an expert 
places in any given causal generalization, the more 
likely s/he is to invoke all three lines of belief-system 
defense against close-call scenarios that undercut the 
applicability of the generalization to historical out- 
comes previously thought covered by that generaliza- 
tion. The cognitive-style hypothesis predicts an inter- 
active effect: Experts with a strong need for 
explanatory closure are likely to invoke all three de- 
fenses against dissonant scenarios. 
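
One way to state the two hypotheses formally is as a regression with a product term. The notation is an assumption introduced here for illustration (it does not appear in the original): Resistance is an expert's resistance to dissonant close-call scenarios, Belief is endorsement of the relevant covering law, and Closure is the need for explanatory closure.

```latex
\text{Resistance}_i = \beta_0 + \beta_1\,\text{Belief}_i + \beta_2\,\text{Closure}_i
  + \beta_3\,(\text{Belief}_i \times \text{Closure}_i) + \varepsilon_i
```

Under this sketch, the covering-law hypothesis corresponds to $\beta_1 > 0$ and the cognitive-style hypothesis to $\beta_3 > 0$.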


Research Design and Logic of Analysis 


The correlational studies probe reactions to close-call 
scenarios that undercut three categories of covering 
law. Study 1 targets variants of nuclear deterrence 
theory that stress the power of mutual assured destruc- 
tion to induce even minimally rational actors to act 
with self-restraint (see, e.g., the instructive exchange in 
Sagan and Waltz 1995). Scenarios that imply we were 
on the brink of nuclear war at numerous junctures in 
the Cold War should seem far-fetched to theorists who 
posit the robustness of nuclear deterrence. Looking 
back on the Cold War, these theorists should find it 
hard to imagine that either superpower ever would 
have allowed a crisis to escalate into a nuclear war (just 
as, looking forward, they find it hard to worry about the 
dangers of nuclear proliferation). 
Study 2 targets variants of the neorealist balancing 
hypothesis. That is, when one state threatens to be- 
come too powerful and to dominate the entire inter- 
national system, other states—rational, self-preserving 
actors as they are posited to be—coalesce against it, 
thereby preserving the balance of power (cf. Vasquez 
1997). From this standpoint, the failure of aspiring 
hegemons, such as Philip II, Napoleon, or Hitler, is 
predetermined by a fundamental law of world politics, 
and close-call claims that these bids to dominate 
Europe might have succeeded are misinformed. 

Study 3 targets a pair of covering laws that loom 
large in debates on the rise and fall of civilizations. The 
first is that history, in the long term, is an efficient 
process of winnowing out maladaptive forms of social 
organization. The second is that Western societies, 
with their emphasis on the rule of law, property rights, 
free markets, and the practical applications of science, 
are better adapted to prevail in long-term competition 
with other civilizations (Mokyr n.d.). It is no accident 
that, between 1500 and 1800, Europeans came to wield 
global influence so disproportionate to their numbers. 
The rise of the West, at the expense of the “Rest,” 
reveals the fundamental superiority of Western values 
and institutions. Close-call counterfactuals that pur- 
port to undo the rise of the West by minor mental 
mutations of antecedent conditions—more aggressive 
Islamic incursions into France in the eighth century or 
less isolationist Chinese emperors in the fifteenth 
century—are merely whimsical excursions into what-if 
history. 


Methods and Measures 


Participants were drawn from overlapping populations 
of scholars who specialize in diplomatic and military 
history, security studies, and international relations. 
The 87 participants in correlational studies 1 and 2, 
which focus on neorealist balancing and nuclear deter- 
rence, were randomly sampled from Division 18 (In- 
ternational Conflict) and Division 19 (International 
Security and Arms Control) of the American Political 
Science Association and from the Society of Diplo- 
matic and Military Historians. The 63 participants for 
study 3, which focuses on the rise of the West, were 
randomly drawn from the membership roster of the 
World History Association. All respondents were con- 
tacted by either postal mail or e-mail, and they were 
promised both anonymity and detailed feedback on the 
purposes of the study. The response rate for studies 1 
and 2 was 29%; for study 3, 31%. Assessing selection 
bias as a result of the relatively low response rates was 
problematic because of the limited information avail- 
able on participating and nonparticipating individuals 
in our sample. Available results suggest, however, only 
slight overrepresentation of academics employed in large 
research universities. 


Covering-Law Beliefs. These measures assessed en- 
dorsement of the covering laws on nine-point agree- 
disagree scales. For neorealist balancing there were 
four items. (a) “For all the talk about a new world 
order, world politics is still essentially anarchic—the 
strong do what they will and the weak accept what they 
must”; (b) “Whenever one state starts to become too 
powerful, other states find a way of combining forces 
and preventing it from dominating them”; (c) “The 
security policies of states are often driven by morality, 
not just by rational calculations of the impact of those 
policies on the balance of power”; (d) “It is naive to 
suppose that the failure of would-be conquerors such 
as Philip II, Napoleon, and Hitler to achieve lasting 
dominance in Europe was predetermined by balance- 
of-power politics—it might just have been an accident.” 

For nuclear deterrence there were three items. (a) 
“For all the talk about the risk of nuclear accidents, the 
U.S.A. and U.S.S.R. never really came close to nuclear 
war”; (b) “Nuclear weapons played a key role in 
moderating the behavior of both the American and 
Soviet governments during the Cold War”; (c) “It is 
unrealistic to assume that leaders working under great 
stress will always act with great restraint in crises that 
raise the risk of the use of nuclear weapons.” 

Two items dealt with the survival of the fittest 
civilizations. (a) “History is, in the long run, an efficient 
process of winnowing out maladaptive forms of social 
organization”; (b) “Western societies and institutions, 
with their greater emphasis on the rule of law, property 
rights, free markets, and the practical applications of 
science, were better adapted to prevail in long-term 
competition with other civilizations.” 


The Need for Explanatory Closure. Respondents gave 
answers on a nine-point agree-disagree scale to nine 
items: (a) “I think that having clear rules and order at 
work is essential for success”; (b) “Even after I have 
made up my mind about something, I am always eager 
to consider a different opinion”; (c) “I dislike questions 
that can be answered in many different ways”; (d) “I 
usually make important decisions quickly and confi- 
dently”; (e) “When considering most conflict situa- 
tions, I can usually see how both sides could be right”; 
(f) “I prefer interacting with people whose opinions are 
very different from my own”; (g) “When trying to solve 
a problem I often see so many possible options that it 
is confusing”; (h) “Scholars are usually at greater risk 
of exaggerating the complexity of political processes 
than they are of underestimating the complexity of 
those processes”; (i) “Isaiah Berlin [1997] classified 
intellectuals as hedgehogs or foxes. A hedgehog knows 
one big thing and tries to integrate the diversity of the 
world into a comprehensive and parsimonious vision 
whereas a fox knows many small things and tries to 
improvise explanations on a case-by-case basis. I would 
place myself toward the hedgehog or fox style of 
thinking about politics.” 
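
A composite need-for-closure score could be computed along the following lines. This is a sketch under stated assumptions: the text does not say which items were reverse-keyed or how items were aggregated, so the keying (`REVERSE_KEYED`), the averaging rule, the coding of item (i) with 9 as the hedgehog pole, and the example respondent are all hypothetical.

```python
SCALE_MAX = 9  # nine-point agree-disagree scale, as described in the text

def reverse(score, scale_max=SCALE_MAX):
    """Flip a 1..scale_max rating so that high values indicate the opposite pole."""
    return scale_max + 1 - score

def closure_score(responses, reverse_keyed):
    """Average the item ratings after flipping the reverse-keyed items."""
    adjusted = [
        reverse(rating) if item in reverse_keyed else rating
        for item, rating in responses.items()
    ]
    return sum(adjusted) / len(adjusted)

# Hypothetical keying: items worded toward openness are flipped,
# so that a high composite score means a high need for closure.
REVERSE_KEYED = {"b", "e", "f", "g"}

# Hypothetical respondent (item letter -> rating, 9 = strongly agree).
respondent = {"a": 8, "b": 2, "c": 7, "d": 8, "e": 3,
              "f": 2, "g": 1, "h": 7, "i": 8}

score = closure_score(respondent, REVERSE_KEYED)
print(round(score, 2))
```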

Beliefs about Close-Call Counterfactuals. The first set 
of scenarios undid the outcomes of either World War I 
or II: (a) “If Germany had proceeded with its invasion 
of France on August 2, 1914, but had respected the 
Belgian neutrality, Britain would not have entered the 
war, and France would have quickly fallen”; (b) “If the 
German High Command had implemented the Schlief- 
fen Plan more aggressively in August, 1914, the miracle 
of the Marne would have been impossible, and Paris 
would have fallen”; (c) “If Germany had avoided 
antagonizing the United States by meddling in Mexico 
and by initiating unrestricted submarine warfare, the 
United States would not have entered World War I and 
Germany would have prevailed against the French and 
British in its spring offensive of 1918”; (d) “If Hitler 
had not invaded the Soviet Union and concentrated 
German resources on defeating the British, Germany 
would have defeated Britain”; (e) “If Hitler had more 
consistently focused on taking Moscow in the summer 
of 1941, he could have knocked the Soviet Union out of 
the war”; (f) “If Hitler had not declared war on the 
United States on December 11, 1941, the British and 
the Soviets could never have defeated Nazi Germany.” 

The second set of counterfactuals explored the fea- 
sibility of the Cold War becoming thermonuclear: (a) 
“If Stalin had lived several years longer (surviving his 
stroke but in an irrational state of mind that encouraged 
high-risk adventures), World War III could easily 
have broken out in the mid-1950’s”; (b) “If bad weather 
had delayed the discovery by U-2 reconnaissance 
planes of Soviet missiles in Cuba until the missiles were 
operational, the Soviets would have refused American 
demands to dismantle and withdraw the weapons”; (c) 
“If the Soviets had refused to withdraw their missiles, 
the U.S. would have launched air strikes against the 
Soviet bases”; (d) “If the U.S. had launched such air 
strikes, the Soviet commanders in Cuba would have 
launched at least some missiles at the eastern seaboard 
of the United States”; (e) “If the Soviets had fired 
Cuban-based nuclear missiles at American cities, retal- 
iatory nuclear strikes would have been launched at 
Soviet cities”; (f) “If Soviet hardliners had taken charge 
of the Communist Party in the mid-1980s, the Cold 
War—far from ending peacefully and quickly—would 
have intensified.” 

The third set of counterfactuals explored the feasi- 
bility of unmaking the West through hypothetical in- 
terventions that either enfeebled Europe or empow- 
ered rival civilizations: (a) “If China had had, at key 
junctures, emperors more sympathetic to economic 
and technological development, it could have emerged 
as the world’s first superpower”; (b) “If the Mongols 
had continued their advance into central and western 
Europe and not been distracted by the death of Geng- 
his Khan, later European development would have 
been impossible”; (c) “If Islamic armies had made a 
serious attempt to conquer France and Italy in the 8th 
century, later European development could have been 
radically side-tracked”; (d) “If the Black Death had 
been even more lethal, killing, say, 70% of the popu- 
lation, Europe could not have arisen as the dominant 
region in the second half of the millennium.” 

For each scenario, experts were asked to make three 
judgments on nine-point scales. (1) “How plausible was 
the antecedent condition of the argument? (Do we 
have to ‘re-write’ a little or a lot of history?)” (2) 
“Assuming the plausibility of the antecedent, how 
likely was the hypothesized consequence?” (3) “Assuming 
the plausibility of the hypothesized consequence, 
what would the long-term ramifications have 
been?” 


TABLE 1. Predicting Resistance to Close-Call Counterfactuals 

Covering Law                     Predictors                                   n     R² 
Neorealist balancing             Neorealist balancing; Cognitive style; 
                                 Balancing x style                            87    0.47 
Nuclear deterrence               Nuclear deterrence; Cognitive style; 
                                 Deterrence x style                           86    0.43 
Adaptive advantage of West       Adaptive advantage of West; Cognitive style; 
                                 West x style                                 63    0.41 

[coefficient columns not recoverable] 

Note: [...] neorealist balancing scales were correlated quite highly (r = .57). *p = 
.05, **p = .01, ***p = .001. 


Findings 


Correlational analyses revealed that experts invoked all 
three belief-system defenses against dissonant close- 
calls but preferred certain defenses. Two of them— 
challenge the logic of the connecting principles and 
generate second-order counterfactuals that put history 
back on track—were widely employed and were tightly 
linked to the respondent’s abstract orientation toward 
world politics (average r = 0.54 with the covering-law 
scales). The third defense—challenge the mutability of 
historical antecedents—was markedly less linked to 
abstract political orientation (average r = 0.29). There 
is no compelling reason one’s theoretical position on 
the robustness of nuclear deterrence should predict 
whether one believes Stalin could have survived the 
cerebral hemorrhage of March 1953 or whether Cuba 
could have been cloudier on an October day in 1962. 
The plausibility of most antecedents hinges on specific 
facts tied to particular times, places, and events; if 
anything, it is disconcerting that abstract orientation 
predicts so much variance—8% to 10%—in judgments 
of the mutability of antecedents. 
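
As a rough arithmetic check, and assuming the variance figures are squared correlations (our reading, not stated in the text), the average correlations reported above translate into shares of explained variance as follows:

```latex
r^2 = (0.29)^2 \approx 0.084 \quad (\text{about } 8\%), \qquad
r^2 = (0.54)^2 \approx 0.29 \quad (\text{about } 29\%)
```

On this reading, the quoted range of 8% to 10% corresponds to correlations of roughly 0.28 to 0.32.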

Table 1 summarizes ordinary-least-squares multiple 
regression results for a composite dependent variable 
that additively combines the two most correlated resis- 
tance strategies, challenge connecting principles and 
invoke second-order counterfactuals. These regressions 
consistently show that the more committed the 
scholars were to a school of thought, the more dismis- 
sive they were of counterfactuals that undercut histor- 
ical applications of favorite explanatory generaliza- 
tions. The more experts believed in the reliability of the 
balancing hypothesis, the more they rejected the sce- 
narios that implied Germany, with slightly sounder 
decision making, could have emerged victorious in 
either of the two world wars and achieved continental 
hegemony. The more experts believed in the robust- 
ness of nuclear deterrence, the more dismissive they 
were of close-calls that implied the United States and 
the Soviet Union easily could have slipped into nuclear 
conflict during the Cold War. The more experts be- 
lieved in the superiority of Western institutions, the 
less use they had for counterfactuals that implied other 
civilizations, with minor twists of fate, could have been 
contenders for geopolitical dominance. 
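
The kind of regression summarized in Table 1 can be sketched in a few lines. The sketch below is an illustration under stated assumptions, not the authors' analysis: the data are simulated without noise from known coefficients so the fit can be checked, `resistance` stands in for the composite dependent variable, and the helper names (`solve`, `ols`, `design_row`) are hypothetical.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x

def ols(rows, y):
    """Ordinary least squares via the normal equations (X'X) b = X'y."""
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    return solve(xtx, xty)

def design_row(belief, style):
    """Intercept, two main effects, and their product (the interaction term)."""
    return [1.0, belief, style, belief * style]

# Hypothetical coefficients and a hypothetical grid of nine-point scale scores.
true_b = [1.0, 0.5, 0.3, 0.2]
experts = [(b, s) for b in (1, 3, 5, 7, 9) for s in (2, 4, 6, 8)]
X = [design_row(b, s) for b, s in experts]
resistance = [sum(c * v for c, v in zip(true_b, row)) for row in X]

coef = ols(X, resistance)
print([round(c, 3) for c in coef])
```

A positive coefficient on the product term is what an interaction of the predicted form would look like in such a model; with real data one would also want standard errors, which this sketch omits.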

Table 1 also shows that, as predicted, the interaction 
between theoretical belief and cognitive style emerged in 
all three domains of application: Experts who most 
valued explanatory closure and parsimony mounted the 
stiffest resistance to dissonant scenarios. The interac- 
tion cannot, moreover, be attributed to a differential 
restriction of range. Hartley’s Fmax test for heteroge- 
neity of variance revealed only minimal differences in 
the variance of each theoretical belief scale among low 
versus high scorers on the cognitive-style scales (medi- 
an split) and equally negligible differences in variation 
on the cognitive-style scale between low and high 
scorers on the three theoretical belief scales (no p value 
fell below .20). Differential reliability of measures also 
was not a factor. Cronbach alphas for the three theo- 
retical belief scales were all in the relatively narrow 
range of 0.70 to 0.80 for both low and high scorers on 
the cognitive-style scale. 
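
The two checks described above can be sketched as follows, using the standard definitions of Hartley's Fmax (largest group variance divided by smallest) and Cronbach's alpha. The function names and the small data sets are hypothetical, and the sketch does not compute the p values mentioned in the text.

```python
from statistics import median, variance

def cronbach_alpha(items):
    """items: list of per-item score lists, one score per respondent."""
    k = len(items)
    totals = [sum(scores) for scores in zip(*items)]
    item_variance = sum(variance(scores) for scores in items)
    return k / (k - 1) * (1 - item_variance / variance(totals))

def hartley_fmax(groups):
    """Ratio of the largest to the smallest sample variance across groups."""
    variances = [variance(g) for g in groups]
    return max(variances) / min(variances)

def median_split(values, by):
    """Split `values` into low and high groups at the median of `by`."""
    cut = median(by)
    low = [v for v, b in zip(values, by) if b <= cut]
    high = [v for v, b in zip(values, by) if b > cut]
    return low, high

# Hypothetical scores for six experts.
belief = [3, 7, 5, 8, 4, 6]   # theoretical belief scale
style = [1, 9, 2, 8, 3, 7]    # cognitive-style scale

low, high = median_split(belief, by=style)
fmax = hartley_fmax([low, high])

# Hypothetical three-item belief scale answered by four respondents.
alpha = cronbach_alpha([[2, 4, 6, 8], [3, 5, 5, 9], [1, 4, 7, 8]])
print(fmax, round(alpha, 2))
```

An Fmax close to 1 indicates similar spread of belief scores among low and high scorers on cognitive style, which is the pattern the text reports.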


Discussion 


Respondents with a high need for closure were not 
content just to claim that they had the right explana- 
tions of the past. They insisted that they “were not 
almost wrong,” and that the historical outcomes they 
felt they had explained either were difficult to “undo” 
or were not undoable for long. As soon as one causal 
pathway to the theoretically predicted outcome was cut 
off, another pathway arose, hydra-like, in a second- 
order counterfactual (Tetlock 1998). 

These findings shed new light on some old observa- 
tions: It is easy to extract lessons from history that 
reinforce our ideological or theoretical stance (Jervis 
1976; Neustadt and May 1986), and it is difficult to 
avoid the temptation of selection bias and theoretically 
self-serving reasoning in defense of that stance (Lustick 
1996). The data also put in new psychological perspec- 
tive some old epistemological and methodological con- 
troversies. The patterns of counterfactual inference 
documented here can be viewed as integral parts of the 
cognitive mechanisms that produce selection bias in 
the use of historiography. Theoretically committed 
observers feel justified in giving short shrift to historical 
accounts that dwell on inconsequential contingencies
and frivolous what-if scenarios. 

The cognitive patterns documented here also en- 
courage the conceptual stretching of theories that 
some scholars identify as signs of a degenerative re- 
search program (Vasquez 1997). Stretching can be 
facilitated either by dismissing dissonant scenarios 
(“my theory already explains x, so do not bother me 
with counterfactuals that imply I was almost wrong”) or 
by embracing consonant close-calls (“my theory is 
admittedly hard-pressed to explain x, but ~x predicted 
by my theory almost occurred, so my theory should get 
some credit”). These arguments raise a fundamental 
challenge to the discipline: How can we avoid becom- 
ing prisoners of our preconceptions, trapped in a 
self-serving cycle of filling in the missing counterfactual 
control conditions of history with theory-scripted scenarios,
and then deploying that same theory-driven
content, in other contexts, to justify the theory itself? 


EXPERIMENTAL STUDIES: THE POWER OF 
COUNTERFACTUALS TO CHANGE MINDS 


Experiment 1: The Cuban Missile Crisis 


Guiding Theory. Although experts tend to dismiss 
dissonant close-call counterfactuals, hefty regression 
coefficients do not preclude the possibility that these 
scenarios have some measurable effect on their judg- 
ment of historical contingency. Indeed, the psycholog- 
ical literature suggests that causality operates in 
precisely this direction. Laboratory experiments repeatedly
find support for the prediction, derived from
Tversky and Kahneman’s (1983) classic work on the
availability and simulation heuristics, that merely imagining
multievent scenarios increases the perceived likelihood
of the component events (Koehler 1991). Scenarios
tend to be mentally sticky: Once they have been run 
through our “mental software,” they leave psychologi- 
cal traces in the form of images, thoughts, and emo- 
tions that can shape a host of subsequent causal- 
propensity and subjective-probability judgments. 

The literature on heuristics and biases also warns us 
to expect systematic logical inconsistencies when peo- 
ple judge complex event sequences that require inte- 
grating two or more probabilistic linkages. The text- 
book illustration is the conjunction fallacy (Tversky 
and Kahneman 1983). Imagine that one randomly 
constituted group is asked to judge the likelihood of a 
plausible conjunction of events, such as an earthquake 
that ruptures a dam, which in turn causes a flood that 
kills more than 500 people in California. Imagine that 
another randomly constituted group is asked to judge 
the likelihood of a flood (produced by any cause) 
killing more than 500 people in California. The likeli- 
hood judgments of the former group typically will exceed 
those of the latter group by a substantial margin, even
though the former group is judging a subset of the class 
of outcomes being judged by the latter group. 
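
The normative benchmark that these judgments violate can be stated in one line. As a sketch in notation introduced here for illustration (E for the earthquake that ruptures the dam, F for a flood killing more than 500 people; neither symbol appears in the original studies), the conjunction rule requires:

```latex
P(E \cap F) \;=\; P(F)\,P(E \mid F) \;\le\; P(F), \qquad \text{because } P(E \mid F) \le 1 .
```

A pair of group judgments in which the earthquake-plus-flood scenario is rated more likely than the flood alone therefore cannot be jointly coherent.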

Building on this work, Tversky and Fox’s (1995) 
support theory predicts systematic violations of the 
extensionality principle of probability theory. The principle
requires that if two sets of events have identical
coverage, then they must have identical probabilities. 
Psycho-logic trumps logic here because most people 
can mobilize mental support more readily for highly 
specific possibilities than they can for the abstract sets 
that subsume these possibilities. As a result, people 
often judge the likelihood of an entire set of possibilities,
such as any team from a given league
winning the championship, to be substantially less
than the sum of the likelihood values of that set’s
exclusive and exhaustive components (the probabilities
of wins for the individual teams that make up the league).
In effect, people judge the whole to be less than the 
sum of its parts and give quite different answers to 
logically equivalent versions of the same question. 


Hypotheses. Drawing on the literature on heuristics
and biases as well as the work on cognitive styles, we 
designed Experiment 1 to test two hypotheses. First, 
thinking about counterfactual scenarios (that pass 
some minimum plausibility threshold) should tend, on 
average, to increase the perception that those scenarios 
once had the potential to materialize and may even 
once have been more likely than the concatenation of 
events that actually materialized. Linking this predic- 
tion to research on cognitive style, we also expect that 
the effect should be more pronounced among respon- 
dents with low need for closure. 

Second, Tetlock (n.d.) shows that there are two 
logically but not psychologically equivalent methods 
for scaling experts’ perceptions of historical contin- 
gency. One imposes a factual framing on the historical 
question and solicits inevitability-curve judgments. For 
example, in Experiment 1, experts on the Cuban mis- 
sile crisis were asked at what point some form of 
peaceful resolution became inevitable. They then were 
asked to trace how the subjective probability of that 
class of outcomes waxed or waned in the preceding 
days. The other method imposes a “counterfactual” 
framing on the historical question and solicits impos- 
sibility-curve judgments. In Experiment 1, for example, 
experts also were asked at what point they believed all
alternative, more violent endings of the crisis became 
impossible and then were asked to trace how the 
subjective likelihood of that class of outcomes waxed or 
waned in the preceding days. 

It was not expected that experts would be blatantly 
inconsistent: Their judgments of the retrospective like- 
lihood of some form of peaceful outcome between 
October 16 and 29, 1962, should generally mirror their 
judgments of the retrospective likelihood of alterna- 
tive, more violent, outcomes when those judgments are 
obtained back to back from the same respondents. 
Logic and psycho-logic should coincide when the principle
of binary complementarity is transparently at
stake, and experts can plainly see that they are assign- 
ing so much probability to both x and its complement 
that the sum will exceed 1.0. But logic and psycho-logic 
do not always coincide. Factual framings of historical 
questions effectively invite experts to engage in hypoth- 
esis-confirming searches for potent causal candidates 
that create an inexorable historical momentum toward
outcome x. Analysts feel that they have answered the 
question when they have convinced themselves that x 
had to happen approximately when and in the manner 
it did. 

By contrast, counterfactual framing of historical 
questions effectively invites analysts to look long and 
hard for causal candidates that have the potential to 
reroute events down radically different event paths. 
Accordingly, we expect systematic anomalies in retro- 
spective likelihood judgments when we compare the 
judgments of two groups of experts, one of which 
completed the inevitability curve exercise and the other 
of which completed the logically redundant impossibil- 
ity curve exercise, but neither of which had yet seen or 
worked through the other group’s exercise. 

We made two “anomaly” predictions. First, system- 
atic violations of binary complementarity should arise 
at pre-inevitability and pre-impossibility dates. When 
we add the subjective probabilities assigned to peace by 
experts first asked to respond to inevitability curves and 
the subjective probabilities assigned to war by experts 
first asked to respond to impossibility curves, the sums 
will consistently exceed 1.0. Second, there will be a 
twilight-zone period during which experts who first 
complete inevitability curves will deem peace inevita- 
ble, but experts who first complete impossibility curves 
will judge war still to be possible. The rationale for the
between-group nature of the comparisons is worth 
stating explicitly because it underscores the critical 
advantages of experimentation in this context. Given 
that the experimental groups were constituted by
random assignment and hence should not differ sys- 
tematically in preexisting attitudes, there is no meth- 
odological reason to expect systematically different 
responses to the logically equivalent inevitability- and 
impossibility-curve questions. Across conditions, the 
error variance in responses should be normally distrib- 
uted around the same “true” population mean of respon- 
dents’ beliefs about the likelihood of peace or war. 
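
The first anomaly prediction amounts to a simple between-group check, which can be sketched as follows. This is a minimal illustration under stated assumptions: the function name, the data layout (one list of expert judgments per date), and the numbers are hypothetical and are not the study's data.

```python
from statistics import mean

def between_group_sums(peace_by_date, war_by_date):
    """Add mean P(peace) from one group to mean P(war) from the other, date by date.

    peace_by_date[d]: judgments of peace on date d from experts who completed
                      the inevitability curve first (hypothetical layout).
    war_by_date[d]:   judgments of war on date d from experts who completed
                      the impossibility curve first.
    """
    return [mean(p) + mean(w) for p, w in zip(peace_by_date, war_by_date)]

# Made-up judgments for two dates and two experts per group (illustration only).
peace_first = [[0.6, 0.7], [0.8, 0.9]]
war_first = [[0.5, 0.6], [0.3, 0.4]]

sums = between_group_sums(peace_first, war_first)
# Binary complementarity holds on a date only if the sum equals 1.0.
exceeds_one = [s > 1.0 for s in sums]
print(sums, exceeds_one)
```

If the two framings were psychologically as well as logically equivalent, the sums would scatter around 1.0 rather than sit consistently above it.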


Methods and Measures. Pilot groups for Experiments 1
and 2 were informally drawn from faculty at two large 
American universities. Respondents for the actual 
treatment were then randomly selected from the mem- 
bership lists of divisions 18 and 19 of the APSA, the 
Society for Military Historians, and the Society for 
Historians of American Foreign Relations. All respon- 
dents were contacted by mail and were promised 
complete anonymity and detailed feedback on the 
purposes of the survey. The response rate was 26%. 
Experiment 1 randomly assigned the 76 participants 
to one of three conditions. First, in the control condi- 
tion, respondents (n = 30) were asked (1) when some 
form of peaceful resolution of the Cuban missile crisis 
became inevitable and, having identified a point of no 
return, to estimate the likelihood of a peaceful resolu- 
tion for each preceding day of the crisis (thereby 
creating inevitability curves). (2) They were also asked 
when all alternative (more violent) endings became 
impossible and, having identified an “impossibility”
date, to estimate the likelihood of those alternative 
endings on each preceding day (thereby creating im- 
possibility curves). 

Second, in the moderate-salience condition, before 
making retrospective likelihood judgments, respon- 
dents (n = 23) judged the plausibility of three close- 
call scenarios. (1) “If Kennedy had heeded his more 
hawkish advisors in the initial meetings of October 16, 
there would have been an American air strike against 
Soviet missile bases in Cuba, and possibly a follow-up 
invasion of Cuba.” (2) “If at least one Soviet ship either 
did not receive orders to stop before the blockade line 
(or, for some reason, disobeyed orders), there would 
have been a naval clash between American and Soviet 
forces in the Atlantic that would have resulted in 
military casualties, raising the possibility of tit-for-tat 
escalation.” (3) “If, in the aftermath of the shooting 
down of a U.S. reconnaissance plane over Cuba on 
October 27, Kennedy had agreed to implement his
standing order to carry out retaliatory air strikes 
against Soviet SAM (surface to air missile) sites in 
Cuba that shot down U.S. aircraft, then the U.S. Air 
Force would have attacked Soviet antiaircraft installa- 
tions, which might have set off tit-for-tat escalation.” 
As in the correlational studies, respondents made three 
judgments of each scenario on nine-point scales: the 
ease of imagining that antecedent could have occurred; 
the likelihood of the hypothesized consequence if the 
antecedent had occurred; and the long-term effect on 
history if the hypothesized antecedent and conse- 
quence did occur. 

Third, in the high-salience condition, respondents 
(n = 23) not only considered the three aforemen- 
tioned situations but also judged a series of nine 
additional what-if scenarios that reinforced the ante- 
cedents in each of the three close calls. For example, 
counterfactual arguments 1, 2, and 3 reinforced the 
plausibility of the antecedents in the fourth counter- 
factual. (1) “If there had not been someone with the 
intellectual stature and credibility of Secretary of De- 
fense McNamara to make a credible case for caution, 
then Kennedy would have followed the advice of his 
more hawkish advisors.” (2) “If one of the newspapers 
to whom Kennedy had confided details of the Soviet 
placement of missiles in Cuba had leaked the story, 
there would have been irresistible public pressure on 
Kennedy to follow the advice of his more hawkish 
advisors.” (3) “If Kennedy had believed that the 
United States Air Force could knock out all of the 
Soviet missiles in a single strike (with no need for a 
follow-up land invasion), he would have followed the 
advice of his more hawkish advisors.” (4) “If Kennedy 
had followed the advice of his more hawkish advisors in 
the initial meetings of October 16, there would have 
been an American air strike against Soviet missile 
bases in Cuba, and possibly a follow-up invasion of 
Cuba.” The full text and set-up for the presentation of 
the antecedent-bolstering arguments is available from 
the authors on request. 
Retrospective Perceptions of Inevitability and Impossibility.
The order of administration of these questions
was always counterbalanced. The inevitability-curve 
exercise instructions were as follows. 


Let’s define the crisis as having ended when, on October 
29, Kennedy communicated to the Soviet leadership his 
agreement with Khrushchev’s radio message of October 
28. At that juncture, we could say that some form of 
peaceful resolution was a certainty—a subjective probabil- 
ity of 1.0. Going backward in time, day by day, from 
October 29 to October 16, trace on the graph your 
perceptions of how the likelihood of a peaceful resolution 
rose or fell during the 14 critical days of the crisis. If you 
think the U.S. and U.S.S.R. never came close to a military 
clash between October 16 and 29, then express that view by 
assigning consistently high probabilities to a peaceful 
resolution across all dates (indeed, as high as certainty, 1.0, 
if you wish). If you think the superpowers were very close 
to a military conflict throughout the crisis, then assign 
consistently low probabilities to a peaceful resolution 
across all dates. Finally, if you think the likelihood of a 
peaceful resolution waxed and waned day to day, then 
assign probabilities that rise or fall in accord with your 
intuitions about how close the U.S. and U.S.S.R. came to 
a military clash at various junctures. To start, we have set 
the subjective probability of peace at 1.0 (certainty) for 
October 29, marking the end of the crisis. 


The impossibility-curve instructions were similar,
except that the starting point was the subjective prob- 
ability of 0.0 assigned to October 29 to signify that 
alternative, more violent outcomes had become impos- 
sible. Experts were then asked to go backward in time, 
day by day, from October 29 to October 16, and trace 
on the graph their perceptions of how the likelihood of 
those more violent outcomes waxed and waned. 


Findings. The initial analyses involved a 3 × 2 × 13
fixed-effects, unweighted-means analysis of variance 
that crossed three levels of the between-subjects exper- 
imental manipulation (control, moderate, and high 
salience), two levels of the individual-difference classi- 
fication variable (low versus high need for closure), and 
thirteen levels of the repeated-measures factor that 
corresponded to the days of the crisis. Contrary to 
expectation, the moderate and high conditions did not 
differ on either inevitability or impossibility curves 
(both F's < 1). We attribute this null result to a 
methodological shortcoming: Respondents reported 
being rather overwhelmed by the number of judgments 
required in the high-salience condition, and fatigue 
may have attenuated any further effect that exposure to 
additional counterfactual scenarios might have had. 
To simplify analysis, therefore, we collapsed the 
moderate and high groups into a single salient condition.
Follow-up analyses, now taking the form of a 2 ×
2 × 13 analysis of variance, revealed the predicted
second-order interaction: Inevitability curves rose 
more slowly over time among those with lower need for 
closure assigned to the salient condition, F (12, 
908) = 6.74, p < .01. The predicted mirror-image 
second-order interaction emerged on the impossibility 
curves, F (12, 908) = 5.33, p < .01, which is not
surprising, given that the measures were highly correlated,
r = 0.76. Figures 1 and 2 clearly show that the
distinctive functional forms of the inevitability and 
impossibility curves of low-need-closure respondents in 
the salient condition drive both interactions. 

As expected, within-subjects comparisons reveal that 
when experts completed an inevitability curve and 
immediately thereafter an impossibility curve—that is, 
when binary complementarity was transparently at 
stake—subjective probabilities of peace and war 
summed to approximately 1.0 (X̄ = 1.04). Systematic
violations of binary complementarity emerged, how- 
ever, when we made more subtle between-group com- 
parisons. For instance, when we add the subjective 
probability of peace assigned by experts who first 
completed inevitability curves to the subjective proba- 
bility of war assigned by experts who first completed 
impossibility curves, the average sum across dates is 
1.19. This value is significantly different from what we 
obtain by adding the probability of war and peace 
judgments of the two groups of experts who completed 
their inevitability or impossibility curves in the second 
position: The average sum across dates = 0.90 (F (1,
71) = 10.32, p < .01). There was, however, no
evidence for the twilight-zone-period hypothesis that 
the experts who responded first to either inevitability or 
impossibility curves could be “lured” into assigning 
probability values that implied the existence of a period 
during which peace was inevitable (1.0) but war had 
not yet become impossible (0.0): X̄ impossibility date
of war = October 27.5 and X̄ inevitability date of
peace = October 26.9, F (1, 71) = 2.68, p < .15.


Experiment 2: Unpacking Alternative 
Outcomes of the Cuban Missile Crisis 


Skeptics can argue that in Experiment 1 respondents 
were confronted with an elaborate battery of mutually 
reinforcing counterfactuals that made alternative his- 
tories unfairly vivid and left little room for determin- 
istic rejoinders. It also can be argued that norms of 
politeness made experts reluctant to dismiss all the 
researchers’ what-if scenarios as errant nonsense. Ex- 
periment 2 eliminates both objections by shifting the 
spotlight to the power of entirely self-generated coun- 
terfactual scenarios to alter perceptions of historical 
contingency. 


Guiding Theory. Consider again forecasts of which 
league, division, or team will win a sports champion- 
ship. Tversky and Fox (1995) demonstrate that the 
subjective probabilities people assign to binary comple- 
ments at the league level (East vs. West) generally sum 
to 1.0, but the subjective probabilities assigned to 
progressively more detailed or unpacked outcomes— 
the prospects of divisions within leagues and teams 
within divisions—typically exceed 1.0 and occasionally 
even 2.0. Forecasters find it easier to generate evidential
support for a particular team winning than for a set of
several different teams winning.

In support theory, it is the ease with which these
reasons come to mind, their availability, that determines
the subjective feeling of support for, and subjective
probability of, outcomes. The result can be massive
“subadditivity.” The cumulative probabilities assigned 
to the exhaustive and exclusive components of the 
whole set exceed 1.0, which violates the extensionality 
axiom of probability theory. If people were to back up 
their unpacked bets with actual money, they would be 
quickly transformed into money pumps. It is, after all, 
logically impossible for each of four teams within an 
eight-team division to have a 0.4 chance of winning the 
championship the same year. 
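
The money-pump arithmetic behind this example can be made explicit. The sketch below assumes a hypothetical bettor who treats each stated probability as a fair price for a ticket paying 1 unit if that team wins; the function name and the ticket framing are introduced here only for illustration.

```python
def minimum_sure_loss(prices, payout=1.0):
    """Smallest possible loss from buying one ticket on each of several
    mutually exclusive outcomes at the given prices.

    At most one ticket can pay off, so receipts never exceed `payout`.
    A positive result means the bettor loses money whatever happens.
    """
    return sum(prices) - payout

# Four teams, each judged to have a 0.4 chance of winning the same title.
loss = minimum_sure_loss([0.4, 0.4, 0.4, 0.4])
print(round(loss, 2))  # 0.6
```

The bettor pays 1.6 units for tickets that can return at most 1 unit, a guaranteed loss of at least 0.6 units.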

Unpacking manipulations are understandably 
viewed as sources of cognitive bias in subjective prob- 
ability judgments of possible futures. They stimulate 
people to find too much support for too many possi- 
bilities. Yet, such manipulations may help reduce bias 
in subjective probability judgments of possible pasts via 
exactly the same mechanism. The key difference is that 
judgments of possible pasts, unlike those of possible 
futures, are already contaminated by the powerful 
certainty of hindsight. Experimental work shows that as 
soon as people learn which of a number of once- 
deemed possible outcomes happened, they quickly 
assimilate that knowledge into their cognitive structure 
and have a hard time recapturing their ex ante state of 
uncertainty (Hawkins and Hastie 1990). Mental exer- 
cises that involve unpacking sets of possible pasts 
should have the net effect of checking the hindsight
bias by bringing back to psychological life counterfac- 
tual possibilities that people long ago buried with 
deterministic “I-knew-it-had-to-be” thinking. 


Hypotheses. Drawing on support theory, we hypothe- 
size that experts who are encouraged to unpack the set 
of more violent endings of the Cuban missile crisis into 
progressively more differentiated subsets will find sup- 
port for those alternative outcomes. As a result, their 
inevitability curves will rise more slowly and their 
impossibility curves will fall less rapidly than those of 
experts who judge the entire set of possibilities as a 
whole. It is also expected that experts in the unpacking 
condition, especially those with low need for closure, 
will display stronger subadditivity effects (cumulative 
subjective probabilities exceeding 1.0) than the holistic 
group. 

Research Design, Method, and Logic of Analysis. The 64 
respondents in Experiment 2 were drawn from the 
same subject population as Experiment 1 and recruited 
in the same mail survey. Respondents were randomly 
assigned to one of two groups. The control group (n =
30) simply responded to the perceptions-of-inevitability
and perceptions-of-impossibility items, as in Experiment 1.
The other group (n = 34) was asked to
consider (1) how the set of more violent endings of the
Cuban missile crisis could be disaggregated into sub- 
sets in which violence remained localized or spread 
outside the Caribbean, (2) in turn differentiated into 
subsets in which violence claimed fewer or more than 
100 casualties, and (3) for the higher casualty scenario, 
still more differentiated into a conflict either limited to 
conventional weaponry or extending to nuclear. Re- 
spondents generated impossibility curves for each of 
the six specific subsets of more violent scenarios as well 
as a single inevitability curve for the overall set of 
peaceful outcomes. 
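
One reading of this design, offered here as an assumption rather than as the authors' stated coding scheme, is that the six subsets arise from crossing two levels of geographic scope with three levels of severity (the higher-casualty branch being split by weaponry). The labels below are paraphrases introduced for illustration.

```python
from itertools import product

# Two levels of geographic scope (paraphrased labels).
SCOPE = ("localized", "spread outside the Caribbean")

# Three levels of severity: the higher-casualty branch is split by weaponry.
SEVERITY = (
    "fewer than 100 casualties",
    "more than 100 casualties, conventional weapons only",
    "more than 100 casualties, extending to nuclear weapons",
)

subsets = [f"{scope}; {severity}" for scope, severity in product(SCOPE, SEVERITY)]
for label in subsets:
    print(label)
```

Under this reading, each respondent in the unpacking group would draw one impossibility curve per printed label.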


Findings. The results again reveal that how we pose
historical questions shapes how we answer them. Fig- 
ure 3 illustrates the power of unpacking questions. The 
shaded area represents the cumulative increase in the 
subjective probability that experts believe counterfac- 
tual alternatives once possessed, an increase that was 
produced by asking experts to generate impossibility 
curves not for the abstract set of more violent out- 
comes (lower curve) but for each of the six specific 
subsets of those outcomes (upper curve). The analysis 
of variance took the form of a fixed-effects, unweighted 
means 2 (control versus unpacking) × 2 (low versus
high need for closure) × 13 (days of crisis) design.
Consider the impossibility-curve dependent variable. 
(Inevitability-curve results were again highly correlated,
r = .71, and largely redundant for these hypothesis-testing
purposes.) Analysis revealed the predicted
main effects for unpacking (F (1, 58) = 7.89, p < 
.05) and need for closure (F (1, 58) = 5.05, p < .05), 
as well as the expected tendency for the impossibility 
curve of respondents with low need for closure to fall 
more slowly than that of high-need respondents in the 
unpacking condition (F (1, 58) = 4.35, p < .05). In 
addition, two unexpected tendencies emerged: Un- 
packing effects diminished toward the end of the crisis 
(F (12, 718) = 7.31, p < .05), as did differences 
between low- and high-closure respondents (F (12, 
718) = 5.02, p < .05). Experts, even low-need- 
closure experts unpacking possibilities, saw less and 
less wiggle room for rewriting history as the end 
approached. 

There was also support for the hypothesis that 
low-closure experts in the unpacking condition will 
exhibit the strongest subadditivity effects (probability 
judgments of exhaustive and exclusive sets of possibil- 
ities summing to more than 1.0). Averaged across 
dates, their combined inevitability and impossibility 
judgments summed to 1.38, which was significantly 
greater than the sum for low-closure experts in the 
control group (X̄ = 1.12) or for high-closure experts in
either the unpacking condition (X̄ = 1.18) or control
group (X̄ = 1.04) (F (1, 58) = 9.89, p < .05). Again,
there was little support for the twilight-zone-period 
hypothesis. The longest time during which experts 
judged peace inevitable (X̄ inevitability date = Oct.
27.2) but war not yet impossible (X̄ impossibility
date = Oct. 28.1) emerged in judgments within the
unpacking condition, and even this difference fell short
of statistical significance (F (1, 58) = 3.03, p < .10).

The curve-fitting results also underscore the power 
of counterfactual thought experiments to transform 
our understanding of the past. Simple linear equations 
capture large proportions of the variance in retrospec- 
tive-likelihood judgments of the undifferentiated sets 
of peaceful outcomes (82%) and more violent alterna- 
tives (84%). The past appears to be a smooth linear 
progression toward the observed outcome. By contrast, 
the past looks more like a random walk, albeit around 
a discernible trend, from the perspective of low-closure 
experts who unpacked the set of more violent out- 
comes. A convoluted fourth-order polynomial equa- 
tion is necessary to explain the same proportion of 
variance in their retrospective likelihood judgments, a 
function that rises and falls at three junctures. 
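
The link between a fourth-order polynomial and three junctures follows from elementary calculus. As a sketch in notation introduced here (t for the date and a_0 through a_4 for fitted coefficients, none of which are reported in the text):

```latex
p(t) = a_4 t^4 + a_3 t^3 + a_2 t^2 + a_1 t + a_0,
\qquad
p'(t) = 4 a_4 t^3 + 3 a_3 t^2 + 2 a_2 t + a_1 .
```

Because p'(t) is a cubic, it has at most three real roots, so a fourth-order curve can change direction at most three times, whereas a straight line never does.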


FIGURE 3. Inevitability and Impossibility Curves from Experiment 2

The power of unpacking is also revealed by cross-condition
comparisons of correlations between theoretical
beliefs, such as the robustness of nuclear deterrence,
and reactions to close-call counterfactuals that
move the missile crisis toward war. The correlation is 
greater in the control condition than in the unpacking 
condition (r (28 df) = 0.61 versus r (32 df) = 0.27). This 
drop is consistent with the notion that, under unpack- 
ing, observers shift from a theory-driven, covering-law 
mode of thinking to a more idiographic, case-by-case 
mode. 


Experiment 3: Unmaking the West 


Guiding Theory. Scholars have long pondered how a 
small number of Europeans, working from the superficially
unpromising starting point of 1000 A.D. or 1200
A.D. or even 1400 A.D., managed in relatively few
centuries to surpass all other peoples on the planet in 
wealth and power. Not surprisingly, there is a wide 
range of opinion. At one pole are determinists, who 
view history as an efficient process of winnowing out 
maladaptive forms of social organization and who 
believe that the triumph of capitalism has long been in 
the cards. The key advantages of European polities
allegedly included more deeply rooted legal traditions 
of private property and individual rights, a religion that 
encouraged worldly achievement, and a fractious mul- 
tistate system that prevented any single power from 
dominating all others and halting innovation at the 
reactionary whim of its ruling elite (McNeill 1982). 

At the other pole are the antideterminists. To adapt 
Gould’s (1995) famous thought experiment, they be- 
lieve that if we could rerun world history thousands of 
times from the starting conditions that prevailed as 
recently as 1400 A.D., European dominance would be 
one of the least likely outcomes. These scholars decry 
“Eurocentric triumphalism” and depict the European 
achievement as a precarious one that easily could have 
unraveled at countless junctures. Other civilizations 
could have checked the West and perhaps even been 
contenders themselves but for accidents of disease, 
weather, bad leadership, and other miscellaneous 
slings and arrows of outrageous fortune. As our third 
correlational study suggests, the list of “could-have- 
been-a-contender” counterfactuals is long. South Asia 
and perhaps East Africa might have been colonized by 
an invincible Chinese armada in the fifteenth century if 
only there had been more support in the imperial court 
for technological innovation and territorial expansion. 
Europe might have been Islamicized in the eighth 
century if the Moors had cared to launch a serious 
invasion of France. If not for Genghis Khan dying in the
nick of time, European civilization might have been
devastated by Mongol armies in the thirteenth century. 

Within the antideterministic framework, thought 
experiments become exercises in ontological egalitari- 
anism, an effort to restore dignity to those whom 
history has eclipsed by elevating possible worlds to the 
same moral and metaphysical status as the actual world 
(Tetlock n.d.). Thought experiments are the only way 
left to even the score, an observation ironically remi- 
niscent of the Marxist historian E. H. Carr’s (1961) 
dismissal of anti-Bolsheviks as sore losers who, from
dreary exile, contemplated counterfactuals that undid 
the Russian Revolution. But now the gloaters, claiming 
historical vindication for their ideological principles, 
are on the Right, and the brooders, absorbed in wistful 
regret, are on the Left. 


Hypotheses. The hypotheses parallel those for Experiment 2,
except now the focal issue is not the Cuban
missile crisis but the rise of Western civilization to
global hegemony (a massively complex historical trans- 
formation that stretches over centuries, not days). 
Once again, unpacking is expected to inflate the per- 
ceived likelihood of counterfactual possibilities and to 
produce subadditivity effects, especially for respon- 
dents with low need for closure. 


Research Design, Methods, and Measures. Experiment 
3 draws on the same respondents and uses the same 
mail survey as the third correlational study. The exper- 
iment has only two conditions. The no-unpacking 
control group (n = 27) generated inevitability curves 
for some form of Western geopolitical domination and 
impossibility curves for the set of all possible alternatives
to that domination (order counterbalanced). The
intensive unpacking group (n = 36) was first asked to
unpack the set of all possible alternatives to Western 
domination into progressively more detailed subsets. 
These began with classes of possible worlds in which no 
region achieved global hegemony (either because of a 
weaker Europe or stiffer resistance from outside Eu- 
rope) and moved on to classes of possible worlds in 
which a non-Western civilization achieved global hege- 
mony (China, Islam, the Mongols, or a less familiar 
alternative). Experts then completed inevitability and 
impossibility curves that began with 1000 A.D. and
moved by 50-year increments to 1850 A.D. (for which
the subjective probability of Western dominance was 
fixed at 1.0 and that of possible alternatives at 0.0). 


Findings. The shaded area in Figure 4 represents the 
cumulative increase in the subjective probability that 
experts believe counterfactual alternatives once pos- 
sessed, an increase that was produced by asking experts 
to generate impossibility curves not for the abstract set 
of alternatives to Western domination (lower curve) 
but for each of the six specific subsets of alternatives 
(upper curve). The analysis of variance took the form 
of a fixed-effects, unweighted means 2 (no unpacking 
versus unpacking conditions) X 2 (low versus high 
closure) X 17 (50-year increments between 1000 and 
1850) mixed-factorial design. It reveals a significant 
unpacking effect (F (1, 58) = 6.77, p < .05) and a
significant interaction between unpacking and cogni- 
tive style: The differences between low- and high- 
closure respondents grew more pronounced when they 
were asked to perform the unpacking exercise (F (1, 
58) = 4.88, p < .05). The same two unexpected but 
readily interpretable tendencies emerged as in Experiment 2.
That is, as dates approached the end of the
time series, now 1800, experts who unpacked alterna- 
tives to Western dominance saw less potential for 
rewriting history (F (16, 1018) = 4.88, p < .01), and 
the gap between low- and high-closure respondents 
narrowed (F (16, 1018) = 4.02, p < .05). Again, 
observers saw less likelihood for counterfactuals altering
outcomes toward the end of the specified historical
process. 

Also as in Experiment 2, low-closure experts in the 
unpacking condition were most likely to assign subjec- 
tive probabilities that were subadditive, violated exten- 
sionality, and summed to well above 1.0. Averaged 
across dates, their judgments about inevitability and 
impossibility summed to 1.41, which was significantly 
greater than the sum for low-closure experts in the 
control group (X = 1.09) or for high-closure experts in 
the control group (X = 1.03) or unpacking condition 
(X = 1.21) (F (1, 58) = 4.67, p < .05). A
twilight-zone period also emerged; in contrast to the 
temporally compacted missile crisis, there is now a 
sufficient range in responses to permit significant ef- 
fects. Focusing just on the control condition, we com- 
pared the average date for the inevitability of Western 
dominance to the average date for the impossibility of 
alternatives to Western dominance. The result was a 
twilight-zone period bounded by the inevitability date 
FIGURE 4. Inevitability and Impossibility Curves from Experiment 3
of 1731 and the impossibility date of 1749 (F (1, 28) =
4.21, p < .05). This period for the unpacking condi- 
tion was bounded, respectively, by 1751 and 1787, 
which not only is a significant difference in itself (F (1, 
32) = 8.43, p < .01) but also is significantly longer 
(F (1, 58) = 4.36, p < .05).
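
To make the two summary measures concrete, the following minimal Python sketch computes, for a single hypothetical respondent, the per-date sum of the inevitability and impossibility judgments and the bounds of the twilight-zone period. The probabilities, the function names, and the rule that a date counts as the inevitability (impossibility) date when the relevant curve first reaches 1.0 (0.0) are illustrative assumptions, not the study's data or scoring procedure.

```python
from statistics import mean

# Hypothetical judgments for one respondent (not data from the study).
# inevitability[d]: probability that Western dominance was inevitable by date d.
# alternatives[d]:  probability that some alternative was still possible at d.
inevitability = {1650: 0.6, 1700: 0.8, 1750: 1.0, 1800: 1.0, 1850: 1.0}
alternatives = {1650: 0.5, 1700: 0.4, 1750: 0.2, 1800: 0.0, 1850: 0.0}


def additivity_sums(outcome, others):
    """Return the per-date sum of the two complementary judgments.

    Coherent judgments sum to 1.0 at every date; a sum above 1.0 means
    the outcome and its complement jointly received too much weight.
    """
    return {d: outcome[d] + others[d] for d in sorted(outcome)}


def first_date(curve, reached):
    """Return the earliest date whose value satisfies `reached`, or None."""
    for d in sorted(curve):
        if reached(curve[d]):
            return d
    return None


def twilight_zone(outcome, others):
    """Assumed scoring rule: (inevitability date, impossibility date)."""
    inevitable_from = first_date(outcome, lambda p: p >= 1.0)
    impossible_from = first_date(others, lambda p: p <= 0.0)
    return inevitable_from, impossible_from


sums = additivity_sums(inevitability, alternatives)
mean_sum = mean(sums.values())
zone = twilight_zone(inevitability, alternatives)

print(f"mean sum across dates: {mean_sum:.2f}")
print(f"twilight zone: {zone[0]} to {zone[1]}")
```

In this invented example the outcome is judged inevitable from 1750 while alternatives are not ruled out until 1800, so the two dates bound a 50-year interval in which the judgments are jointly incoherent.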

In terms of curve fitting, a fifth-order polynomial 
equation was necessary to capture 80% of the variance 
in the numerous ups and downs in average perceptions
of the likelihood of unpacked outcomes, whereas a 
simple linear equation sufficed for the undifferentiated 
set. Unpacking “messes up” our understanding of the 
past, transforming what had once been a smooth 
progression toward a preordained outcome into a far 
more erratic and unpredictable journey. Unpacking 
also undermines the power of abstract covering laws to 
constrain our perceptions of specific historical possibil- 
ities; correlations fell from 0.63 in the control condition 
to 0.25 in the unpacking condition.


GENERAL DISCUSSION 


Our results do not tell us who is right about this or that 
historical controversy. Proponents of the covering-law 
approach can argue that high correlations between 
abstract theoretical beliefs and specific opinions about 
what is possible at given times and places are fully
justified and that low-preference-for-parsimony experts
who fail to make tight conceptual connections
between the two levels are just confused and sloppy 
thinkers. As for the results of our experiments, these 
proponents can argue that unpacking manipulations 
simply lead historical observers into the inferential 
equivalent of wild goose chases that encourage them to 
assign far too much subjective probability to far too 
many scenarios. The end result is logical absurdities, 
such as “x is inevitable,” but “alternatives to x remain 
possible.” There is nothing admirably open-minded 
about incoherence. 

Critics of the covering-law approach can respond 
that the high correlations between abstract theoretical 
beliefs and specific historical ones are a warning sign of 
an excessively theory-driven style of thinking about 
history. Insofar as unpacking manipulations prove an 
effective method of opening closed minds by reminding 
theory-driven thinkers of how riddled with indetermi- 
nacy history is, then so much the better, and do not fret 
that unpacking induces internal inconsistencies in be- 
lief systems. Unpacking reveals previously hidden cog- 
nitive contradictions and sets a constructive agenda for 
rethinking. A foolish consistency, these critics might 
admonish us, is the hobgoblin of little minds. 

A cognitive approach to historical reasoning can 
never resolve the question of whether, in any given 
historical case, we have struck the right balance between
theory-driven and imagination-driven modes of
information processing. A cognitive approach can, 
however, enhance the quality of intellectual debate in 
four tangible ways. 

First, it identifies systematic individual differences in 
the relative importance that observers place on achiev- 
ing an integrative view of the past. Respondents with a 
preference for closure (who resemble the hedgehogs in 
Berlin’s famous taxonomy) tried to assimilate the past 
into their favored deductive system. Those with less 
need for closure (who resemble Berlin’s foxes) approached
the past in a frame of mind open to the
possibility that a potpourri of processes may have been 
involved, and an equally miscellaneous set of outcomes 
could have resulted. Moreover, these individual differ- 
ences cut across disciplinary boundaries, a point under- 
scored by the striking parallelism in the results of 
Experiment 2 (which drew mostly on political scien- 
tists) and Experiment 3 (which drew mostly on histo- 
rians). 

Second, a cognitive approach deepens our under- 
standing of how the framing of historical questions can 
shape where and how we look for answers. Logically 
equivalent factual and counterfactual framings of the 
same historical issue elicit contradictory probabilistic 
intuitions about what had to be and what might have 
been. Our experiments also demonstrate the impor- 
tance of the specificity of the question: The more 
detailed the unpacking of counterfactual alternatives,
the greater is the tendency for the whole class of
alternatives to be judged less likely than the sum of its
exclusive and exhaustive components (subadditivity). 
The challenge that these findings pose to rationality 
should not be understated. Unpacking effects violate a 
core assumption not only of Bayesian but also of all 
formal logical models of belief, namely, the extension- 
ality principle, which asserts that classes of events with 
the same content should be assigned the same proba- 
bility (Tversky and Fox 1995). 

Third, a cognitive approach suggests methods of 
pitting cognitive biases against each other, the mental 
equivalent of fighting fire with fire, by checking the 
excesses of convergent, theory-driven thinking with 
divergent imagination-driven thinking, and vice versa.
The preeminent theory-driven bias in historical reason- 
ing is, as many laboratory studies now attest, certainty 
of hindsight (Hawkins and Hastie 1990). Once people 
learn the outcome of an historical process—how the 
stock market finished the year or whether a crisis was 
resolved peacefully—they have difficulty recalling how 
uncertain they once were and exaggerate the degree to 
which “they knew it all along.” The most influential 
cognitive explanation of the hindsight effect attributes 
it to the “automaticity” of theory-driven thought, to the 
rapidity with which people assimilate known outcomes 
into their favorite covering laws. In the process, they 
demote once possible, even probable, futures to the 
status of implausible historical counterfactuals. 

Knowledge of past possibilities need not, however, 
be lost forever. The “debiasing” literature points to 
experimental interventions that can help people reconstruct
latent knowledge of historical junctures at which
they once thought events could have taken a different 
turn (Tetlock and Belkin 1996). The potential correc- 
tives draw on two well-replicated effects widely consid- 
ered judgmental biases themselves: the tendency to 
inflate the perceived likelihood of vivid, easily imag- 
ined, scenarios, coupled with the tendency to underes- 
timate how rapidly subjective probabilities should di- 
minish as a function of adding contingencies to 
scenarios (Koehler 1991; Tversky and Kahneman 
1983). The most effective manipulations for attenuat- 
ing certainty of hindsight make use of these effects by 
asking people to imagine, in florid detail, alternative 
paths that history could have taken. 

Fourth, a cognitive approach warns us not to go too 
far in the quest to open closed minds. Laboratory work 
on mental simulation shows that people can get carried 
away when they start imagining alternative “possible 
worlds” (Koehler 1991). To check runaway unpacking 
effects, people need plausibility pruners to cut off 
speculation that otherwise would grow like topsy be- 
yond the bounds of probability. These elimination 
criteria inevitably reflect covering-law beliefs about 
what counts as a plausible cause-effect linkage. Indeed, 
the cognitive approach also warns us not to mystify or 
romanticize imagination and treat it as somehow superior
to theory-driven thinking. Alice-in-Wonderland
scenarios aside, most imaginative thinking imports 
theory-driven schemes for understanding cause and 
effect in hypothetical worlds. The distinction between 
the two modes is a matter of degree: theory-driven 
cognition is convergent, relying on a single deductive 
framework to explain what actually happened, whereas 
imaginative cognition is divergent, drawing on a variety 
of frameworks to explore not only what happened but 
also what could have happened. 

If this account is correct, historical observers con- 
front a perplexing trade-off between theory-driven and 
imagination-driven modes of making sense of the past. 
Theory-driven strategies confer the benefits of explan- 
atory closure and parsimony but desensitize us to 
nuance, complexity, contingency, and the possibility 
our theory is wrong. Imagination-driven strategies sen- 
sitize us to possible worlds that could have been, but 
the price is increased confusion and even incoherence. 
The challenge is to strike a reasonable balance, a 
reflective equilibrium, between the conflicting intui- 
tions primed by factual framings of historical questions 
that activate narrowly theory-driven thinking and those 
primed by counterfactual framings that engage our 
imaginations and activate a wider range of theories. 

Absent compelling grounds for preferring one or the 
other mode of information processing, the prudent 
stance is to define good judgment as an iterative
process of checking convergent theory-driven thinking, 
which focuses on explaining why what was had to be, 
with divergent imagination-driven thinking, which focuses
on what could have been, and then putting the
whole procedure into reverse so that theory-driven 
thinking can be deployed to prevent us from spinning 
out of imaginative control. There will, of course, be no 
single, well-defined equilibrium solution. But it is reasonable
to expect that the process—as it unfolds not
just in the mind of one observer but in ostensibly 
self-correcting scholarly communities—will yield plau- 
sibility ranges of equilibrium solutions. These will be 
anchored at one end by those with the most hedgehog- 
like confidence in covering laws and at the other by 
those with the most fox-like curiosity about the paths 
events could have taken. This range of reflective- 
equilibrium solutions should be preferred to lopsided 
solutions that give priority to either theory-driven or 
imaginative modes of thinking. 


CONCLUSION 


Objectivity was the epistemological rock on which 
professional societies of historians and social scientists 
were founded in the late nineteenth century. The 
disciplinary mandate was to move closer, by successive 
approximations, toward the truth, a truth unadorned by 
apologetic or mocking quotation marks (Novick 1991). 
Well before century’s end, however, objectivity was 
under siege as doubts grew about positivism and about 
the feasibility of drawing sharp distinctions between 
observer and observed, between facts and values, and 
even between facts and fiction. Constructivist and 
reflexivist epistemologies, which depicted truth as per- 
spectival, not unitary, gained in influence. 

Our research provides conceptual ammunition to 
both constructivist opponents and neopositivist propo- 
nents of objectivity. Constructivists can take heart from 
our findings that theoretical preconceptions are potent 
moderators of what “observers” see as possible or 
impossible in event sequences. They can also point to 
repeated demonstrations that how we pose historical 
questions, even logically equivalent factual and coun- 
terfactual framings of the same question, shapes the 
conclusions we draw about what had to be or could 
have been. But positivists need not surrender. The 
discipline that provided the concepts and methods 
underlying the studies reported here—cognitive psy- 
chology—is itself a neopositivist enterprise. There are 
many precedents in the history of science in which the 
scientific method has highlighted potential biases and 
correctives in scientific practice. Defenders of “that 
noble dream,” objectivity, can quite defensibly argue 
that identifying judgmental biases is a prerequisite to 
correcting them through disciplined exercises in reflec- 
tive equilibrium. 
Outside Options and the Logic of Security Council Action
ERIK VOETEN George Washington University and Stanford University 


I examine if and how a superpower can use its asymmetric power to achieve favorable outcomes in
multilateral bargaining between states that have conflicting interests and veto power. Using a
game-theoretic framework, I show that the ability to act outside, either unilaterally or with an ally, helps
the superpower to reach agreements that would be vetoed in the absence of the outside option. These
agreements, however, are usually not at the superpower’s ideal point. Under some conditions, uncertainty
about the credibility of the outside option can lead to unilateral action that all actors prefer to avoid. In other


The number of military interventions authorized
by the United Nations Security Council (UNSC)
increased dramatically in the past decade. Observers
claim that the UNSC is finally, as envisioned by
the UN Charter, initiating emergency responses in 
crises that threaten international peace and stability 
(Fromuth 1993).1 The Kosovo crisis demonstrates, 
however, that the UNSC is not the sole international 
actor with the ability to intervene militarily in conflicts. 
After consecutive attempts to reach a compromise 
failed, the NATO allies circumvented the UNSC. 
Moreover, the United States has an unprecedented 
decisive preponderance in all underlying components 
of power (Wohlforth 1999), and unilateral American 
intervention often is a realistic possibility. The variety 
of multilateral institutions combined with the U.S. 
capability to act alone frequently creates situations in 
which the United States has a number of options for 
intervening in international conflicts. The United Na- 
tions is usually the first venue for coalition building 
(Baker 1995, 278), but if the member states cannot 
reach consensus, it need not be the sole recourse. 

I argue that asymmetric outside options profoundly 
affect the logic of UNSC action and multilateral action 
more generally. I develop and analyze a game-theoretic 
framework that examines the strategic problem of 
bargaining in an abstract multilateral institution modeled
after the UNSC. In the model, states have conflicting
preferences over outcomes, and a superpower
can pursue a costly outside option, either unilaterally
or with an ally. If this option is credible, it creates a
bargaining range that would not exist in the absence of 
outside options. Uncertainty about its credibility may 
dissolve this bargaining range, however, and lead to 
outside action that all actors would prefer to avoid. The 
thesis that it is helpful to have one dominant power
to generate multilateral actions is not new. Hegemonic 
stability theorists have long argued that a dominant 
state allows large-scale international cooperation to 
emerge (Kindleberger 1973; Krasner 1976), but the 
theory applies only if international cooperation is a 
public good. Here, I explain under what conditions a 
dominant state can use its outside power to satisfy its 
private interests through multilateral actions. 

This article also contributes to a more general 
critique of the institutionalist literature, which mostly 
treats international institutions as potential vehicles for 
overcoming problems of monitoring and enforcement. 
Largely neglected is the important role of international 
organizations as forums for bargaining to resolve dis- 
tributional conflicts between states (see Fearon 1998; 
Garrett 1992; Krasner 1991; Morrow 1994b). Institu- 
tionalists believe that power asymmetries are im- 
portant, but their consequences are rarely explicitly 
modeled. I integrate realist concerns about the conse- 
quences of power asymmetries with institutionalist 
concerns about incomplete information and Pareto- 
inferior outcomes (e.g., Axelrod and Keohane 1985; 
Martin 1992). 

Furthermore, my view of the use of power in deter- 
mining the outcome of distributional conflict differs 
from earlier accounts. In the dominant treatment of 
the subject, Krasner (1991) argues that relative capabilities
mostly matter in determining bargaining outcomes
along the Pareto frontier.2 In contrast, I find
that the asymmetric ability to use outside options helps 
the superpower establish the Pareto frontier but is of 
no use when bargaining over outcomes along the 


2 The Pareto frontier is the set of all possible outcomes that are 
Pareto optimal. This means that there is no outcome that is not on 
the Pareto frontier, which makes all players equally well off and at 
least one player better off than an outcome along the Pareto frontier. 
frontier. Instead, bargaining costs and the ability of
actors to make partial commitments determine the 
final outcome. Less powerful states may be able to gain 
considerable concessions from the superpower. Thus, 
the Security Council does not merely do the bidding of
the United States. The United States needs to make
compromises in order to achieve multilateral authori- 
zation of interventions. Therefore, interventions autho- 
rized by a multilateral institution such as the UNSC 
will differ from unilateral or bilateral action. 

The model accounts for several observable patterns 
in UNSC decision making hitherto unexplained. It 
accounts for the absence of vetoes in combination with 
a large number of abstentions from member states with 
veto power. Moreover, the model stipulates conditions 
under which to expect multilateral authorization for 
interventions. The role of international organizations 
in general and the United Nations in particular is 
largely neglected in the recent theoretical literature on 
global institutions (Abbott and Snidal 1998).3 There is 
some research on UNSC reform (e.g., Russett 1997) 
and specific episodes of UNSC decision making (e.g., 
Malone 1998). Winter (1996) and O’Neill (1996) for- 
mulate models that examine the relative power of veto 
players vis-a-vis nonveto players in the UNSC, but they 
do not explore how asymmetric capabilities among veto 
players may affect the bargaining process, and they do 
not explain if and when agreement can be reached. 

I begin by illustrating the ways in which the other 
permanent members of the UNSC have diverged from 
the U.S. position during the post-Cold War period, 
and I then derive the main assumptions about the 
constellation of preferences in the model. I explore the 
basic assumption that the United States often views 
UNSC authorization as desirable but not imperative. 
Next, I show that when the United States has a credible 
outside option, there exists a set of negotiated agree- 
ments that rational states would prefer to the outside 
action. I derive how allies, nonpermanent members, 
and additional veto players can affect the range of 
possible agreements. I then analyze the way states 
bargain over the different possible agreements along 
the Pareto frontier. Finally, I introduce incomplete
information to the model and show how the framework 
provides valuable insights into bargaining over the 
Kosovo intervention. 


THE OTHER PERMANENT FOUR: SOURCES 
OF DISAGREEMENT WITH THE UNITED 
STATES 


Since the end of the Cold War, UNSC action has not 
been routinely blocked by the exercise of vetoes. From 
1946 to 1990, 279 vetoes were issued on substantive 
issues. From 1990 to 1998 this happened only eight 
times, whereas the UNSC has adopted a growing 
number of resolutions that actually matter. Between 
1990 and 1998 the UNSC authorized 31 peacekeeping 





3 For example, in a recent book-length review of developments in the 
study of international institutions, the UN is not even mentioned in
the index (Hasenclever, Mayer, and Rittberger 1997). 
missions and adopted 145 resolutions under Chapter
VII; such decisions are binding to all members and can
only be taken if international peace is seriously endangered.4
In contrast, before 1980 the UNSC authorized
only 15 peacekeeping missions and adopted only 22
resolutions under Chapter VII. Agreement among the
five veto powers on multilateral action does not imply 
that these states have harmonious interests now that 
the Cold War has ended. China, Russia, and to a lesser 
degree France have clashed with the United States 
over UNSC policy on a variety of occasions. States with 
the ability to veto UNSC action often abstain on 
resolutions, which allows them to pass without
expressions of explicit support. From 1991 until 1995 China
abstained 26 times, often on important matters. On 38
out of 97 Chapter VII resolutions in this period, the 
Chinese either abstained or expressed reservations 
(Thalakada 1997). From 1995 to 1998 China and 
Russia each abstained 15 times, compared to two 
abstentions among the other three members with veto 
power.5

The disagreement between the United States and 
the other permanent members stems from a variety of 
sources. UN interventions have not threatened Chi- 
nese territorial integrity or physical security, but China 
considers “nontraditional” aspects of peacekeeping as 
inimical to its interest (Fravel 1996) because they
infringe on national sovereignty.6 Nevertheless, the
desire to be viewed as the representative of the Non-Alignment
Movement (NAM) in the UNSC usually
leads China to support intervention in civil wars in 
Africa and parts of Latin America. The Chinese have 
most strongly objected to operations initiated by the 
United States and its allies, such as those in Iraq, the 
Balkans, and Haiti. China’s record of respecting UN 
resolutions, even those it opposes, is seen by some as 
evidence that China has no ambitions of becoming a 
dominant world power (Lampton 1998). Others argue 
more convincingly that its UNSC behavior fits well 
within the “maxi-mini” foreign policy strategy, that is, 
maximize security and economic benefits and minimize 
responsibilities (Thalakada 1997).7 China has been 
able to achieve both economic side payments and 
security guarantees in exchange for its behavior in the 
UNSC.8 This prompted Christensen (1996) to call
China “the high church of realpolitik in the post-Cold 
War world.”9 


4 These include military actions as well as nonmilitary sanctions
against states and even nonstate entities, such as UNITA in Angola.
See Bailey and Daws 1998, 271.

5 France and the United States each abstained once. See U.S.
Department of State various editions, 1995-98. 

6 See, for instance, Cromette 1999. 

7 The “maxi-mini” characterization of Chinese foreign policy strategy
stems from Kim 1994.

8 For instance, in exchange for a Chinese abstention on the landmark
resolution 678, which authorized the use of all necessary means to
restore Aristide to power in Haiti, the United States abstained in a
World Bank vote on Chinese loans. The security guarantees mostly
relate to concessions made by the United States and other nations in
relation to Taiwan and substantive changes in various resolutions.
Former Secretary of State James Baker (1995, 370-4) mentions that
the Chinese insisted on eliminating “the use of force” from the text
of a resolution. Concessions on Taiwan were acquired on negotiations
over Haiti (Malone 1998) and Guatemala (e.g., Goshko 1997).
9 A number of authors refer to the realpolitik nature of Chinese
Russia’s abstentions in the UNSC began when Yevgeni
Primakov replaced Andrei Kozyrev as foreign
minister in 1993. This shift, and the growing influence 
of nationalistic figures in the Yeltsin government, led 
at least rhetorically to a more anti-Western foreign 
policy strategy, which included advances toward China 
(McFaul 1997; Waller 1997). Russia’s prime goals 
became to reassert itself as an important power and 
establish some distance from the West (Kubicek 1999, 
594). In 1994, divisions in the Western alliance over 
policy for Bosnia gave the Russians an opportunity to 
further these goals. They wanted to avoid unilateral 
actions by NATO on which they were not consulted 
and over which they had no influence, so they sought to 
subordinate NATO to other organizations, such as the 
UN and the Organization for Security and Coopera- 
tion in Europe, leveraging their influence with the 
Bosnian Serbs (p. 552). 

This strategy yielded short-lived success in February 
1995, when Russian envoy Vitaly Churkin convinced 
the Serbs to pull back from the Sarajevo area. UN 
peacekeeping forces, mostly Russian troops, were sup- 
posed to monitor the situation. This settlement initially 
prevented NATO air strikes against the Serbs, but 
Serbian attacks on the UN-declared safe haven of 
Gorazde in April made air strikes inevitable, and 
Russia had to cede the military and diplomatic offen- 
sive to the West. This episode illustrates the behavior 
of Russia in the UNSC more generally. On the one 
hand, it tries to pursue the goals noted above but, on 
the other hand, often is willing to compromise in 
exchange for economic or other benefits, especially
when confronted with the inevitability of intervention 
(MacFarlane 1999, 241).10

The French generally support peacekeeping mis- 
sions and humanitarian interventions with generous 
contributions of troops and finances.11 The permanent
seat in the UNSC is regarded as a precious asset by the 
French, and French political leaders are expected to 
express a global vocation and exercise international 
influence. “The worst criticism the French can make of 
their leaders’ foreign policy is not that it is expensive 
and hazardous, but that it is flat and lacking in style” 
(Smouts 1998, 7). France often takes an independent 
stance in the UNSC, or at least a position distinct from 
that of the United States. The latest expression of this 
is the French alliance with the Russians and the 
Chinese over the renewal of weapons inspections in 
Iraq. By far the most serious conflict between the 
United States and its European allies concerned the 
Bosnian crisis, in which the British also clashed with 
the Americans (e.g., Papayoanou 1997). 





foreign policy in the post-Cold War period (e.g., Johnston 1999;
Nathan and Ross 1997; Segal 1995). 

10 These benefits usually encompass U.S. support for International 
Monetary Fund (IMF) or World Bank loans, but small security 
compromises also have been reached. For instance, it is alleged that 
in exchange for Russian support of resolution 678, the United States 
approved the deployment of a small UN mission in Georgia (Malone
1998, 107). 

11 France even initiated its own mission in Rwanda with the approval
of the UNSC. 
THE CONFIGURATION OF STATE
PREFERENCES IN THE MODEL 


According to Huntington (1999, 42), from the end of
the Cold War to the second half of the 1990s the 
central lineup among permanent members of the 
UNSC has shifted from 3:1:1 (the United States, 
France, and Britain against the Soviet Union, with 
China in the middle) to 2:1:2 (the United States and 
Britain opposing China and Russia, with France in the 
middle). In many cases in which the United States has 
actively sought UNSC authorization for the use of 
force, it has had to satisfy an ally with diverging 
preferences as well as at least one veto player with 
preferences at the opposite end of the spectrum. This 
observation motivates the basic assumption about the 
configuration of preferences in my model. In its sim- 
plest version, there are three strategic veto players: 
superpower S, its ally A, and a potential challenger to 
the superpower, C. These players have ideal points on
a unidimensional policy space $X$, such that $x_S = 0$,
$0 < x_A < 1$, and $x_C = 1$. I also assume a fixed status quo
point, $x_{SQ} \in X$. I am mostly interested in the
situation in which $0 < x_{SQ} < 1$, which means there is
conflict over the direction in which the status quo
should be shifted.12 As in most spatial models, the
negative Euclidean distance between the outcome of
the game ($x \in X$) and the ideal point of the player
defines the payoffs for the player. 
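
The preference configuration just described can be illustrated with a minimal Python sketch. It assumes a hypothetical ally ideal point of 0.4 (the text requires only that it lie strictly between 0 and 1) and introduces a helper, here called unanimity_set, that is not part of the model: it simply lists the grid points on the unit interval that all three veto players strictly prefer to a given status quo.

```python
# Ideal points on the unit interval. The value 0.4 for the ally is a
# hypothetical choice; the text only requires it to lie between 0 and 1.
IDEAL_POINTS = {"S": 0.0, "A": 0.4, "C": 1.0}


def payoff(player, x):
    """Negative distance between outcome x and the player's ideal point."""
    return -abs(x - IDEAL_POINTS[player])


def unanimity_set(status_quo, steps=100):
    """Grid points in [0, 1] that every veto player strictly prefers
    to the status quo (hypothetical helper, not part of the model)."""
    grid = [i / steps for i in range(steps + 1)]
    return [
        x for x in grid
        if all(payoff(p, x) > payoff(p, status_quo) for p in IDEAL_POINTS)
    ]


inside = unanimity_set(status_quo=0.5)   # status quo between S and C
outside = unanimity_set(status_quo=1.5)  # status quo beyond C's ideal point

print(len(inside), len(outside))
```

With the status quo at 0.5 the list is empty, because any move toward the superpower hurts the challenger and any move away from it hurts the superpower. With the status quo at 1.5, every grid point above 0.5 is unanimously preferred, which is the sense in which a status quo outside the unit interval poses no conflict over direction.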

In decision making over interventions we can con- 
ceptualize the U.S. ideal point for a hypothetical 
military intervention as an intervention under U.S. 
command that seeks to achieve goals set by U.S. 
policymakers. France wants an intervention with joint 
control, perhaps with a more limited authorization 
and/or somewhat different goals from the United 
States. Russia and China may prefer no intervention or 
at least the allied operation, since this would force the 
United States to make concessions and limit U.S. 
influence and/or weaken the intervention. If unilateral 
action were no more costly than multilateral action, 
then there would be no incentive for the United States 
to resort to multilateral action, because the latter 
implies compromises away from the U.S. ideal point. In 
the next section, I argue that American decision makers
perceive circumventing the UN and NATO as costly.


INCORPORATING COSTS INTO THE 
MODEL 


A central and recurring issue in debates on U.S. 
foreign policy in the post-Cold War period is whether 
to act alone or with others (Tucker 1999). The advan- 
tage of independence is that no compromises need to 
be made about objectives or the division of spoils. Yet, 
in UNSC and NATO operations, the US bears a 
smaller share of the financial burden and needs to
deploy fewer American soldiers than in unilateral
undertakings (Sandler and Hartley 1999).

12 This does not severely limit the analysis, because if $x_{SQ} < 0$ or
$x_{SQ} > 1$, there is always a compromise $x$ that all actors prefer to
$x_{SQ}$, such that $0 < x < 1$.

Political benefits also accrue from UNSC authorization.
First, the legitimacy of the UNSC can attract
domestic support for decisions to intervene, especially
if Congress is skeptical. James Baker (1995, 718)
emphasizes in his memoirs that this was the main
reason for a multilateral approach to the Gulf War.
Malone (1998, ix) argues that it was easier for the
Clinton administration to get support for military
intervention in Haiti from the UNSC than from Congress.
Moreover, opinion polls consistently show stronger
public support for U.S. participation in multilateral
actions than unilateral actions (Schultz n.d.; Sobel
1998). For instance, approval ratings for the Haitian
intervention never exceeded 20% when the question
was explicitly phrased as unilateral but were as high as
60% for a multilateral undertaking (Schultz n.d.). By
acting unilaterally, the president risks the domestic 
political cost of going against popular opinion. 

Second, it may be beneficial to use means perceived 
as legitimate by the international community. Not only 
liberals and constructivists but also some realists claim 
that approval for an action by an organ of the UN is a 
valuable asset, even for a major power. Mastanduno 
(1997) argues, for instance, that the use of multilateral 
institutions as instruments of foreign policy by the 
United States may dampen the incentives for other 
states to balance its preponderant power. Many ana- 
lysts and politicians perceived the cost of damaged 
relations with Russia as so high that the Kosovo 
operation should have been avoided altogether.13 The
administration took a less extreme position and con- 
sidered UNSC authorization highly desirable but not 
imperative (Daalder and O’Hanlon 2000). 

In my model, there are two costs of unilateral action:
circumventing the UN and ignoring allies (NATO).
First, the superpower pays a cost $c_A > 0$ for bypassing
its ally. Second, there is an additional cost $c_C > 0$ for
excluding the challenger from the coalition. I define
the costs in terms of their equivalent outcomes in the
policy space (Morrow 1986, 1141). For example,
acting unilaterally generates a policy outcome at the
superpower’s ideal point ($x = 0$), but the superpower’s
utility for this outcome is equivalent to that of a
compromise at $x = c_A + c_C$. If the superpower and
the ally decide to act bilaterally, the superpower only
incurs cost $c_C$, but a compromise is involved. I
assume for simplicity that this compromise is always
at the midpoint of the ally’s and the superpower’s
ideal points ($x = \tfrac{1}{2}x_A$), although more elaborate
models of bargaining between the ally and the
superpower could be developed.14
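
The cost accounting above can be illustrated with a small sketch. It is not part of the original analysis: the function name and the numerical values are hypothetical, and it assumes only what the text states, namely that the superpower's ideal point is 0, that payoffs are negative distances, and that exclusion costs are expressed in policy-space units.

```python
def superpower_payoffs(x_a, c_a, c_c, x_sq):
    """Superpower payoff under each option (illustrative helper, not from the article).

    The superpower's ideal point is 0, so an outcome x yields -|x|.
    c_a: cost of bypassing the ally; c_c: cost of excluding the challenger.
    """
    return {
        "status_quo": -abs(x_sq),
        # Outcome at x = 0, but both exclusion costs are paid.
        "unilateral": -(c_a + c_c),
        # Outcome at the midpoint x_a / 2; only the challenger is excluded.
        "bilateral": -(0.5 * x_a + c_c),
    }


# Hypothetical parameter values chosen only for illustration.
payoffs = superpower_payoffs(x_a=0.4, c_a=0.1, c_c=0.2, x_sq=0.8)
best_option = max(payoffs, key=payoffs.get)
print(payoffs, best_option)
```

With these illustrative values the bilateral compromise costs the superpower more than bypassing the ally would, so unilateral action yields the highest payoff of the three options.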

The assumption that, ceteris paribus, unilateral action
is more costly than multilateral action implies that
the United States considers authorization of an operation
by an international organization as desirable but
not essential. In terms of the model, if the UNSC is
willing to approve an intervention at the U.S. ideal
point, then the United States prefers authorization to
no authorization. This assumption is important, as it
will drive many of the results. It is also a reasonable
assumption, and its consequences are worth investigating.
From the perspective of the superpower, the
decision-making problem involves a tradeoff between
the costs of acting alone and the policy loss from
preserving the status quo or acting at some compromise
point, either bilaterally or multilaterally. This
tradeoff captures well the actual decision-making dilemmas
the United States has faced. In the case of
Kosovo, for instance, most debates revolved around
whether the situation was bad enough to warrant the
cost of intervention, including the risk of a troubled
relationship with Russia. From the perspective of the
challenger and the ally, the dilemma involves accepting
either a compromise or the possibility of outside action
over which they have no control.

13 Prominent politicians who openly opposed the NATO intervention
include Trent Lott, Jimmy Carter, James Baker, Dan Quayle,
and Pat Buchanan. The divide was not completely along party lines.
Among others, presidential hopefuls Elizabeth Dole and John McCain
favored the intervention. See Cohen 1999; Mandelbaum 1999;
Whitney 1999.

14 For an application of an intra-alliance bargaining model to the
Bosnia case, see Papayoanou 1997.


MULTILATERAL AGREEMENTS IN THE 
PRESENCE OF CONFLICT 


Does the availability of a credible outside option for 
one actor or subset of actors create a situation in which 
a multilateral compromise can be reached that is not
possible in the absence of an asymmetric option? To
answer this question, let us first define the conditions
under which options are credible. Lemma 1 presents 
these conditions for unilateral and bilateral action, 
respectively. Throughout the article I make the tie- 
breaking assumption that indifferent players choose 
UNSC action over bilateral action over the status quo. 


Lemma 1. Unilateral action is a credible outside alternative
if and only if $c_A + c_C < x_{SQ}$. Bilateral action is a
credible outside alternative if and only if (i) $c_C +
\tfrac{1}{2}x_A < x_{SQ}$ and (ii) (a) $\tfrac{3}{2}x_A < x_{SQ}$ or
(b) $c_A + c_C < x_{SQ}$.


The proof is straightforward and can be presented 
informally. Unilateral action is credible whenever the 
superpower prefers the costs of unilateral action to the 
utility loss represented by the status quo. Bilateral 
action is slightly more complicated. It requires (i) that 
the superpower prefers to incur the cost of excluding 
the challenger ($c_C$) plus the utility loss of the bilateral
policy ($\tfrac{1}{2}x_A$) to the utility loss represented by the status
quo. It also requires (ii) that the ally prefers bilateral 
action. This can occur in two ways. The ally either 
prefers the compromise policy to the status quo or 
prefers the compromise to the credible unilateral al- 
ternative. 
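
The informal argument can be checked numerically. The sketch below is an illustration rather than the author's proof: the function names and parameter values are hypothetical, the inequalities are taken as strict, and the ally's preference is computed directly from distances to its ideal point instead of from a closed-form threshold.

```python
def unilateral_credible(c_a, c_c, x_sq):
    """Superpower (ideal point 0) prefers paying c_a + c_c to living with x_sq."""
    return c_a + c_c < x_sq


def bilateral_credible(x_a, c_a, c_c, x_sq):
    """Both the superpower and the ally must prefer the midpoint compromise."""
    compromise = 0.5 * x_a
    # (i) Superpower: cost of excluding the challenger plus policy loss
    #     at the midpoint must be smaller than the loss from the status quo.
    superpower_ok = c_c + compromise < x_sq
    # (ii)(a) Ally prefers the compromise to the status quo (distance to x_a).
    ally_prefers_to_sq = abs(compromise - x_a) < abs(x_sq - x_a)
    # (ii)(b) Or unilateral action is credible, in which case the ally
    #         compares the compromise with an outcome at 0, which is farther away.
    ally_ok = ally_prefers_to_sq or unilateral_credible(c_a, c_c, x_sq)
    return superpower_ok and ally_ok


# Hypothetical values: a close ally and moderate costs.
print(unilateral_credible(c_a=0.3, c_c=0.1, x_sq=0.5))
print(bilateral_credible(x_a=0.2, c_a=0.3, c_c=0.1, x_sq=0.5))
```

Condition (ii)(b) matters when the ally would otherwise prefer the status quo: once the superpower can credibly act alone, the ally's relevant comparison is no longer the status quo but an intervention at the superpower's ideal point.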

Suppose that at least two veto players have conflict- 
ing preferences about the direction in which the status 
quo should be moved ($0 < x_{SQ} < 1$). In the absence of a
credible outside option, this means that the status 
quo is stable. When either outside alternative is cred- 
ible, however, the superpower will exercise the 
outside option if the challenger vetoes a multilateral
agreement. The players no longer evaluate their
payoff from a multilateral compromise against their 
payoff from the status quo outcome but, instead, 
against their payoff from the outside option. It is 
easy to verify that if both outside options are credible,
then the superpower prefers bilateral to unilateral
action if $\tfrac{1}{2}x_A \le c_A$. The credible and preferred
outside option redefines the disagreement outcome. 
I now can show that there is a range of multilateral 
agreements that save the superpower the cost of 
acting unilaterally and that save the challenger po- 
tentially bad outcomes, such as an intervention over 
which it has no control. Proposition 1 defines the 
range of multilateral agreements that all players 
prefer to the disagreement outcome. 


challenger worse off than the status quo. 
i. If unilateral action is the preferred outside option, the range is
$[0, \min(c_A + c_C, 2x_A)]$.
ii. If bilateral action is the preferred outside option, the range is
$[\tfrac{1}{2}x_A, \tfrac{1}{2}x_A + \min(c_C, x_A)]$.
Proofs are available from the author or at
http://www.princeton.edu/~evoeten.
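
A minimal sketch of how such a range can be computed follows. It is an illustration under stated assumptions, not the author's proof: the function name and example values are hypothetical, and the bounds are derived from the payoff definitions (the superpower accepts any agreement no worse than the cost-equivalent of its outside option, and the ally accepts any agreement no farther from its ideal point than the outside outcome).

```python
def bargaining_set(x_a, c_a, c_c, outside):
    """Return (lower, upper) bounds of agreements all players accept.

    outside: "unilateral" or "bilateral", the preferred credible outside option.
    """
    if outside == "unilateral":
        lower = 0.0                      # outside outcome: superpower's ideal point
        superpower_limit = c_a + c_c     # superpower indifferent to acting alone
        ally_limit = 2.0 * x_a           # ally as far from x_a as it is from 0
    elif outside == "bilateral":
        lower = 0.5 * x_a                # outside outcome: midpoint compromise
        superpower_limit = lower + c_c   # superpower indifferent to bilateral action
        ally_limit = lower + x_a         # ally as far from x_a as from the midpoint
    else:
        raise ValueError("outside must be 'unilateral' or 'bilateral'")
    return lower, min(superpower_limit, ally_limit)


# Hypothetical values for illustration only.
print(bargaining_set(x_a=0.2, c_a=0.3, c_c=0.1, outside="unilateral"))
print(bargaining_set(x_a=0.4, c_a=0.1, c_c=0.2, outside="bilateral"))
```

Whichever of the two limits is smaller binds, which is why a close ally (small `x_a`) can tighten the range in the superpower's favor.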

A credible outside option creates a bargaining range
that otherwise would not exist. Figure 1 illustrates how
the bargaining set is determined by the different variables
in the model. The lower bound ($x^L$) is determined
by the outcome point associated with the outside
action. By assumption, if unilateral action is the alternative,
this corresponds to the superpower’s ideal
point, and if bilateral action is the preferred outside
option, it is the midpoint between the superpower’s
and the ally’s ideal points. The point closest to the
superpower’s ideal point at which either the ally or the
superpower is indifferent between outside and multilateral
action determines the upper bound of the
bargaining set.

FIGURE 1. The Set of Feasible Bargaining Outcomes with Different Positions of Superpowers
[Figure not reproduced. Panels: Outside Option Is Unilateral Action; Outside Option Is Bilateral Action. Marked regions: Bargaining Set; Agreements that S vetoes.]
S = Superpower, A = Superpower's Ally, C = Challenger

The upper part of Figure 1 shows that if the ally’s 
ideal point is close to the superpower’s ideal point 
compared to the cost of outside action, there is a range 
of multilateral agreements that will be vetoed by the 
ally but not the superpower. The ally does not have 
concerns equal to those of the superpower about the 
cost of excluding the challenger from a coalition. The 
ability of the ally to veto proposals the superpower will 
not veto may help the latter achieve more favorable 
compromises. That is, $x^U$ is closer to the superpower’s
ideal point than is the point that corresponds to the
agreement at which the superpower is indifferent to
outside action ($x'$). This analysis implies that a close
ally is particularly important to the superpower when it 
is relatively costly to exclude the challenger from the 
coalition. It should be noted that all multilateral agree- 
ments make the superpower better off than the status 
quo and at least as well off as the outcome associated 
with the outside option. 

The central prediction of proposition 1 is that a 
credible outside option helps the United States obtain 
a favorable UNSC agreement in the absence of harmonious
interests among the veto powers. There is
substantial anecdotal evidence that outside options 
have indeed been important in overcoming disagree- 
ment among the five permanent members. The diplo- 
macy surrounding the building of the Gulf War coali- 
tion illustrates the validity of both proposition 1 and 
the essential assumption in the model that UNSC 
agreement has some value to the United States. After 
Iraq invaded Kuwait in summer 1990, the international 
community was unified in condemning Saddam Hussein
and in demanding unconditional withdrawal.15
The Soviets and the Chinese, however, were not willing 
to back this demand with the use of force. The United 
States easily could have acted outside the UNSC with 
the help of the British, and Margaret Thatcher publicly 
stated she preferred this option (Baker 1995, 278-9). 
This indicates that it may indeed sometimes be easier 
for an ally to reject an inclusive multilateral option 
than for the superpower to do so, as suggested by the 
analysis in Figure 1. The Americans cared more about 
keeping others involved, especially the Russians, and 
struggled to build the UNSC coalition. They also 
incurred the cost of compromise, in terms of both the 
resolution text and debt concessions, support for 
World Bank loans, and financial aid (Baker 1995, 
260-309; Goldstein 1995). Eventually a compromise 
was achieved, even though a Chinese abstention and a 
“no” vote from Yemen on the final roll call could not 
be avoided. According to Baker (1995, 282) the U.S. 
ability to act alone played a large role in achieving the 
compromise. 

The Haitian intervention is another example in
which the United States had a clear unilateral alterna- 
tive. The United States had not sought authorization 
for the use of force to overthrow regimes in Grenada 
(1983) or Panama (1989). After the breakdown of the 
Soviet Union had left the United States the only 
superpower, the Americans took considerable trouble 
to garner support from the UNSC to overthrow the 
Haitian regime. In particular, they had to overcome 
resistance from the Chinese, who did not want to 
establish a precedent for interference in the domestic 
affairs of a country. According to a careful study by 
Malone (1998, 104-5), the U.S. decision to consider 
military intervention a serious option in spring 1994 
was crucial to passage of UN resolution 940.16 

If two superpowers with opposing preferences have
outside options, then UNSC authorization for intervention
can prevent a direct confrontation between the
two.17 Indeed, during the Cold War, the purpose of
most UN missions was to maintain the status quo (e.g.,
Lebanon, Cyprus). Since then, most missions have
moved the status quo toward the ideal point of the
United States. For example, in 1988, when the Cold
War was ending, the UNSC initiatives to monitor the
Cuban withdrawal from Angola and the Russian withdrawal
from Afghanistan (Durch 1993, 21) clearly
served the purposes of the one remaining superpower.

15 The UNSC resolution with this implication, SR660 (August 2,
1990), was adopted with one abstention: Yemen.

16 The first public mention of military action as a serious option was
June 8, 1994. The final passage of resolution 940 was on July 31,
1994.

17 This situation is modeled explicitly in the working paper version.

Another piece of evidence that UNSC agreement in 
the post-Cold War period is often not merely the result 
of harmonious preferences is the dramatically different 
voting records of states in the UNSC and the General 
Assembly (UNGA). The United States usually has no 
outside options for matters decided in the UNGA. 
Moreover, contrary to UNSC resolutions,
UNGA resolutions are nonbinding, which allows states
to vote according to their preferences. The United
States votes in favor of only 25% of resolutions
adopted by roll call vote in the UNGA, compared to
virtually 100% in the UNSC.18 China votes for 83% of 
UNGA resolutions and Russia for 50%, but in the 
UNSC they abstain much more frequently than the 
United States. Moreover, on procedural issues in the 
UNSC, cases in which the United States has no veto 
power and no outside options, the Americans often 
concede on sensitive matters, such as allowing Pales- 
tinian delegations to attend UNSC meetings. This 
suggests that outside options are indeed essential in 
order for the United States to negotiate multilateral 
compromises in its favor. 


NONPERMANENT MEMBERS AND 
ADDITIONAL VETO PLAYERS 


Besides the five permanent members that have veto 
power, the UNSC has ten members that have neither 
permanent membership nor veto power.19 These nonpermanent
members may, however, block passage of
resolutions. Under current procedures, passage of sub- 
stantive resolutions requires nine affirmative votes, 
including the concurring vote of the five members with 
veto power. I simplify this by adding two nonveto 
players to the model with three veto players. For 
passage of a resolution, four out of the five players have 
to vote in favor, and no veto player can exercise its veto 
power. I assume that the nonpermanent members have 
ideal points somewhere on the policy continuum be- 
tween the ideal points of the challenger and the 
superpower. Earlier analyses using a different frame- 
work revealed that nonpermanent members have only 
a very small probability of influence (O’Neill 1996; 
Winter 1996). Proposition 2 defines the conditions 
under which nonpermanent members can influence the 
bargaining range and shows that if they do so the 
superpower benefits, not the challenger. 
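
The simplified voting rule described above (three veto players, two nonveto players, four affirmative votes, no veto) can be written out as a short sketch. The function name and the player labels `N1` and `N2` are hypothetical, and abstentions are deliberately left out, as in this version of the model.

```python
VETO_PLAYERS = ("S", "A", "C")      # superpower, ally, challenger
NONVETO_PLAYERS = ("N1", "N2")      # hypothetical labels for nonpermanent members


def resolution_passes(votes, threshold=4):
    """votes maps each of the five players to True (in favor) or False (against)."""
    # A negative vote by any veto player blocks the resolution.
    if any(not votes[p] for p in VETO_PLAYERS):
        return False
    # Otherwise the resolution needs at least `threshold` affirmative votes.
    in_favor = sum(1 for p in VETO_PLAYERS + NONVETO_PLAYERS if votes[p])
    return in_favor >= threshold


print(resolution_passes({"S": True, "A": True, "C": True, "N1": True, "N2": False}))
print(resolution_passes({"S": True, "A": True, "C": True, "N1": False, "N2": False}))
```

The second call shows why nonveto players matter at all under this rule: agreement among the three veto players is not sufficient if both nonveto players vote against.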


PROPOSITION 2. Nonpermanent members can affect the
bargaining set only if the ideal points of both nonpermanent
members are closer to the ideal point of the
superpower than the ideal point of the ally is to the
superpower’s ideal point. The consequence can only be
a shift of the bargaining set's upper bound toward
the superpower's ideal point.


18 Based on my computations from voting records for 1991-96. 

19 These members are elected out of the general membership of the 
UNGA for two-year terms that are not immediately renewable. The
elections are organized by regional groupings.

Proposition 2 implies that nonpermanent members
may help but not hurt the superpower. They do not 
have concerns equal to those of the superpower about 
the cost of excluding the challenger from a coalition. 
Therefore, a nonpermanent member with preferences 
close to those of the superpower may vote against 
proposals that the superpower would not veto. Non- 
permanent members who are close to the superpower 
thus may constrain the bargaining set in a way similar 
to that of a close ally (see Figure 1). Of course, this only 
helps the superpower if it constrains the bargaining set 
beyond the extent to which the preferences of the ally 
already constrain it. 

Proposition 2 implies that when the United States 
and its allies have divergent preferences, nonperma- 
nent members who are close to the United States can 
help it achieve favorable multilateral agreements. An 
example is lifting the weapons embargo against the 
Bosnian Muslims, which was favored by the United 
States but strongly opposed by NATO allies. The U.S. 
threat to lift the embargo unilaterally and support from 
the majority of nonpermanent members (including 
many Muslims) proved decisive in achieving a UNSC 
compromise on the matter (Christopher 1998, 344-50). 
In general, however, it is rare for a sufficient number of 
nonpermanent members to have preferences closer to 
the U.S. ideal point than do Britain and France. 

The same logic that applies to nonpermanent mem- 
bers can be extended to the introduction of additional 
veto players, a major issue in current debates surrounding
UNSC reform. If a new veto player’s preferences
are closer to the superpower’s ideal point than to the 
ideal points of the other veto players and the pivotal 
nonpermanent member, then the introduction of an 
additional veto player may help the superpower. It is 
not surprising that the United States has decided to 
support the candidacy for permanent membership in 
the UNSC of its close allies, Japan and Germany. 
Ceteris paribus, the introduction of “hostile” veto 
players does not affect the set of bargaining outcomes, 
but this assumes that the superpower incurs no addi- 
tional cost from excluding these new players from a 
winning coalition. 


ABSTENTIONS 


China and Russia frequently abstain from UNSC votes 
and often issue a statement that indicates sharp dis- 
agreement with the resolution. In the model I do not 
specify any incentives for the challenger to abstain. It is 
imaginable, however, that a challenger would choose to 
abstain rather than vote in favor of a resolution from 
the bargaining set defined by proposition 1, which 
outlines an agreement that the challenger does not 
prefer to the status quo. Such an abstention signals 
discontent, not indifference. At the same time, an 
abstention indicates impotence. Why else would a state 
with formal veto power not exercise it on a resolution 
that it dislikes? 

If this reflects the actual use of abstentions, China 
and Russia should abstain only when the United States 
and its allies have credible outside options. Otherwise,
they could veto any proposal they dislike, and only
proposals that reflect true great power harmony could 
be adopted. Evidence from the UNSC voting records 
supports this assertion. From 1990 to 1998, on resolu- 
tions that initiated, extended, expanded, or terminated 
a mandate for the use of force or enforceable sanc- 
tions, there were multiple Chinese and Russian ab- 
stentions in the cases of Albania, Bosnia, the Gulf War, 
Haiti, Kosovo, and Macedonia, as well as the monitoring
of the sanctions and embargo on Iraq. In each
instance a clear outside alternative was either exercised 
or explicitly available. 

Over the same period, UN missions or sanctions 
were authorized without great power abstentions in 
Angola, Cambodia, Central African Republic, El Salvador,
Lebanon, Liberia, Mozambique, Nicaragua, Sierra
Leone, Somalia, and the Western Sahara as well as
in Afghanistan, Georgia, Nagorno-Karabach, and Ta- 
jikistan. In most cases the country either requested 
UNSC assistance, about which there has been little or 
no great power disagreement since the end of the Cold 
War, or bordered Russia, which makes acting alone 
extremely costly to the United States. The only obvious 
exceptions are El Salvador and Nicaragua, where small 
and relatively uncontroversial observer missions were 
based. China abstained on two occasions with respect 
to operations in Africa, and in both cases Western 
powers had outside alternatives. The first involved 
France’s Operation Turquoise in Rwanda and Eastern 
Zaire (SR929, June 22, 1994), and the second was U.S. 
aircraft sanctions against Sudan in 1996 (SR1070, 
August 16, 1996). These sanctions compelled states to 
deny aircraft permission to take off from, land in, or fly 
over their territories if the aircraft was registered in the 
Sudan, or owned, leased, or operated or substantially 
owned or controlled by the government or public 
authorities of the Sudan. 


BARGAINING ALONG THE PARETO 
FRONTIER 


It appears that a credible outside option allows the 
United States to shift the disagreement outcome in 
UNSC bargaining, that is, creates a bargaining range 
that would not exist in the absence of such an option. 
Since the United States strongly prefers all outcomes in 
the bargaining set to the status quo, the outside option 
lends it considerable bargaining power. But not all 
outcomes in the bargaining set are equally desirable for 
all players. The superpower prefers outcomes at the 
lower end of the bargaining range drawn in Figure 1,
whereas the challenger prefers outcomes at the upper
bound. How do the players agree upon an outcome
from the bargaining set, given their conflicting interests?
What tactics can they employ to obtain more
favorable outcomes? In this section I provide answers
to these questions.

20 The total was 365 UNSC resolutions from 1990 to 1998. I included
mandates for observer missions, unless they were explicitly purely
civilian.

21 Bosnia represents a difficult case as NATO troops exceeded the
UNSC mandate in 1995 but did not totally circumvent the UN. The
credibility of NATO threats for outside action was in doubt before
1995, but the threats were always there. Albania (SR 1101, March 29,
1997) is a somewhat distinct case because it was not the United
[...] led by Italy, would continue with or without UN authorization.

To simplify, I model the process as a bilateral 
bargaining game in which the superpower and the
challenger alternate offers. Although bargaining be- 
tween the superpower and its allies may be interesting, 
most UNSC agreement hinges on a deal struck be- 
tween the United States and Russia and/or China. If 
the superpower and the challenger agree on a compro- 
mise in the bargaining stage, then all players vote on its 
acceptance in the voting stage. The game is a variant of 
the Rubinstein (1982) bargaining model with outside 
options, except that players have spatial utilities and 
the outcome needs to be approved by a vote. The 
separation into a bargaining and a voting stage reflects 
actual decision-making procedures well (Bailey and 
Daws 1998). 

The outcome in a Rubinstein bargaining model 
depends on other game structures, in particular, who 
makes the initial offer and the players’ respective 
haggling costs. In this analysis, the structure follows 
logically from the bargaining situation I investigate. 
Because the superpower seeks to alter the status quo, 
whereas the challenger prefers to maintain it, it is 
reasonable to assume that the superpower always 
makes the initial offer. There is no particular advantage 
in this, so there is no reason for the challenger to 
propose a preemptive compromise. It is also reason- 
able to assume that delay in agreement is costless for 
the challenger. Indeed, it can be argued that the 
challenger could benefit from delay, although I do not 
analyze the reasons here.22 The superpower, however,
is likely to incur a positive cost of delay. While UNSC 
negotiations last, a situation undesirable to the super- 
power continues to evolve. For example, the human 
rights violations in Kosovo prompted domestic and 
international pressure on the U.S. government to act, 
and this pressure grew as time went on. The increasing 
flow of refugees from Haiti also imposed a delay cost 
on the United States. The superpower incurs delay 
costs regardless of whether it eventually intervenes 
alone or through multilateral action. 

A general result from the Rubinstein (1982) model is 
that if one player is infinitely patient but the other is 
not, then the player least eager to reach agreement—in 
this case the challenger—receives all that is at stake in 
the game. 


PROPOSITION 3. With complete information, the challenger
and the superpower will agree on a proposal $x = x^U$
in the first round of bargaining, and the UNSC will
vote in favor of that proposal.


22 In general, bargaining is strictly increasing in such an inside
option. See [...] for an analysis of inside options in
Rubinstein bargaining models.

The proof of this proposition follows directly from
Rubinstein (1982).23 The proposition implies that the
credible outside option helps the superpower establish 
a favorable bargaining set, but the option cannot be 
used as a strategic asset in the bargaining process over 
the set of Pareto-efficient compromises.24 The under- 
lying intuition is that the superpower can only use the 
option to gain leverage over the challenger by making 
a threat to go it alone if the challenger does not accept 
a proposal. This threat is only credible if the outside 
option gives the superpower a higher payoff than 
continued bargaining. At any point in the process, 
however, the superpower prefers any outcome from 
the bargaining set to the outside action, so the latter 
gives the superpower no bargaining power over the set 
of Pareto-efficient agreements. In contrast, patience 
gives the challenger quite a bit of bargaining power. 
These insights are contrary to Krasner (1991), who
argues that differences in capabilities matter when 
bargaining over outcomes along the Pareto frontier; I 
suggest that asymmetric capabilities can be used to 
establish the frontier but not to gain leverage when 
bargaining along it. 

Can the superpower use bargaining strategies to 
achieve a more favorable outcome than $x^U$? A frequently
discussed issue is intentional actions that partially
commit states to a strategically chosen bargaining
position. Schelling (1960, 28) observes: “When national 
representatives go to international negotiations know- 
ing that there is a wide range of potential agreement 
within which the outcome will depend on bargaining, 
they seem often to create a bargaining position by 
public statements, statements calculated to arouse a 
public opinion that permits no concessions to be 
made.” This course can be costly. When leaders implic- 
itly or explicitly link their political fate to a specific 
bargaining position, backing down may jeopardize their 
survival. The utility of partially committing to a bar- 
gaining position, or creating “audience costs,” has 
received widespread attention (Fearon 1994, 1997; 
Schultz 1998, 2001; Smith 1998). Much of this litera- 
ture argues that leaders can create larger audience 
costs in a democracy than in other political systems 
because their survival is more directly dependent on 
how voters perceive them. In these models, the stron- 
ger the ability to make partial commitments, the better 
is the bargaining outcome. 

To study the effect of partial commitments, I add a
stage to the model that allows players to set their
commitment level. Let us assume that before embarking
on negotiations the superpower takes some action
to communicate that it will accept compromise $x$ only if
$x < z_S$. If it reneges and accepts $x > z_S$, it incurs a cost
$k_S(x - z_S)$, with $k_S > 0$.25 It becomes more costly to
revoke the partial commitment as the final agreement
moves farther from the publicly stated target. The
coefficient $k_S$ determines the extent to which the superpower
ties its hands by partially committing. Proposition
4A states the main result of this analysis.

PROPOSITION 4A. A unilateral ability to make a partial
commitment does not help the superpower achieve a
more favorable outcome but may be used to hurt the
challenger.

23 It is straightforward to see that Rubinstein’s original analysis
applies here, where the pie is defined by the bargaining interval from
proposition 1. The result holds whether we use discount factors or
fixed costs to model the cost of delay. With fixed costs the result
is even stronger: The player who is more patient than the other [...]


Because the challenger is infinitely patient and the
superpower is not, the challenger can always respond
with counteroffers that give the superpower exactly its
reservation value. Thus, for any commitment $z_S < x^U$,
the challenger can find a proposal $x$ that it prefers to
$x = z_S$ and that the superpower prefers to the outside
option. Although partial commitment does not affect
the utility of the superpower, it can affect the final
agreement and thus the utility of the challenger. This is
true because there are equilibria in which the superpower
makes a partial commitment, $z_S < x^U$, and
accepts the agreement $z_S < x^{**} < x^U$, which makes it
just as well off as not making a commitment and
proposing $x = x^U$. Because the challenger’s utility is
increasing in $x$, the challenger is clearly not indifferent
between these options. The competitive nature of
international politics makes it reasonable to assume
that when the superpower is indifferent regarding
various possible multilateral agreements, it prefers the
agreement $x^{**}$ that makes the challenger worse off.26 In
other words, the superpower may use a partial commitment
strategy to hurt the challenger, but it cannot
use such a strategy for its own benefit.
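
The indifference argument can be made concrete with a short sketch. It rests on assumptions that go beyond the text: delay costs are ignored, the superpower's payoff from an agreement x after committing to a target z_s is taken to be -x - k_s(x - z_s), and its reservation value is the payoff -x_u from the uncommitted outcome. Setting the two equal gives x = (x_u + k_s z_s) / (1 + k_s). The function and variable names are hypothetical.

```python
def indifference_agreement(x_u, z_s, k_s):
    """Agreement that leaves a committed superpower exactly as well off as x_u.

    Solves -x - k_s * (x - z_s) = -x_u for x (illustrative assumption:
    linear reneging cost, no delay cost).
    """
    if not 0.0 <= z_s < x_u:
        raise ValueError("the commitment target must lie below x_u")
    if k_s <= 0.0:
        raise ValueError("k_s must be positive")
    return (x_u + k_s * z_s) / (1.0 + k_s)


# Hypothetical values: uncommitted outcome 0.4, public target 0.2, k_s = 1.
x_star = indifference_agreement(x_u=0.4, z_s=0.2, k_s=1.0)
superpower_payoff = -x_star - 1.0 * (x_star - 0.2)
print(x_star, superpower_payoff)
```

In this example the agreement moves toward the superpower's ideal point, yet the reneging cost absorbs the entire gain, so the superpower's payoff is unchanged while the challenger's falls.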

Can the challenger use a partial commitment tactic
to exclude such disadvantageous equilibrium agreements?
We may assume that in response to the superpower
it can partially commit to compromise $x = z_C$
and incurs cost $k_C(z_C - x)$ if the target is not met.
Proposition 4B states the main result.


PROPOSITION 4B. If the ability of the challenger to incur
audience costs is sufficiently high, the challenger can 
prevent the superpower from using commitments to 
punish the challenger. 

The intuition behind proposition 4B is that the
challenger can use a commitment strategy credibly if
the cost of reneging is so high that the outside option
would be preferable. If the ability of the challenger to
create audience costs is sufficiently great, it can achieve
absolute gains from a partial commitment strategy.
Russian leaders frequently make statements that stir
national sentiments and tie their hands in UNSC
negotiations. This is particularly effective in UNSC
negotiations over operations in the Balkans, since the
Russian people genuinely care about the issue and
audience costs are easily engendered, as opposed to an
issue such as intervention in Haiti.

25 This is a fairly standard way of modeling the cost of partial
commitments with continuous outcomes (see Muthoo 1999, chap. 8).
Many models restrict the analysis to two possible outcomes.

26 Even though the players in this model care about absolute rather
than relative gains, the spatial utilities of the model are relative by
nature; a gain for the superpower means a loss for the challenger.

A general insight from the analysis thus far is that it 
is difficult for the United States to win concessions 
when bargaining over outcomes along a Pareto frontier 
that basically exists because of coercion. In cases such 
as Haiti, Bosnia, and Kuwait, the American bargaining 
position was weakened because the United States was 
much more eager to intervene than at least one of the 
states with veto power. Holdouts could demand con-
cessions or economic benefits. Side payments are one
way a preponderant economic power can achieve a 
favorable multilateral compromise. When side pay-
ments are equally valuable to all players, they do not
change the outcome in terms of utility. When they have 
a different marginal rate of substitution for the super- 
power and the challenger, there is an opportunity for 
Pareto improvement.27 For example, World Bank and In-
ternational Monetary Fund loans are often much more
valuable for China and/or Russia than they are costly 
for the United States, and U.S. backing virtually guar- 
antees loan approval. Both sides have something to 
trade, and there is ample evidence that deals were 
made with respect to the Gulf War and the Haitian 
intervention.28 In principle, side payments also can be
used to break deadlock in the absence of credible 
outside options, but in these situations the United 
States holds no additional advantages because it lacks 
outside options. Obtaining a given policy outcome is 
much less expensive when a credible outside option is 
available. 


INCOMPLETE INFORMATION AND THE 
POSSIBLE FAILURE OF MULTILATERAL 
AGREEMENTS 


So far I have assumed that all actors have complete 
information about one another’s preferences and costs. 
I now relax the assumption of complete information. In 
particular, I assume that other states are uncertain 
about the extent to which the superpower is willing to 
compromise to keep the challenger in the multilateral 
coalition. In the case of Kosovo, for example, there was 
much public debate within the United States about the 
costs of acting without Russian approval relative to the 
benefits of intervention. Some argued that these costs 
were so high that they outweighed the benefits. Others 
thought the United States should intervene, regardless 
of the Russians.29 This kind of dispute is likely to make
other actors uncertain about the true intentions of the 
executive, especially because the president has incen-
tives to misrepresent the true value of these costs.30


27 See Morgan 1990 for a similar argument on the effectiveness of
issue linkage.

28 On the Gulf War, see Baker 1995, 275-300; Goldstein 1995. With
respect to Haiti, see Malone 1998.

29 See the discussion in the section on costs.

30 I assume that the final decision-making power lies with the
executive, who may or may not be influenced by Congress and other
domestic actors.

I assume that actors other than the superpower
believe that $c_C$ takes on the value $c_H$ with probability $\pi$,
and the value $c_L$ with probability $(1 - \pi)$, where $0 <
c_L < c_H < 1$ and $0 < \pi < 1$. This assumption
establishes that there are two types of superpower: $S_H$
(the high compromise type) and $S_L$ (the low compro-
mise type).31 The challenger and the ally both have the
initial belief $\pi$ that $S = S_H$. After each move by the
superpower, they update this belief using Bayes’s rule.
I limit the analysis to the case in which $S_L$ prefers
unilateral and/or bilateral action to the status quo, but
$S_H$ prefers the status quo to both.32 Thus, if the
superpower proposes any $x < x_{SQ}$, the challenger and
the ally are uncertain whether the proposal will be
credibly backed up by outside action. The threat is
credible from a superpower of the low-cost type but not
from a superpower of the high-cost type. Proposition 5
summarizes the outcomes in the sequential equilibrium
of this game.


PROPOSITION 5. If the ally and the challenger are uncertain
about the credibility of the outside option, there exists a
sequential equilibrium where the outcome depends on
whether their initial belief $\pi$ that the outside option is
not credible exceeds some threshold $p_L$. The outcomes
can be summarized as follows:


(1) If $\pi \le p_L$: The superpower makes an initial
proposal $x^*$ that all players accept.

(2) If $\pi > p_L$: No multilateral agreement can be
reached. Depending on the true type of the super-
power, either the status quo is maintained, or
outside action is initiated.


If unilateral action is the preferred outside option
for $S_L$: $p_L = \frac{c_A + c_L}{x_{SQ}}$

If bilateral action is the preferred outside option for
$S_L$: $p_L = \frac{c_L}{x_{SQ} - x_A}$

The proposition states that multilateral agreement
breaks down if the ally and the challenger believe that
the superpower’s outside option is not credible above a
critical threshold $p_L$. In this scenario, the challenger
prefers the lottery between the status quo outcome and


31 This is a standard modeling strategy (see Morrow 1994a).

32 In an earlier version, I examined the case when both $S_H$ and $S_L$
prefer outside action to the status quo. In this scenario, the uncer-
tainty is not about the credibility of the outside option but about the
extent to which S is willing to compromise. The possibility of more
favorable multilateral compromises provides $S_H$ with an incentive to
misrepresent its true willingness to compromise by making the same
offers that $S_L$ makes in equilibrium. I show that as long as C's belief
that $S = S_H$ is below some critical threshold, C always accepts this
initial offer. When C's belief is above that threshold, a mixed strategy
sequential equilibrium exists in which C learns about the superpow-
er's true willingness to compromise. In this equilibrium, C rejects the
superpower's proposals with positive probability and makes counter-
offers that $S_L$ does not accept, but $S_H$ accepts with positive proba-
bility.

the outside action to any agreement that is acceptable
to $S_L$. The outcome will be the status quo if the
superpower is of the high-cost type ($S_H$) and outside
action if the superpower is of the low-cost type ($S_L$). In
equilibrium, $S_H$ always pools with $S_L$. The payoff for $S_H$
when negotiations break down is the same as the payoff for its
outside option: the status quo. The payoff for a com-
promise $x < x_{SQ}$ is better than that. $S_H$ therefore has
no incentive to reveal any information about its true
resolve, so the challenger cannot learn during the
bargaining process. As long as $\pi \le p_L$, this may lead to
the authorization of a multilateral intervention even if
the superpower is of the high-cost type and would not
resort to outside action in the absence of a multilateral
agreement. This result is important because it implies
that incomplete information about the willingness of
the superpower to compromise may lead to interven-
tions authorized by a multilateral institution that would
not have been initiated in the absence of this institu-
tion. If $\pi > p_L$, the incomplete information about the
true intentions of the superpower may lead to outside
action that all actors would prefer to avoid.
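
The pooling argument can be illustrated with a minimal sketch of Bayes's rule. The function name, the prior of 0.3, and the offer probabilities are hypothetical values chosen for illustration and do not come from the model itself. When both types make the same offer with certainty, the posterior belief equals the prior, which is why the challenger cannot learn during bargaining.

```python
def posterior_high_type(prior_high, p_offer_given_high, p_offer_given_low):
    """Belief that the superpower is the high-cost type after an observed offer.

    Applies Bayes's rule; all inputs are probabilities in [0, 1].
    """
    numerator = prior_high * p_offer_given_high
    denominator = numerator + (1.0 - prior_high) * p_offer_given_low
    if denominator == 0.0:
        raise ValueError("the offer has zero probability under both types")
    return numerator / denominator


# Pooling: both types make the same offer with certainty, so nothing is learned.
pooled = posterior_high_type(0.3, 1.0, 1.0)

# Hypothetical contrast: only the low-cost type would ever make this offer,
# so observing it rules out the high-cost type.
separated = posterior_high_type(0.3, 0.0, 1.0)

print(pooled, separated)
```

The second call is only a contrast case; in the equilibrium described above the high-cost type has no incentive to separate.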

The initial belief $\pi$ about the credibility of the
outside option and the threshold belief $p_L$ thus play
decisive roles in determining the outcome of the bar-
gaining process. The threshold belief $p_L$ indicates the
degree to which the challenger may doubt the true 
intentions of the superpower before multilateral agree- 
ment breaks down. This critical value depends on three 
variables. First, the threshold belief decreases as the 
status quo point moves farther from the outcome point 
with outside action. Thus, for multilateral agreement to 
be achievable, the superpower’s outside options need 
to be more credible when the challenger has more to 
gain from preserving the status quo vis-a-vis the outside 
action. This result predicts that doubt about the threat 
of exercising outside options reduces the chance of 
obtaining UNSC authorization more in a case such as 
Kosovo, because Russia had a lot to gain from prevent- 
ing an intervention, than in a case such as Haiti, 
because neither Russia nor China had much at stake. 
In the latter instance it would be easier for the United 
States to “fool” Russia and China into adopting mul- 
tilateral action, even if the United States were not 
willing to act without UNSC authorization. 

Second, the smaller is $c_L$, the lower the threshold
belief. The presence of a domestic coalition with little 
willingness to compromise increases the likelihood that 
multilateral cooperation will collapse in the event of 
uncertainty about the superpower’s true intentions. If a 
multilateral deal is reached, however, it will be more 
favorable to the superpower when the value of $c_L$ is
low. Thus, there is a clear tradeoff between the ability 
to reach multilateral agreements, and the ability to 
reach favorable ones. 

Third, if the preferred outside option is bilateral 
rather than unilateral action, the closer the ally’s ideal 
point is to that of the superpower, the lower is the 
threshold belief. The intuition here is that when the 
ally and the superpower are close, there is little room
for the challenger to negotiate a favorable multilateral
compromise, so the challenger is more likely to accept 
the risk of outside action. This may partly explain why 
Russia and China did not veto any resolutions with 
respect to the Bosnian crisis when the United States 
and its allies disagreed considerably, but threatened to 
veto the intervention in Kosovo when NATO was more 
unified. For the superpower, the obvious disadvantage 
of a difference of opinion with its ally is that more 
extensive compromises are required to obtain agree- 
ment. This may weaken the size and scope of an 
intervention. 
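
The comparative statics of the threshold belief can be sketched numerically. The sketch below is an illustration under stated assumptions, not the model's own derivation: it assumes the challenger's utility is linear and increasing in x, so that the challenger accepts an offer only if it is at least as good as the lottery between the status quo (with probability equal to the belief that the threat is empty) and the outside-action outcome. The function and argument names are hypothetical.

```python
def threshold_belief(x_offer, x_status_quo, x_outside):
    """Largest belief that the threat is empty at which the offer is still accepted.

    Assumes linear challenger utility in x and
    x_outside <= x_offer <= x_status_quo with x_outside < x_status_quo.
    Acceptance requires x_offer >= pi * x_status_quo + (1 - pi) * x_outside.
    """
    if not (x_outside <= x_offer <= x_status_quo) or x_outside == x_status_quo:
        raise ValueError("expected x_outside <= x_offer <= x_status_quo, not all equal")
    return (x_offer - x_outside) / (x_status_quo - x_outside)


def outcome(pi, x_offer, x_status_quo, x_outside, superpower_is_low_cost):
    """Outcome pattern described in proposition 5, given the initial belief pi."""
    if pi <= threshold_belief(x_offer, x_status_quo, x_outside):
        return "multilateral agreement"
    return "outside action" if superpower_is_low_cost else "status quo"


# The farther the status quo lies from the outside-action outcome,
# the lower the threshold, so less doubt suffices for agreement to collapse.
near = threshold_belief(0.2, 0.4, 0.0)
far = threshold_belief(0.2, 0.8, 0.0)
print(near, far)
print(outcome(0.6, 0.2, 0.8, 0.0, superpower_is_low_cost=False))
```

Under these assumptions a status quo that is very valuable to the challenger (the Kosovo pattern) yields a low threshold, while a status quo close to the outside-action outcome (the Haiti pattern) yields a high one.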


THE KOSOVO CRISIS: UNCERTAINTY 
ABOUT THE CREDIBILITY OF OUTSIDE 
ACTION 


In the Kosovo case all conditions were present that 
decrease the threshold belief $p_L$ and thus increase the
likelihood that no multilateral compromise can be 
achieved if a veto player doubts the credibility of the 
outside option. At least one veto player (Russia) cared 
a great deal about preserving the status quo. The 
United States was internally divided about support for 
intervention, and the Clinton administration regarded 
UNSC authorization as highly desirable but not imper- 
ative (Daalder and O’Hanlon 2000, 36). Moreover, 
NATO members were not in complete agreement but 
were fairly close in terms of their preferred UN policy, 
certainly in comparison to the Bosnia crisis. Although 
the French were somewhat hesitant to participate in 
NATO action, President Chirac believed “the human- 
itarian situation constitutes a ground that can justify an 
exception to the rule” (quoted in Guicherd 1999, 8). 

In accord with proposition 5, these conditions estab-
lish that the threshold belief $p_L$ is low, which implies
that Russia’s doubt about the credibility of outside 
action need not be very great in order for UN agree- 
ment to collapse. I maintain that Russia had good 
reason to question NATO’s resolve and that NATO 
had difficulty communicating its true willingness to act 
without UNSC authorization. 

In early August 1998, the use of force by the West in 
the Kosovo crisis first became a realistic option when 
NATO approved concrete plans for military action and 
scheduled military exercises in Albania and Macedonia 
to warn Slobodan Milosevic (see Erlanger 1998; 
Reuters 1998). Yevgeni Primakov admitted that it was 
urgent not to allow a dangerous precedent: a unilateral 
armed operation by NATO without UN blessing and 
without concern for the Kremlin.33 Primakov wanted to
keep alive Moscow's opportunity to influence the 
course of armed intervention. Most NATO countries 
also preferred UNSC action to their own intervention. 
On September 23, UNSC resolution 1199 demanded 
that Milosevic stop the Kosovo offensive but did not 





33 See Yusin 1998; Primakov’s statements were made in relation to
the participation of a platoon of Russian paratroopers in the NATO
maneuvers in Albania.

mention the use of force, which allowed Russia to
support the resolution. Only China abstained, stating 
that the Kosovo crisis was an internal matter for 
Yugoslavia (Crossette 1998). 

On the eve of the adoption of this resolution, U.S. 
Secretary of Defense William S. Cohen was clearly 
concerned about the perceived credibility of the 
NATO threat to use force. He argued that the alliance 
would “rightly face severe criticism for issuing empty 
threats” in case it was not prepared to go forward 
(Myers 1998). When it became clear that resolution 
1199 was ineffective, the United States and its allies 
increasingly indicated their willingness to circumvent 
the UNSC. On October 8, NATO’s secretary general, 
Javier Solana, said: “NATO takes the decision on its 
own. We have finalized our military plan. We are ready 
to act. That is now clear” (Lockwood 1998). Later that 
month, renewed efforts to obtain UNSC agreement led 
to the adoption of resolution 1203 on October 24, 
which again fell short of authorizing the use of force, 
on Russia’s insistence (Ibrahim 1998). This led
several diplomats to doubt NATO’s willingness to act
outside the UN framework. Why would the resolution 
be necessary if the Western powers believed they had 
the right to act militarily anyway (Ibrahim 1998)? 

As the atrocities continued, the pressure upon the 
Western powers increased. In January and February 
1999, NATO issued more threats and prepared its 
troops for combat, without abandoning efforts to reach 
a UNSC deal. Some observers still questioned the 
credibility of NATO’s threat. Robert Kagan, a critic of 
the Clinton administration’s foreign policy, drew a 
comparison with earlier negotiations: “This is precisely 
what happened last October, when NATO last went 
through this complex little dance. And the reason is the 
same now as it was then. Neither the allies nor the 
Clinton administration actually want to go ahead with 
military action against Milosevic.”34 The UNSC reso-
lutions were heralded in the Russian press as a sign
that Russia still mattered as a force in world politics.35
Both the October 24 and the September 23 resolutions
essentially preserved the status quo, in that they re-
frained from authorizing the use of force in Kosovo.36
Primakov was convinced that Russia’s success in pre- 


34 Robert Kagan (1999a) repeated his doubts about the credibility of
NATO military action one month later. Kagan was not the only one
doubting the credibility of the NATO threats. See, for instance,
Krauthammer 1999. In the Russian press, see, for instance,
Petrovskaya 1999.

35 See, for example, LexisNexis 1998, p. 4. Foreign minister Igor
Ivanov proclaimed to the Duma: “In the Kosovo crisis, Russia has
demonstrated that it is a great power, that it is not going to make any
deals to get new [IMF] tranches and that it will not give up its
national interests.”

36 Chinese concerns were more normative and related to interfer-
ence in domestic affairs. Even though China threatened to use its
veto against any UNSC resolution that authorized military means in
Kosovo, it is generally assumed that its abstention could have been
acquired through the same means used in other cases (Bosnia,
Haiti): side payments and compromise.

venting an intervention was because NATO perceived
the alienation of Russia as too costly.37 Given the 
uncertainty about NATO’s willingness to circumvent 
the UNSC, it seems plausible that the Russians be- 
lieved they could continue to forestall action, but on 
March 24 NATO started its bombing campaign. 

The United States and its allies did not credibly 
communicate to Russia that they would act in the 
absence of UN approval. Russian agreement to UNSC 
authorization might have been obtained had the Rus- 
sians been convinced from the start that NATO’s 
threats were credible. Evidence for this interpretation 
is that in early March the Russians tried hard to regain 
some control of events. Foreign minister Ivanov trav- 
eled to Belgrade on March 12 to persuade Milosevic to 
accept a peacekeeping force, which the Russians had 
resisted thus far in the UNSC (Smirov 1999). Some 
newspaper reports even suggested that Russia was 
already cooperating with NATO by allowing its vessels 
to transport military supplies (Golotyuk 1999). After 
bombing began, Russia tried desperately to regain 
influence and even participated in the peacekeeping 
process under UNSC authorization, but its worst-case 
scenario had already materialized: NATO troops were 
in Yugoslavia (Levitin 2000). 


CONCLUSION 


My theoretical model not only accounts for observable 
patterns in UNSC decision making but also provides 
insights about how the dominant superpower and the 
unipolarity in the current system affect the opportuni- 
ties for and consequences of multilateral actions when 
states have conflicting preferences over outcomes. In 
the absence of a Pareto frontier, the first condition for 
multilateral action is that the United States be willing 
and able to act alone or with close allies. This cannot be 
taken as a given and depends strongly on how decision 
makers perceive the interests of the United States in a 
particular situation. A U.S. president who is reluctant 
to engage U.S. troops in foreign conflicts can pro- 
foundly affect the opportunities for multilateral action. 

Outside options may more generally be important to 
create multilateral agreements among veto players with 
conflicting interests. For example, Garrett (1992) 
shows that the threat by France and Germany to create 
a free trade zone with the Benelux countries proved to 
be a major incentive for British agreement on major 
institutional reforms in the Single European Act. An- 
other example is the current debate regarding missile 
defense systems. President Bush is threatening to uni- 
laterally break the Anti-Ballistic Missile treaty but is at 
the same time seeking support from the Europeans and 
the Russians to redesign this treaty multilaterally. 
Following the logic of my model, the attempt to 
achieve a multilateral compromise will depend on the 


37 See Sysoyev 1998, which quotes Primakov saying: “It obvious
now that there will be no strike... . We openly stated that if a strike
is dealt on Yugoslavia, our attitude to NATO will change drastically,
and there will be a lot of changes in our orientation. This was the
main argument which had had an effect.”

ability of the Americans to communicate that the
unilateral option is credible. Of course, not every 
multilateral action is the result of a threat to exercise 
an outside option. In the absence of credible outside 
options, agreement between sovereign states depends 
on the existence of a Pareto frontier and the bargaining 
mechanisms that facilitate achieving an outcome along 
that frontier. We may also have pure public good 
problems, such as relatively uncontroversial but costly 
interventions. A good illustration is the intervention in 
East Timor, which was made possible by the willing-
ness of Australia to shoulder a disproportionate
share.38 Yet, when there is sharp disagreement be- 
tween states with the ability to block a multilateral 
agreement, understanding their asymmetric abilities to 
achieve their objectives outside the multilateral frame- 
work becomes crucial. 

The second condition for multilateral action is that 
states with a credible outside option must have some 
incentive not to exercise it. In my model this is cap- 
tured by the assumption that it is costly for the 
superpower not to include other states in the winning 
coalition. If these costs did not exist, the superpower 
would have no reason to seek multilateral authoriza- 
tion and make compromises. In many situations this 
assumption is reasonable. UNSC authorization may be 
useful to the United States for burden sharing, or to 
lend legitimacy for domestic and international pur- 
poses. Similarly, there were advantages for France and 
Germany in a single market that included Britain 
(ceteris paribus). Moreover, the assumption about 
costs does not imply that multilateralism is a necessity 
for a powerful state, merely that it is desirable, and the 
extent of that desirability is an important variable in 
the model. It determines the likelihood and scope of 
compromises and, in case of uncertainty about the 
perceived benefits of multilateralism, it partly deter- 
mines the likelihood of unilateral action. To some 
degree the costs for circumventing the UNSC are 
self-induced. Therefore, it would be interesting to 
formulate a model in which the costs for circumventing 
a multilateral institution are determined endogenously, 
either as a function of domestic politics or by past 
experience and/or expectations about the future. This 
topic is left for future research. 


38 This intervention became a public good problem after Indonesia
invited a UN force to intervene, thus removing all international
conflict about the desirability of an intervention.

support for key empirical implications of the model.


How do changes in the inequality of income affect
political support for welfare policy? Starting
with the economic models of Romer (1975),
Roberts (1977), and Meltzer and Richard (1981), the 
conventional view is that increased inequality in pretax
earnings leads to greater political demand for redistrib- 
utive policies. The logic is simple and compelling. If the 
majority of the electorate receives a below-average 
income and if an increase in inequality causes above- 
average incomes to rise and below-average incomes to 
fall, then it is reasonable to think that demands for 
public policies to reduce the gap between rich and poor 
will increase. 

The argument of Romer (1975) and Meltzer and 
Richard (1981) is best illustrated by comparing two 
hypothetical lognormal income distributions with the 
same mean but different levels of inequality as shown 
in Figure 1. As the figure shows, the greater the 
variance of a distribution like the lognormal distribu- 
tion that is skewed to the right, the greater the gap 
between median and mean income. In the models of 
Romer (1975), Roberts (1977), and Meltzer and Rich- 
ard (1981), political competition drives the level of 
welfare spending toward the ideal point of the median 
income voter. The greater the gap between the pretax 
earnings of the median income voter and average 
(mean) income, the greater is the level of spending 
preferred by the median income voter and the higher is 
the equilibrium level of welfare spending. 
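
The relationship between variance and the median-mean gap can be checked with a short numerical sketch. It relies on the standard lognormal formulas (mean exp(mu + sigma^2/2), median exp(mu)); the function names and the two sigma values are hypothetical choices for illustration and are not taken from Figure 1.

```python
import math


def lognormal_mean_and_median(mu, sigma):
    """Mean and median of a lognormal distribution with log-scale parameters mu, sigma."""
    mean = math.exp(mu + 0.5 * sigma ** 2)
    median = math.exp(mu)
    return mean, median


def median_with_unit_mean(sigma):
    """Median income when mu is chosen so that mean income equals 1."""
    mu = -0.5 * sigma ** 2  # solves exp(mu + sigma^2 / 2) = 1
    _, median = lognormal_mean_and_median(mu, sigma)
    return median


# Two distributions with the same mean but different levels of inequality.
for sigma in (0.4, 0.8):
    print(sigma, median_with_unit_mean(sigma))
```

Holding the mean fixed, the larger sigma gives the lower median, which is the widening gap between median and mean income that drives the conventional result.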

The relationship between the inequality of pretax 
earnings and welfare expenditures is important be- 
cause it shapes our understanding of the relationship 
between political and economic equality. According to 
the conventional view, a change in the economic 
environment that causes the income distribution to 


Karl Ove Moene is Professor of Economics, P.O. Box 1095 Blindern,
University of Oslo, 0317 Oslo, Norway. Michael Wallerstein is 
Professor of Political Science, Northwestern University, Evanston, IL 
60208-1006. 

The authors thank three referees and the editor, Ada 
Finifter, for their suggestions. We also thank Jim Alt, David Austen- 


grow more unequal increases political support for 
redistributive policies. In other words, the public favors 
redistributive policies as the need for them increases. 
Although voters are assumed to care only about their 
own welfare, the result is a welfare policy that varies 
appropriately with the needs of the poor. In addition, if 
greater equality reduces the demand for redistributive 
policies and if those policies inhibit growth, then 
reduced income inequality promotes growth (Alesina 
and Rodrik 1994; Persson and Tabellini 1994). 

In this article, we demonstrate that a more complete 
theory leads to different conclusions. Support for some 
kinds of welfare spending may increase as inequality 
rises, but support for other kinds is lower when in- 
equality is higher. In particular, our framework implies 
that greater inequality in pretax earnings is associated 
with less, not more, spending on welfare policies 
targeted to people who have lost their market income 
because of layoffs, accidents, or ill health. Both theory 
and the data on welfare expenditures in 18 advanced 
industrial countries suggest that one political conse- 
quence of greater income inequality is less support for 
policies that constitute a significant share of the welfare 
budget. 

Our framework combines two different approaches 
to understanding the sources of political support for 
welfare policy. In the first view, as expressed in most 
economic models and the large literature in political 
science and sociology that emphasizes the political 
strength of the working class in cross-national studies 
of welfare spending, welfare policy is fundamentally 
about redistribution from rich to poor. Self-interested 
voters support welfare policy up to the point at which 
their gain from income redistribution matches their 
share of the cost. In the second view, the essence of 
welfare policy is the public provision of insurance, and 
self-interested voters support welfare policy to obtain 


1 The early literature on the role of social democratic parties and 
organized labor in the expansion of welfare policies is surveyed by 
Shalev 1983. For more recent studies, see Esping-Andersen 1990; 
Hicks 1999; Hicks and Swank 1992; Huber, Ragin, and Stephens 
1993; and Huber and Stephens 2001.

FIGURE 1. Two Hypothetical Lognormal Income Distributions with the Same Mean
and Different Levels of Inequality





protection against risks that private insurance markets 
fail to cover.2

These two views appear to have very different polit- 
ical implications. Critics frequently view welfare pro- 
grams as redistributive policies that distort incentives 
and reduce the efficiency of the economy. Supporters 
often argue, in contrast, that welfare policies provide 
insurance to all and enhance efficiency to the extent 
that the public sector protects against risks that are 
difficult or impossible to cover through private insur- 
ance markets. In fact, the policy implication of either 
view is not obvious. On the one hand, one might think 
that welfare policies are fundamentally redistributive 
but favor increased spending on the ground that the 
benefit of greater equality outweighs the efficiency loss. 
On the other hand, one might consider welfare policies 
as social insurance but think that the demand for 
insurance could be better satisfied by private firms. 

Our purpose is to investigate the contrast between 
the redistributive and the insurance views in terms of 
how inequality affects political support for welfare 
spending. Consider an increase in income inequality 
that lowers the income of the median voter but leaves 
mean income unchanged. In the redistributive model, 
the wider this gap, the more the median voter gains 
from welfare expenditures. In the insurance model, in 
contrast, the demand for insurance declines with in- 
come, holding risk constant, assuming that insurance is 
a normal good. If median-voter income decreases and 
the risks covered by social insurance do not change, 
then support for spending on social insurance will 
decline. 


2 For studies of social welfare as publicly provided insurance, see 
Barr 1992; Casamatta, Cremer, and Pestieau 2000; De Donder and 
Hindricks 2000; Sinn 1995; and Wright 1996.

In our judgment, both approaches provide essential
ingredients for an adequate understanding of the pol- 
itics of welfare policy. Social insurance policies com- 
prise a large part of the welfare budget. Even means- 
tested policies can be viewed as protection against the 
residual risk of income loss that social insurance poli- 
cies do not cover. At the same time, public insurance is 
commonly provided and financed in a manner that is 
redistributive ex ante, in that voters with lower ex- 
pected income receive insurance on more favorable 
terms than do voters with higher expected income. 
Thus, without specifying how the policy is designed, 
one cannot tell which aspect—redistribution or insur- 
ance—dominates in determining the effect of inequal- 
ity on support for welfare spending. 

Our article is related to two strands of the recent 
theoretical literature on the politics of welfare policy. 
The first consists of studies of how income or wealth 
inequality affects support for redistributive policies. 
This literature can be subdivided according to whether 
the emphasis is on the cost of redistributive policies 
(Moffitt, Ribar, and Wilhelm 1998; Rodriguez 1998; 
Saint-Paul 1998),3 voters’ empathy toward the poor or
the unlucky (Kristov, Lindert, and McClelland 1992;
Piketty 1995),4 or voters’ demand for insurance versus
redistribution (Bénabou 2000; this article).5 The sec-
ond strand of literature examines the effect of benefit
targeting on political support for welfare expenditures 
(Casamatta, Cremer, and Pestieau 2000; De Donder 
and Hindricks 1998, 2000; Gelbach and Pritchett 1997; 
Moene and Wallerstein 2001b). We bring these two 
strands together to examine (1) how benefit targeting 
alters the effect of income inequality on support for 
welfare expenditures and (2) how the inequality of 
income affects the share of the welfare budget targeted 
to different groups. Our mathematical framework is 
similar to the model Wright (1996) uses to study the 
effect of economic growth on welfare expenditures. 
Most recently, Iversen and Soskice (2001) apply the 
same framework to study how different types of train- 
ing affect welfare support. 

In the following section we develop a model in which 


government spending is characterized by two parame- 


3 Moffitt, Ribar, and Wilhelm (1998) argue that a reduction in the
earnings of low-wage workers increases the cost of welfare policies by
lowering the incentive of welfare recipients to find work. Saint-Paul
(1998) argues that if inequality increases due to a decline in the
income of the poor, the mean income may fall relative to the median,
which will increase the cost of redistribution to the median voter.
Rodriguez (1998) maintains that higher inequality increases the
ability of the rich to evade redistributive taxes by political lobbying.

4 Kristov, Lindert, and McClelland (1992) argue that voters are more
willing to support benefits for others like themselves. Thus, support
for welfare declines as the gap between the poor and the middle
grows. Piketty (1995) maintains that willingness to support redistrib-
utive policies depends on beliefs regarding the relative importance of
luck and effort in determining earnings. In Piketty’s model, a
negative income shock can shift the equilibrium in such a way that
support for redistributive policies may either increase or decrease,
depending on the status quo ante.

5 In Bénabou’s (2000) model, inequality and spending on egalitarian
policies that promote efficiency are simultaneously determined.
Bénabou does not consider differences in the targeting of welfare
benefits, which is the focus here.

ters: a tax rate that determines the level of aggregate
welfare spending and a distributive parameter that 
determines how welfare benefits are targeted. This 
framework encompasses both the redistributive and 
the insurance views of welfare policy, depending on the 
type of targeting. We then show how targeting can alter 
the influence of inequality on voters’ choice of the level 
of benefits. When the beneficiaries are predominantly 
persons who are employed, we obtain the conclusion of 
the redistribution models: When a rise in inequality 
reduces the income of the median voter relative to the 
mean, support for welfare expenditures increases. 
When the beneficiaries are those without earnings, 
however, the response to greater inequality is predicted 
by the insurance model: A reduction in the median 
income, holding the mean constant, reduces support 
for welfare expenditures. 

We also investigate the simultaneous choice of the 
level and targeting of benefits. When targeting is 
endogenous, benefits aimed at those without earnings 
decline as income distribution becomes more skewed. 
Thus, when increases in inequality reduce the income 
of the median relative to the mean, benefits targeted to 
those without earnings are reduced, both as a share of 
GDP and as a share of government spending. The 
situation regarding benefits targeted to the employed is 
more complicated. If income distribution is not too 
unequal, a majority of voters prefer all welfare pay- 
ments to be targeted to the unemployed. If the distri- 
bution of income is sufficiently skewed, benefits aimed 
at the employed are an increasing function of the 
skewness, as in the pure redistribution model. We test 
these propositions with data on welfare spending and 
the inequality of wages and salaries in eighteen ad- 
vanced industrial countries from 1980 to 1995. 


BASIC ASSUMPTIONS 


We present our theory in the context of a simple model 
of the economy that contains two essential ingredients: 
uncertainty regarding future income on the part of a 
significant fraction of the population and heterogeneity 
among voters in terms of both their income and the 
risks they face. We will assume that the population is 
divided into three groups. The share $\sigma_0$ is permanently
outside the labor market and has no income other than
transfer payments. The share $\sigma_L$ is the group of wage
earners who receive a wage of $w_L$ when employed. The
share $\sigma_H$ is the high-income group and receives $w_H$,
with $w_H > w_L$. We assume that the three groups
exhaust the population, so that $\sigma_0 + \sigma_L + \sigma_H = 1$.

Wage earners may be employed or not. We assume 
that the probability of employed wage earners losing 
their source of income (whether due to lay off, injury, 
or illness) within period $dt$ is $\alpha\,dt$. The probability that
workers who have lost employment will find a new job
within $dt$ is $\beta\,dt$. For simplicity, both $\alpha$ and $\beta$ are
assumed to be constants.6 The Markov process described
by the parameters $\alpha$ and $\beta$ converges to a


6 In a more general model, the probability of obtaining employment,
$\beta\,dt$, would be partly a matter of agents' efforts rather than a
steady state distribution of wage earners in which the
fraction $\beta/(\alpha + \beta)$ are working. Alternatively,
$\beta/(\alpha + \beta)$ denotes the fraction of time that each wage earner
expects to be employed in the long run. The high-
income group faces a lower risk of losing earnings than
do wage earners. For simplicity, we set the risk of job
loss for high-income earners to zero.7

The population without earnings consists of the
share who are permanently outside the labor market,
$\sigma_0$, plus a share who are temporarily without employment,
$[\alpha/(\alpha + \beta)]\sigma_L$; the workforce consists of
high-income earners, $\sigma_H$, plus the share of low-income
earners who are employed, $[\beta/(\alpha + \beta)]\sigma_L$. It simplifies
the notation to introduce a symbol
$e \equiv \sigma_H + [\beta/(\alpha + \beta)]\sigma_L$ for the employed share of the population.
We will assume that the majority of people are employed,
or $e > 1/2$. In addition, we assume that the
high-income group constitutes a minority, or $\sigma_H < 1/2$.
It follows that the employed wage earners are the
median income earners.

We represent fiscal policy with two parameters. The
first is the flat tax rate on earnings, $t$, that determines
aggregate government spending per capita, $T(t)$. We
write the requirement that tax receipts equal expenditures
as

$$T(t) = \tau(t)\,e\,\bar{w}, \tag{1}$$

where $\bar{w}$ is the average wage,
$\bar{w} \equiv (1/e)\{\sigma_H w_H + [\beta/(\alpha + \beta)]\sigma_L w_L\}$, and $\tau(t)$ represents tax revenues as
a share of earnings. The function $\tau(t)$ implicitly incorporates
the deadweight cost of taxation. We assume,
therefore, that $\tau(t)$ is a strictly concave function (the
deadweight cost of taxation rises at an increasing rate
as the tax rate rises), with $\tau'(0) = 1$ (there is no
deadweight cost when the tax rate is zero) and
$\tau(0) = \tau(1) = 0$ (tax revenues are zero when the tax rate is
zero or when taxes are confiscatory).8
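
The accounting behind equation 1 can be illustrated with a short numerical sketch. It is not part of the original analysis: the parameter values are arbitrary assumptions, and the quadratic revenue function $\tau(t) = t(1 - t)$ is chosen only because it satisfies the stated conditions ($\tau(0) = \tau(1) = 0$, $\tau'(0) = 1$, strict concavity).

```python
# Illustrative sketch of the population accounting and equation 1.
# All numerical values and the functional form of tau are assumptions.

def employed_share(sigma_L, sigma_H, alpha, beta):
    """e = sigma_H + [beta / (alpha + beta)] * sigma_L."""
    return sigma_H + beta / (alpha + beta) * sigma_L

def mean_wage(sigma_L, sigma_H, alpha, beta, w_L, w_H):
    """Average wage among the employed, w_bar."""
    e = employed_share(sigma_L, sigma_H, alpha, beta)
    employed_low = beta / (alpha + beta) * sigma_L
    return (sigma_H * w_H + employed_low * w_L) / e

def tau(t):
    """Assumed revenue function: tau(0) = tau(1) = 0, tau'(0) = 1, concave."""
    return t * (1.0 - t)

def spending_per_capita(t, e, w_bar):
    """Equation 1: T(t) = tau(t) * e * w_bar."""
    return tau(t) * e * w_bar

sigma_0, sigma_L, sigma_H = 0.1, 0.6, 0.3   # population shares, summing to one
alpha, beta = 0.1, 0.9                      # job-loss and job-finding rates
w_L, w_H = 1.0, 2.0                         # wages of the two employed groups

e = employed_share(sigma_L, sigma_H, alpha, beta)
w_bar = mean_wage(sigma_L, sigma_H, alpha, beta, w_L, w_H)

# The two majority assumptions of the model: e > 1/2 and sigma_H < 1/2.
median_is_wage_earner = e > 0.5 and sigma_H < 0.5
print(e, w_bar, spending_per_capita(0.1, e, w_bar), median_is_wage_earner)
```

With these assumed values the employed wage earners are the median income group, which is the case the model requires.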

The second policy parameter, $\gamma$, represents the share
of welfare spending received by employed persons. The
remaining share, $(1 - \gamma)$, is assumed to go to programs
aimed at those without earnings. Thus, the posttax and
transfer consumption of a person with a pretax income
of $w$ is

$$c_E(w) = (1 - t)w + \frac{\gamma T(t)}{e}, \tag{2}$$

where $\gamma T(t)/e$ is the welfare benefit received by each
employed person. The consumption of those without
earnings is

$$c_N = \frac{(1 - \gamma)T(t)}{1 - e}. \tag{3}$$


parameter. Making $\beta$ endogenous is discussed briefly in a later
section.
7 We discuss the effect of relaxing this assumption below.


over z 
could assume that hours worked are fixed and that $t - \tau(t)$
represents the costs of collecting taxes, with the cost assumed to be 
an increasing, convex function of t. 
FIGURE 2. The Distribution of Income (figure labels: Permanently Out of Work; Temporarily Out of Work; the share of the population who are employed)


Implicit in equation 3 is an assumption that all persons
without earnings receive the same benefit, regardless of
their history of employment or earnings.9 If $\gamma = 0$, then
welfare policy is targeted at those without work. If $\gamma = 1$,
then the benefits go exclusively to those with earnings.
(We assume throughout that $0 \le \gamma \le 1$.) A
universalistic policy that pays the same benefit to all,
regardless of employment status, is implied by $\gamma = e$.
Our assumptions regarding the distribution of pre- and
posttax and transfer income are summarized in Figure
2.
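
As a hedged illustration of equations 2 and 3, the sketch below evaluates the benefit paid per person under different values of $\gamma$. The numbers, the function names, and the revenue function $\tau(t) = t(1 - t)$ are assumptions made for the example only.

```python
# Sketch of equations 2 and 3 under assumed parameter values.

def tau(t):
    """Assumed revenue function (strictly concave, tau(0) = tau(1) = 0)."""
    return t * (1.0 - t)

def consumption_employed(w, t, gamma, e, w_bar):
    """Equation 2: c_E(w) = (1 - t) w + gamma T(t) / e."""
    T = tau(t) * e * w_bar
    return (1.0 - t) * w + gamma * T / e

def consumption_not_employed(t, gamma, e, w_bar):
    """Equation 3: c_N = (1 - gamma) T(t) / (1 - e)."""
    T = tau(t) * e * w_bar
    return (1.0 - gamma) * T / (1.0 - e)

e, w_bar, t = 0.84, 1.14 / 0.84, 0.2   # assumed values

# Setting w = 0 in equation 2 isolates the benefit paid to each employed person.
for gamma in (0.0, e, 1.0):
    benefit_employed = consumption_employed(0.0, t, gamma, e, w_bar)
    benefit_not_employed = consumption_not_employed(t, gamma, e, w_bar)
    print(gamma, benefit_employed, benefit_not_employed)
```

The middle case, $\gamma = e$, pays the same benefit to both groups, which is the universalistic policy described in the text.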


We also assume that all individuals have identical
preferences over consumption, described by a standard
utility function, $u(c)$, with the following characteristics:
(1) $u''(c) < 0$, (2) $u'(c) \to \infty$ as $c \to 0$, and
(3) $\mu \equiv -cu''(c)/u'(c) > 1$. Assumption 1 states that individuals
are risk averse. Assumption 2 means that individuals
always want some insurance to cover a nonnegligible
risk that they may have nothing. Assumption 3
implies that insurance is a normal good or that the
demand for insurance increases as income rises. Empirical
estimates of $\mu$, usually called the coefficient of
relative risk aversion, consistently conclude that $\mu \ge 1$
(Friend and Blume 1975). We assume $\mu > 1$ to simplify
our discussion. How the description of the results


9 Such an assumption is stronger than necessary. All the results go
through in a more general model in which the benefits targeted to 
those without earnings partly depend on past wages or contributions 
as long as there is some minimum benefit that everyone without 
earnings receives. 
would have to be modified to encompass the borderline
case of $\mu = 1$ is easily seen from the mathematics.

Assuming that individuals live forever, the expected 
lifetime utility for a wage earner can be derived from 
the asset equations: 


$$rV^E = u(c_E(w)) - \alpha(V^E - V^N), \tag{4}$$

$$rV^N = u(c_N) + \beta(V^E - V^N), \tag{5}$$

where $V^E$ is the expected lifetime utility of a person
currently employed, $V^N$ is the expected lifetime utility
of a person temporarily not employed, $u(c_i)$ is the
instantaneous utility of consumption when employed
($i = E$) or when not employed ($i = N$), and $r$ is the
discount rate.10 Equations 4 and 5 can be solved for the
expected lifetime utilities of starting out in the two
different states. We will concentrate on the expected





10 To understand equation 4, observe that lifetime expected utility
(for individuals who live forever) can be written as the sum of current
utility during period $dt$ plus expected lifetime utility one period in the
future, discounted by the discount factor $e^{-r\,dt}$:
$V^E = u(c_E)\,dt + e^{-r\,dt}[(\alpha\,dt)V^N + (1 - \alpha\,dt)V^E]$. Future expected lifetime utility
equals the expected lifetime utility of someone without employment
with probability $\alpha\,dt$. With probability $(1 - \alpha\,dt)$, lifetime utility
remains $V^E$. Rearranging terms, letting $dt \to 0$, and using
the fact that $(1 - e^{-r\,dt})/dt \to r$ as $dt \to 0$ yields equation 4. The
derivation of equation 5 is similar. The assumption that individuals
live forever can be relaxed by replacing $r$ with $r/(1 - e^{-rH})$ in
equations 4 and 5, where $H$ is the voter's life expectancy. (We thank
an anonymous referee for this observation.)
lifetime utility of employed wage earners, which is
conveniently written as

$$rV^E = \left(\frac{\beta + r}{\alpha + \beta + r}\right)u(c_E(w_L)) + \left(\frac{\alpha}{\alpha + \beta + r}\right)u(c_N). \tag{6}$$


Equation 6 indicates that the expected lifetime utility 
of an employed wage earner consists of a weighted 
average of expected utility in the two states, with the 
current state of being employed weighted more heavily, 
the greater is the discount rate. Self-interested workers 
care about the benefits received by the unemployed 
because of the chance that they may be without em- 
ployment sometime in the future. Of course, voters 
may care about those without earnings out of altruism 
as well as self-interest. In this case, the parameter r 
might be interpreted as reflecting concern for the poor 
as well as concern for the future. The lower is r, the 
greater the weight given to the welfare of those without 
earnings in wage earners’ choice of how to cast their 
ballots. 
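
The step from equations 4 and 5 to equation 6 can be checked numerically. The sketch below solves the two asset equations as a linear system by Cramer's rule and compares the result with the weighted average in equation 6. The utility levels and rates are arbitrary assumed values, and the function names are introduced here for illustration.

```python
# Numerical check that equation 6 solves the asset equations 4 and 5.
# Inputs are arbitrary assumed values.

def lifetime_utilities(u_E, u_N, alpha, beta, r):
    """Solve equations 4 and 5 for (V_E, V_N).

    Rewritten as a linear system:
        (r + alpha) V_E - alpha V_N      = u_E
        -beta V_E       + (r + beta) V_N = u_N
    """
    det = (r + alpha) * (r + beta) - alpha * beta   # equals r (r + alpha + beta)
    V_E = ((r + beta) * u_E + alpha * u_N) / det
    V_N = ((r + alpha) * u_N + beta * u_E) / det
    return V_E, V_N

def employed_weight(alpha, beta, r):
    """Weight on the employed state in equation 6."""
    return (beta + r) / (alpha + beta + r)

def equation_6(u_E, u_N, alpha, beta, r):
    """Right-hand side of equation 6, which equals r * V_E."""
    weight = employed_weight(alpha, beta, r)
    return weight * u_E + (1.0 - weight) * u_N

u_E, u_N = -1.0, -2.0              # e.g. u(c) = -1/c at c_E = 1 and c_N = 0.5
alpha, beta, r = 0.1, 0.9, 0.05

V_E, V_N = lifetime_utilities(u_E, u_N, alpha, beta, r)
print(r * V_E, equation_6(u_E, u_N, alpha, beta, r))
```

The weight on the employed state, $(\beta + r)/(\alpha + \beta + r)$, rises with $r$, which is the sense in which a higher discount rate puts more weight on the current state.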




CHOOSING THE LEVEL OF BENEFITS WITH 
EXOGENOUS TARGETING 


We first investigate the political choice of the level of
benefits when targeting is fixed. The investigation of
the choice of $t$ for a fixed $\gamma$ provides a general
framework in which the contrasting predictions of the
two models of welfare spending (as redistribution and
as public insurance) can be compared and shown to
depend on how benefits are targeted. In addition, the
model of choosing $t$ for a fixed $\gamma$ may be applicable in
circumstances in which changing the funding level is
politically easier than altering a program's design.

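With $\gamma$ fixed, the level of taxation and benefits
preferred by wage earners is given by the first-order
condition

$$r\,\frac{dV^E}{dt} = \left(\frac{\beta + r}{\alpha + \beta + r}\right)u'(c_E)\,[\gamma\tau'(t)\bar{w} - w_L] + \left(\frac{\alpha}{\alpha + \beta + r}\right)u'(c_N)\left(\frac{1 - \gamma}{1 - e}\right)\tau'(t)\,e\,\bar{w} = 0 \tag{7}$$

or, by rearranging,

$$\left(\frac{\beta + r}{\alpha}\right)\frac{u'(c_E)}{u'(c_N)} - \left(\frac{e}{1 - e}\right)\left[\frac{(1 - \gamma)\tau'(t)\bar{w}}{w_L - \gamma\tau'(t)\bar{w}}\right] = 0. \tag{8}$$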


The first term in equation 8 represents the marginal 
rate of substitution between consumption when em- 
ployed and consumption when unemployed, and the 
second term represents the marginal rate at which 
income can be transferred via the welfare system from
a worker's earnings when employed to income when
not employed. The strict concavity of $u(c)$ and $\tau(t)$
guarantees that the second-order condition for a maximum
is satisfied. In this model, all groups have
single-peaked preferences. Therefore, we can identify
the political equilibrium as the preferred tax rate of the
median group of income recipients, or the value of $t$
that solves equation 8.

From equation 8 it is apparent that a decrease in the
discount rate $r$ or an increase in voters' altruism
induces voters to raise $u'(c_E)/u'(c_N)$ or to increase the
redistribution of income from $c_E$ to $c_N$. Conversely, a
rise in the cost of taxation, as represented by a decrease
in the marginal tax yield $\tau'(t)$, induces voters to lower
$u'(c_E)/u'(c_N)$ or to reduce the redistribution of income
from $c_E$ to $c_N$.11 It is sometimes argued that the
more policy benefits are targeted to the majority with
earnings, the higher is the level of political support.12
Within our framework, this argument is partially correct.
Differentiating equation 7 with respect to $\gamma$ yields
$dt^*/d\gamma > 0$, where $t^*$ is the median group's preferred
tax rate. Thus, the more benefits targeted to the
employed, the higher is the benefit level the employed
majority will support.
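
The claim that $dt^*/d\gamma > 0$ can be illustrated numerically. The sketch below solves the first-order condition (equation 7) by bisection for three values of $\gamma$ below one. Everything specific in it is an assumption made for the example: constant relative risk aversion utility with $\mu = 2$, the revenue function $\tau(t) = t(1 - t)$, and the parameter values.

```python
# Sketch: solve the first-order condition (equation 7) for the preferred tax
# rate t* at several fixed values of gamma. Functional forms and numbers are
# illustrative assumptions, not taken from the article.

ALPHA, BETA, R = 0.1, 0.9, 0.05   # job-loss rate, job-finding rate, discount rate
E = 0.84                          # employed share of the population
W_BAR = 1.14 / 0.84               # mean wage among the employed
MU = 2.0                          # coefficient of relative risk aversion (> 1)

def tau(t):
    return t * (1.0 - t)

def tau_prime(t):
    return 1.0 - 2.0 * t

def marginal_utility(c):
    return c ** (-MU)

def first_order_condition(t, gamma, w_L):
    """Equation 7 up to the positive factor 1 / (alpha + beta + r)."""
    c_E = (1.0 - t) * w_L + gamma * tau(t) * W_BAR
    c_N = (1.0 - gamma) * tau(t) * E * W_BAR / (1.0 - E)
    cost = (BETA + R) * marginal_utility(c_E) * (gamma * tau_prime(t) * W_BAR - w_L)
    gain = (ALPHA * marginal_utility(c_N) * (1.0 - gamma)
            * tau_prime(t) * E * W_BAR / (1.0 - E))
    return cost + gain

def preferred_tax(gamma, w_L, lo=1e-9, hi=0.5, steps=200):
    """Bisection on (0, 1/2); valid for gamma < 1, where the condition is
    positive near zero and negative at t = 1/2."""
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        if first_order_condition(mid, gamma, w_L) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

rates = [preferred_tax(g, 1.0) for g in (0.0, 0.25, 0.5)]
print(rates)
```

Under these assumptions the preferred tax rate rises as a larger share of spending is directed to the employed. The sketch deliberately stays away from $\gamma = 1$, where $c_N = 0$ and marginal utility is unbounded.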

An increase in the funding of benefits may not
benefit the poor, however, if the benefits are targeted
more broadly. The question of whether less targeting to
the poor benefits the poor concerns the sign of

$$\frac{dc_N}{d\gamma} = \left(\frac{e\bar{w}}{1 - e}\right)\left[(1 - \gamma)\tau'(t^*)\frac{dt^*}{d\gamma} - \tau(t^*)\right],$$

which can be either positive or negative, depending on
the concavity of the function $\tau(t)$. It is straightforward
to show that $dc_N/d\gamma > 0$ when the deadweight cost of
taxation is negligible (i.e., when $\tau(t) \approx t$), whereas
$dc_N/d\gamma < 0$ when the deadweight cost of a marginal
increase in taxation increases rapidly (i.e., when $|\tau''(t)|$
is sufficiently large).

Our topic, however, is how changes in income inequality
affect support for welfare expenditures. Consider
the effect of a mean-preserving spread in the
wage distribution, that is, an increase in $w_H$ and a
reduction in $w_L$, such that the average wage remains
constant. To investigate the importance of a change in
$w_L$ on wage earner support for welfare, holding $\bar{w}$
constant, define

$$\Psi(w_L, \gamma) \equiv u'(c_E)\,[\gamma\tau'(t)\bar{w} - w_L]$$

as the part of equation 7 that depends on $w_L$. The sign
of $dt^*/dw_L$ is the same as the sign of

$$\frac{\partial\Psi}{\partial w_L} = u'(c_E)\,[\mu\varepsilon - 1], \tag{9}$$

where

$$\varepsilon \equiv \frac{dc_E}{d(1 - t)}\,\frac{(1 - t)}{c_E} = \frac{(1 - t)[w_L - \gamma\tau'(t)\bar{w}]}{(1 - t)w_L + \gamma\tau(t)\bar{w}}$$

is the elasticity of consumption when working with
respect to $1 - t$.13 Since $\mu > 1$ and $\varepsilon \le 1$, the sign of
$\partial\Psi/\partial w_L$ in equation 9 is not clear.


11 The effect of deadweight costs on political support for redistributive
policies is emphasized in Becker 1983, 1985.

12 See, for example, the exchange between Skocpol (1991) and 
Greenstein (1991). 

13 To derive equation 9, start with 
There are two special cases in which the sign of
$\partial\Psi/\partial w_L$ is immediate, however. The first is the case of
$\gamma = 1$, in which the employed receive all the benefits. If
$\gamma = 1$, we have $w_L - \gamma\tau'(t)\bar{w} = 0$ from equation 7, which
implies that $\varepsilon = 0$. In this case, equation 9 reduces to

$$\frac{\partial\Psi(w_L, 1)}{\partial w_L} = -u'(c_E) < 0. \tag{10}$$

Thus, workers with lower wages prefer higher benefits
($dt^*/dw_L < 0$) when benefits are targeted at the
employed.

The second special case is $\gamma = 0$, in which benefits
are targeted exclusively at those without earnings.
Since $\gamma = 0$ implies that $\varepsilon = 1$, equation 9 reduces to

$$\frac{\partial\Psi(w_L, 0)}{\partial w_L} = u'(c_E)(\mu - 1) > 0. \tag{11}$$

Thus, an increase in the earnings of the median income
group, $w_L$, increases their preferred benefit level
($dt^*/dw_L > 0$) when benefits are aimed at those without
earnings.

The effect of increased inequality on political sup- 
port for welfare spending is summarized in the follow- 
ing proposition. 
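PROPOSITION 1. A mean-preserving increase in inequality
that lowers the income of the median voter (i) reduces the
median voter's preferred level of benefits when benefits are
targeted to those without earnings and (ii) increases the median
voter's preferred level of benefits when benefits are
targeted to the employed.


Proof: According to equation 11, a decline in the
median voter's wage reduces his or her demand for
welfare benefits if $\gamma = 0$. According to equation 10, a
fall in the median voter's wage increases his or her
demand for welfare benefits if $\gamma = 1$. Since $\Psi(w_L, \gamma)$
is continuous in $\gamma$, the conclusions hold for $\gamma$ near zero
and $\gamma$ near one as well.14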
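
The two polar cases of proposition 1 can be illustrated with a small numerical experiment. The sketch below lowers $w_L$ while holding $\bar{w}$ fixed and recomputes the preferred tax rate for $\gamma = 0$ and $\gamma = 1$. The functional forms (utility with $\mu = 2$, $\tau(t) = t(1 - t)$) and all numbers are assumptions. For $\gamma = 1$ the insurance term of equation 7 is dropped, following the text's treatment of that special case.

```python
# Sketch of proposition 1: effect of a mean-preserving fall in w_L on t*
# for gamma = 0 and gamma = 1. Forms and numbers are illustrative assumptions.

ALPHA, BETA, R = 0.1, 0.9, 0.05
SIGMA_H, EMPLOYED_LOW = 0.3, 0.54   # high-income share; employed wage earners
E = SIGMA_H + EMPLOYED_LOW          # employed share, 0.84
W_BAR = 1.14 / 0.84                 # mean wage, held constant throughout
MU = 2.0

def tau(t):
    return t * (1.0 - t)

def tau_prime(t):
    return 1.0 - 2.0 * t

def marginal_utility(c):
    return c ** (-MU)

def first_order_condition(t, gamma, w_L):
    """Equation 7 up to the positive factor 1 / (alpha + beta + r)."""
    c_E = (1.0 - t) * w_L + gamma * tau(t) * W_BAR
    cost = (BETA + R) * marginal_utility(c_E) * (gamma * tau_prime(t) * W_BAR - w_L)
    if gamma >= 1.0:
        # Nothing is paid to those without earnings, so only the first
        # term of equation 7 remains (the text's special case gamma = 1).
        return cost
    c_N = (1.0 - gamma) * tau(t) * E * W_BAR / (1.0 - E)
    gain = (ALPHA * marginal_utility(c_N) * (1.0 - gamma)
            * tau_prime(t) * E * W_BAR / (1.0 - E))
    return cost + gain

def preferred_tax(gamma, w_L, lo=1e-9, hi=0.5, steps=200):
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        if first_order_condition(mid, gamma, w_L) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def high_wage_for_constant_mean(w_L):
    """w_H that keeps the mean wage at W_BAR when w_L changes."""
    return (E * W_BAR - EMPLOYED_LOW * w_L) / SIGMA_H

for gamma in (0.0, 1.0):
    before = preferred_tax(gamma, 1.0)   # w_L = 1.0, w_H = 2.0
    after = preferred_tax(gamma, 0.9)    # w_L = 0.9, w_H = 2.18
    print(gamma, before, after)
```

In this example the fall in $w_L$ lowers $t^*$ when $\gamma = 0$ and raises it when $\gamma = 1$, matching parts (i) and (ii) of the proposition.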

A mean-preserving spread of the pretax income 
distribution has two effects on the choice of benefits. 
On the one hand, an increase in inequality represents a 
decline in income for workers with income below the 
mean. The wage reduction increases employed work- 
ers’ resistance to paying taxes to finance benefits for 
those not working. On the other hand, greater inequal- 
ity lowers the ratio of the median voter’s income to 
mean income, thereby lowering the tax that must be 
paid by low-wage workers to finance a given level of 
benefits. A reduction in the price of providing benefits 
increases the willingness of low-wage voters to support 
higher benefit levels. Thus, in addition to an income 
effect that leads the median voter to reduce his or her 


$$\frac{\partial\Psi}{\partial w_L} = u''(c_E)(1 - t)[\gamma\tau'(t)\bar{w} - w_L] - u'(c_E)$$

and use the definition $\mu \equiv -c_E u''(c_E)/u'(c_E)$ to write
$u''(c_E) = -\mu u'(c_E)/c_E$. Equation 9 follows immediately.
14 We conjecture that $\varepsilon$ is generally a monotonic declining function
of $\gamma$, which would imply the existence of a unique $\hat{\gamma} \in (0,1)$ such that
$dt^*/dw_L > 0$ for $\gamma < \hat{\gamma}$ and $dt^*/dw_L < 0$ for $\gamma > \hat{\gamma}$.
preferred level of expenditures, there is a substitution
effect that works in the opposite direction. 

Alternatively, the two effects can be described as a
redistribution effect and an insurance effect. For any
value of $\gamma < 1$, welfare policy both redistributes income
and provides insurance. A rise in inequality increases
the redistributive effect of the welfare system, to the
benefit of workers with below-average income. At the
same time, an increase in inequality, holding average
income constant, implies a reduction in the income of
workers whose incomes are below average. Voters
whose wages decline prefer to reduce the amount of
insurance they buy. Which effect dominates depends
on the coefficient of relative risk aversion, $\mu$, and on
the targeting of welfare benefits, $\gamma$.

Consider first the effect of voters' risk aversion for a
fixed value of $\gamma$. If $\mu = 1$, that is, if voters' risk aversion
is at its lower bound, then the insurance benefit
provided by the welfare system is less important, and
greater inequality increases the median voter's preferred
tax rate for all $\gamma > 0$. If $\mu$ is sufficiently large,
that is, if voters are sufficiently risk averse, then the
insurance aspect of welfare dominates, in which case
greater inequality lowers the median voter's preferred
tax rate.

Alternatively, for a fixed $\mu > 1$, whether the redistributive
or the insurance effect dominates can be
understood as a function of benefit targeting. When
benefits are mostly paid to the employed (when $\gamma$ is
close to one), the redistributive aspect dominates, and
the preferred benefit level of the median income
earner increases as inequality grows. This is the case
described by the standard redistribution model. When
benefits are mostly paid to those without earnings
(when $\gamma$ is close to zero), however, the insurance aspect
dominates, and the preferred benefit level of the
median income earner declines as inequality increases.

Equation 11 implies that, in comparing different 
countries with similar average income and similar 
distribution of the risk of income loss, support for 
spending on benefits targeted to the unemployed rises 
as the skewness of the income distribution declines. 
Equation 11 does not imply that support for spending 
on benefits aimed at the unemployed is a positive 
function of income when comparing the preferences of
voters located at different points in the income distribution.
When the risk of job loss is correlated with
income, such that low-income voters face a greater
probability of income loss than high-income voters, the
relationship between a voter's position in the income
distribution and support for spending on policies targeted
to the unemployed can go either way. In our
simple model, high-income voters are assumed to face
no risk of income loss, so they prefer less welfare
spending than low-income voters for all values of $\gamma$ and
$\mu$.

The model in this section can be generalized in a 
variety of ways without altering the conclusions. The 
simplifying assumption that there are only two types of 
workers, lower paid and higher paid, can be replaced 
by the assumption of any finite number of types, or 
even a continuum of types. The assumption that only
wage earners are subject to the risk of income loss can
be replaced by assuming a general distribution of this
risk. As long as the risk for workers with lower income
is the same as or greater than the risk for workers with
higher income, all the results of this section remain
unchanged. Also, one might consider a different envi- 
ronment in which wages each period are random draws 
from a known wage distribution. In this case, one can 
investigate the effect of an increase in uncertainty 
regarding future wages, holding expected wage and risk 
of job loss constant. The results are similar. If welfare 
benefits are targeted to the employed, then an increase 
in uncertainty regarding future wages raises the pre- 
ferred level of benefits of the median voter. If welfare 
benefits are targeted to those without employment, an
increase in uncertainty regarding future wages reduces
the median voter's preferred level of benefits.15

The assumption that the probability of obtaining
employment, $\beta\,dt$, is not affected by changes in taxes
and benefits could be relaxed to allow $\beta$ to be a
function of the difference in welfare between those
with and without earnings. The result would be to
introduce an additional mechanism whereby increased

inequality causes reduced welfare spending; as de- 
scribed by Moffitt, Ribar, and Wilhelm (1998), voters 
cut benefits to the poor as wages fall in order to restore 
work incentives. Another extension would allow self- 
insurance (or saving) to compete with public insurance 
against income loss. The possibility of saving would 
enable the share of the population with no need for 
social insurance to be made endogenous. Such an 
extension might introduce a third reason for the asso- 
ciation of greater inequality with lower welfare spend- 
ing insofar as greater inequality increases the relative 
share of the electorate who prefer self-insurance to 
public insurance. 


CHOOSING BOTH BENEFIT LEVELS AND 
TARGETING 


The targeting of benefits is as much a political decision 
as the level of welfare spending. Thus, a general model 
of the politics of welfare must include the political 
choice of targeting. Consideration of a second dimen- 
sion of political choice is made difficult, however, by 
the general absence of a majority rule equilibrium in 
two dimensions without additional assumptions about 
the political process. We begin by characterizing the 
optimal policy of the median income group, as in the 
previous section. We then describe two alternative 
models of the political process, both of which imply 
that the policy preferred by the median income group 
constitutes the political equilibrium in the context of 
our particular model. 

15 See Moene and Wallerstein 1998 for details on the mathematical
analysis of the effect of an increase in uncertainty on support for
welfare spending.

We concentrate, as before, on employed wage earners
who receive $w_L$ and face the risk $\alpha\,dt$ of losing their
source of earnings within period $dt$. Wage earners'
ideal policy is the combination of $t$ and $\gamma$ that maximizes
their expected lifetime utility (equation 6), subject
to the constraint that $\gamma \ge 0$ or that total benefits
paid to those without employment, $(1 - e)c_N$, cannot
exceed total expenditures, $T(t)$. The constraint that
$\gamma \le 1$, or that $c_N \ge 0$, is never binding, given our
assumption that $u'(c) \to \infty$ as $c \to 0$. The first-order
conditions for the solution can be written as


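$$\left(\frac{\beta + r}{\alpha}\right)\frac{u'(c_E)}{u'(c_N)} - \left(\frac{e}{1 - e}\right)\left[\frac{(1 - \gamma)\tau'(t)\bar{w}}{w_L - \gamma\tau'(t)\bar{w}}\right] = 0, \tag{12}$$

$$\left(\frac{\beta + r}{\alpha}\right)\frac{u'(c_E)}{u'(c_N)} - \left(\frac{e}{1 - e}\right) \le 0, \quad \text{with equality if } \gamma > 0. \tag{13}$$

The first-order condition with respect to $t$, equation 12,
is identical to equation 8. We now must consider the
first-order condition with respect to $\gamma$ as well.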
Equation 13 indicates that there are two cases to be
considered. In the first, the constraint is not binding, or
the optimal choice of $\gamma$, denoted $\gamma^*$, is greater than
zero. In this case, the first-order conditions can be
written as

$$\tau'(t)\bar{w} - w_L = 0, \tag{14}$$

$$\left(\frac{\beta + r}{\alpha}\right)\frac{u'(c_E)}{u'(c_N)} - \left(\frac{e}{1 - e}\right) = 0. \tag{15}$$

Equation 14 determines the optimal tax rate as the rate
at which the marginal revenue gain from an increase in
the tax rate, $\tau'(t)\bar{w}$, just equals the marginal cost to
wage earners, $w_L$, in line with the pure redistribution
model. The optimal tax is zero when $w_L = \bar{w}$, since
$\tau'(0) = 1$, and it rises as $w_L/\bar{w}$ declines, since

$$\frac{dt^*}{dw_L} = \frac{1}{\tau''(t)\bar{w}} < 0. \tag{16}$$


With the optimal tax rate determined by equation 14, 
the optimal allocation of tax revenues between benefits 
targeted to those with and without earnings is given 
implicitly by equation 15. Equation 15 represents the 
standard condition that the marginal rate of substitu- 
tion between consumption when employed and con- 
sumption when not employed must equal the cost of 
transferring income from a worker’s earnings when 
employed to income when not employed. For a fixed 
welfare budget, the cost of transferring benefits from 
those without earnings to the employed is the relative 
size of the two groups, $e/(1 - e)$. Equation 15
indicates that a change in the distribution of income
that causes a decline in $w_L$ must be matched by a
decline in the benefits received when not employed to
keep the ratio $(\beta + r)u'(c_E)/\alpha u'(c_N)$ unchanged:

$$\frac{dc_N}{dw_L} = \frac{(1 - t)[e/(1 - e)](\beta + r)u''(c_E)}{(\beta + r)u''(c_E) + [e/(1 - e)]^2\alpha u''(c_N)} > 0. \tag{17}$$
FIGURE 3. Preferred Policy of Employed Wage Earners (axis label: Welfare Spending)


It follows from equations 16 and 17 that

$$\frac{d\gamma^*}{dw_L} = \frac{1}{T(t)}\left[(1 - \gamma)\frac{e\,\tau'(t)}{\tau''(t)} - (1 - e)\frac{dc_N}{dw_L}\right] < 0.$$

Employed workers who suffer a decline in earnings
prefer a partial offset of the wage reduction through an
increase in the benefits targeted to themselves.

The second case to consider is the binding constraint,
or when $\gamma^* = 0$. In this case, the first-order
condition with respect to $t$ simplifies to

$$\left(\frac{\beta + r}{\alpha}\right)\frac{u'(c_E)}{u'(c_N)} - \left(\frac{e}{1 - e}\right)\frac{\tau'(t)\bar{w}}{w_L} = 0.$$

Wage earners would like to lower $t$ and raise money
with a lump-sum tax (i.e., set $\gamma$ below zero), but
lump-sum taxes are ruled out by the constraint. Therefore,
wage earners prefer to transfer less money from
$c_E$ to $c_N$ than they would if lump-sum taxes were
possible. From proposition 1, we know that $dt^*/dw_L > 0$
when $\gamma = 0$.

In order to visualize the wage earners' optimal
policy, it is helpful to rewrite the policy choice as a
choice of aggregate expenditures, $T(t)$, and a choice of
the total transfers that are disbursed to those without
earnings, $(1 - e)c_N$. These choices are graphed in
Figure 3. The curve $T(t^*)$ represents wage earners'
unconstrained optimal aggregate welfare expenditures,
which decline as $w_L$ increases. The curve $(1 - e)c_N$
represents the unconstrained optimum with respect to
the benefits targeted to those without earnings. This
curve is an increasing function of $w_L$ from equation 17.
Since $T(t^*) = 0$ when $w_L = \bar{w}$, whereas $(1 - e)c_N$ is
always positive and increasing in $w_L$, the two curves
must cross at a wage level below $\bar{w}$, denoted $w_0$ in the
figure. If $w_L < w_0$, wage earners' optimal choice of
benefits targeted to themselves is given by the difference
between $T(t^*)$ and $(1 - e)c_N$. For $w_L \ge w_0$, the
constraint that $\gamma \ge 0$ or that $T(t) \ge (1 - e)c_N$ binds.
The constrained optimum with $\gamma^* = 0$ or
$T(t^*) = (1 - e)c_N$ is represented by the curve $T(t^* \mid \gamma = 0)$. That
$T(t^* \mid \gamma = 0)$ is an increasing function of $w_L$ is a
restatement of part (i) of proposition 1.

The comparative static results implicit in Figure 3 
are summarized as follows. 


PROPOSITION 2. A mean-preserving increase in inequality
that lowers the income of the median voter (i) reduces
wage earners' preferred level of benefits targeted to
those with no income, (ii) reduces wage earners'
preferred level of aggregate welfare spending if $w_L \ge w_0$,
and (iii) increases wage earners' preferred level of
aggregate welfare spending if $w_L < w_0$.


Proof: Part (i) states that $c_N$ is an increasing function of
$w_L$ (equation 17 and proposition 1, part (i)). Part (iii)
states that $T(t^*)$ is a decreasing function of $w_L$ for
$w_L < w_0$ (equation 16), and part (ii) states that
$T(t^* \mid \gamma = 0)$ is an increasing function of $w_L$ for
$w_L \ge w_0$ (proposition 1, part (i)).16
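
The pattern described in proposition 2 can be sketched numerically by computing the wage earners' preferred pair $(t^*, \gamma^*)$ at several wage levels, holding $\bar{w}$ fixed. The functional forms (utility with $\mu = 2$, $\tau(t) = t(1 - t)$), the parameter values, and the function names are assumptions made for this illustration.

```python
# Sketch of proposition 2: joint choice of t and gamma by employed wage
# earners. Functional forms and numbers are illustrative assumptions.

ALPHA, BETA, R = 0.1, 0.9, 0.05
E = 0.84                 # employed share of the population
W_BAR = 1.14 / 0.84      # mean wage, held constant
MU = 2.0                 # coefficient of relative risk aversion

def tau(t):
    return t * (1.0 - t)

def constrained_tax(w_L, lo=1e-9, hi=0.5, steps=200):
    """t* when gamma = 0 binds: bisection on equation 7 with gamma = 0."""
    def condition(t):
        c_E = (1.0 - t) * w_L
        c_N = tau(t) * E * W_BAR / (1.0 - E)
        return (-(BETA + R) * c_E ** (-MU) * w_L
                + ALPHA * c_N ** (-MU) * (1.0 - 2.0 * t) * E * W_BAR / (1.0 - E))
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        if condition(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def preferred_policy(w_L):
    """Return (t*, gamma*) for a wage earner with wage w_L."""
    t = 0.5 * (1.0 - w_L / W_BAR)          # equation 14 with tau'(t) = 1 - 2t
    if t > 0.0:
        # Equation 15 with this utility function fixes the ratio c_N / c_E.
        k = (E / (1.0 - E) * ALPHA / (BETA + R)) ** (1.0 / MU)
        # Budget identity: e c_E + (1 - e) c_N = e [(1 - t) w_L + tau(t) w_bar].
        resources = E * ((1.0 - t) * w_L + tau(t) * W_BAR)
        c_N = k * resources / (E + (1.0 - E) * k)
        gamma = 1.0 - (1.0 - E) * c_N / (tau(t) * E * W_BAR)
        if gamma >= 0.0:
            return t, gamma
    return constrained_tax(w_L), 0.0       # constraint gamma >= 0 binds

def spending(w_L):
    """Preferred aggregate welfare spending T(t*)."""
    t, _ = preferred_policy(w_L)
    return tau(t) * E * W_BAR

def transfers_to_nonearners(w_L):
    """Preferred total benefits for those without earnings, (1 - gamma) T."""
    t, gamma = preferred_policy(w_L)
    return (1.0 - gamma) * tau(t) * E * W_BAR

for w_L in (0.8, 1.0, 1.2, 1.3):
    print(w_L, preferred_policy(w_L), spending(w_L), transfers_to_nonearners(w_L))
```

With these assumed values, transfers to those without earnings rise with $w_L$ throughout, while aggregate spending first falls and then rises once the constraint $\gamma \ge 0$ binds.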

When workers' income falls, their demand for redistribution
increases, and their demand for insurance
against loss of earnings declines. When the wage is 
sufficiently low, relative to the mean, the preferred 
level of aggregate spending provides more than enough 
to finance the preferred level of insurance, which 
leaves money in the budget to be distributed to em- 
ployed workers and high-income earners. As the wage 
rises relative to the mean, however, wage earners’ 
demand for insurance increases, and their demand for 
redistribution falls. Eventually, the wage rises above
the threshold $w_L = w_0$, and wage earners prefer the
entire welfare budget to be targeted to those without
earnings. With $\gamma = 0$, wage earners face the conflict
between redistributive and insurance motives for supporting
welfare spending, described in the previous
section. According to proposition 1, the insurance
motive dominates when $\gamma = 0$, in the sense that the
preferred benefit level rises with $w_L$.

In the previous section, when y was assumed to be 
fixed, political choice was one-dimensional, and the 
political equilibrium could be identified with the opti- 
mal policy of the median income group. Proposition 2 
implies that the same reasoning can be applied with
regard to the simultaneous choice of $t$ and $\gamma$ when the
median income is sufficiently close to the mean. If
$w_L \ge w_0$ in Figure 3, a majority of voters prefer to
target all benefits to those without earnings. (The $\gamma$
that is optimal for wage earners who have lost their
earnings and for those who never work is always less
than or equal to the $\gamma$ preferred by employed wage
earners.) Given majority support for $\gamma = 0$, part (i) of
proposition 1 applies. The ideal policy combination of


16 Bénabou (2000) derives a similar V-shaped relationship between
redistributive spending and inequality from a different set of assumptions
regarding preferences, risk, and the fiscal system.
employed wage earners is preferred by a majority to all
feasible policy alternatives. 

In the case in which $w_L$ is sufficiently low, such that
employed wage earners prefer a policy combination
with some spending targeted to the employed
($w_L < w_0$), however, no policy has the property of being
preferred by a majority to all feasible alternatives. To
specify an outcome, one must add some additional
assumptions about the political process. We consider
two approaches that can be used to select an equilibrium.
The first assumes issue-by-issue voting, as in
Shepsle (1979). If $t$ and $\gamma$ are decided in separate
parliamentary votes, then the outcome is the policy
combination preferred by employed wage earners,
regardless of which policy vote is first, since employed
wage earners are the median group in both policy
dimensions.17 The second approach assumes electoral
competition between two parties or two coalitions of
parties that have distinctive constituencies.18 Suppose
the leftist party seeks the support of wage earners and 
the poor, and the rightist party seeks the support of 
wage earners and voters with income above the me- 
dian. If the party system prevents the formation of an 
alliance of the rich and poor against the middle, the 
wage earners’ preferred policy is again a stable political 
equilibrium. Although other plausible approaches 
might yield other equilibria, we proceed to investigate 
the extent to which the pattern of government spending 
in advanced industrial societies fits the preferences of 
the median income group as described in proposition 2. 


INEQUALITY AND WELFARE SPENDING IN 
18 COUNTRIES, 1980-95 


Definition of Variables 


According to proposition 1, for the case of exogenous 
targeting, and proposition 2, for the case of endoge- 
nous targeting, an increase in the skewness of the 
income distribution reduces the share of GDP that the 
median income group prefers to spend on benefits for 
those who have lost their earnings. In the notation of 
the model, the preferred level of (1 − e)cy = (1 − y)T(t)
is a decreasing function of the skewness of the
income distribution. In addition, the two propositions 
imply that an increase in skewness reduces the share of 
government spending that the median income group 
prefers to allot to benefits aimed at those without
earnings.19 In this section, we show that both these


studies of political competition among constituency-based parties
over redistributive policies.

19 More precisely, the model implies that (1 − e)c,/T(t) = (1 − y)
is a declining function of the skewness of the income distribution as
long as y > 0. Given the existence of government expenditures for
purposes other than insurance and redistribution, however, it is
reasonable to expect that, if our model is correct, spending on
implications of the model fit the data on welfare
expenditures in advanced industrial societies during 
1980-95. Spending on social insurance against in- 
come loss is lower in countries where the income 
distribution is highly skewed, whether measured as a 
share of GDP or as a share of total government 
expenditures. 

To measure Spending on Insurance Against Income 
Loss, we sum expenditures in seven categories: disabil- 
ity cash benefits, occupational injury and disease, sick- 
ness benefits, services for the disabled and elderly, 
survivors’ benefits, active labor market programs, and 
unemployment insurance. Both government expendi- 
tures and mandated private expenditures are included. 
Government spending on health care does not fit in 
this category because in seventeen of the countries 
coverage is provided to all, regardless of income or 
employment status. The exception is the United States, 
which targets substantial programs to the elderly and 
the poor. To include health care spending would 
significantly overstate expenditures aimed at the unem- 
ployed in the seventeen countries, but to exclude it 
might understate benefits targeted to those without 
employment in the United States. Therefore, we added 
100% of government spending on health care to the 
benefits received by those without earnings in the 
United States, but we only added 42% in the other 
countries. The latter figure is the government share of 
total health care expenditures in the United States in 
1990 (OECD 1994).21 
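
The construction of the spending measure described above can be sketched in a few lines. This is only an illustration under stated assumptions: the category keys, the function name, and every number fed to it are hypothetical placeholders, not OECD figures or the authors' code. Only the seven categories and the 100% and 42% health care shares come from the text.

```python
# Illustrative sketch only. Category keys and the function name are hypothetical;
# the seven categories and the two health care shares follow the text.
INSURANCE_CATEGORIES = (
    "disability_cash_benefits",
    "occupational_injury_and_disease",
    "sickness_benefits",
    "services_for_disabled_and_elderly",
    "survivors_benefits",
    "active_labor_market_programs",
    "unemployment_insurance",
)

# Share of government health spending counted as going to those without earnings.
HEALTH_SHARE_US = 1.00     # United States
HEALTH_SHARE_OTHER = 0.42  # the other seventeen countries


def insurance_against_income_loss(spending, public_health, country):
    """Return the measure in the units of the inputs (for example, % of GDP).

    spending      -- dict mapping each of the seven categories to a number
    public_health -- government spending on health care
    country       -- country name; only "United States" is treated specially
    """
    missing = [c for c in INSURANCE_CATEGORIES if c not in spending]
    if missing:
        raise KeyError(f"missing categories: {missing}")
    share = HEALTH_SHARE_US if country == "United States" else HEALTH_SHARE_OTHER
    return sum(spending[c] for c in INSURANCE_CATEGORIES) + share * public_health
```

Keeping the two health care shares as named constants makes it easy to rerun the calculation with health care excluded (both shares set to zero), which is the robustness check mentioned in note 21.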

Our measure of insurance against the risk of income 
loss excludes old age cash benefits, family benefits, 
housing benefits, and benefits for other contingencies. 
Many family and housing programs cover those with- 
out employment, but many do not, and we have no way 
of estimating how the spending in these areas is divided 
among different types of households. Pensions are a 
large category of spending that is received by persons 
who are not employed. Although public pensions in-
sure against investment risks inherent in private pen-
sions, the loss of income upon retirement is an ex-
pected event in a way that the loss due to sickness or
layoffs is not. Thus, we exclude government pension 
programs from our measure. Spending on insurance 
against income loss is still substantial even with pen- 
sions excluded. Among the eighteen countries in our 
sample, it averaged 10% of GDP or 20% of total 
government spending from 1980-95. 

According to most models, the aspect of income 
distribution that matters politically is the comparison 
of the median wage to the mean. The absence of data, 
however, compels us to use the ratio of earnings at 


insurance should increase as a share of total government spending as
skewness falls, even when the y = 0 constraint is binding.
20 The countries in the Organization for Economic Cooperation and
Development (OECD) data set are Australia, Austria, Belgium,
Canada, Denmark, Finland, France, Germany, Italy, Japan, Nether-
lands, New Zealand, Norway, Portugal, Sweden, Switzerland, United
Kingdom, and the United States. These are the countries for which
the OECD (1993, 1996) has published measures of wage and salary
inequality.
21 The statistical results are not changed significantly if government
spending on health care is excluded from the insurance category.
different percentiles of the wage distribution as a proxy.
If we approximate the empirical distribution of wages 
and salaries with the lognormal distribution, then the 
ratio of the median to the mean wage can be written as 


$$\frac{\text{median wage}}{\text{mean wage}} = \exp(-\sigma^2/2),$$

where $\sigma^2$ is the variance of the log of wages and
salaries. This variance, in turn, can be derived from
the ratio of wages at any two percentiles of the wage
distribution, according to the formula

$$\sigma = k_{ij} \ln(w_i/w_j),$$

where $w_i$ and $w_j$ are the wages at the ith and jth
percentiles, respectively, with i > j, and where $k_{ij}$ is a
positive constant that depends on i and j. It follows that
the ratio of the median wage to the mean wage is a
strictly decreasing function of the wage ratio $w_i/w_j$ for
any i and j with i > j.
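
The lognormal argument above can be checked numerically. The sketch below is not from the article: the function names are hypothetical, and the explicit form of the constant, k = 1/(z_i − z_j) with z the standard normal quantiles, is derived here from the lognormal assumption (the text says only that the constant is positive and depends on i and j).

```python
import math
from statistics import NormalDist


def sigma_from_percentile_ratio(ratio, i=90, j=10):
    """Recover sigma (std. dev. of log wages) from the wage ratio w_i/w_j.

    Under lognormality ln(w_p) = mu + sigma * z_p, so
    ln(w_i/w_j) = sigma * (z_i - z_j) and k_ij = 1 / (z_i - z_j).
    """
    if not (0 < j < i < 100):
        raise ValueError("need 0 < j < i < 100")
    if ratio < 1:
        raise ValueError("w_i/w_j must be at least 1 when i > j")
    z = NormalDist()
    k_ij = 1.0 / (z.inv_cdf(i / 100) - z.inv_cdf(j / 100))
    return k_ij * math.log(ratio)


def median_to_mean_ratio(ratio, i=90, j=10):
    """Median wage over mean wage, exp(-sigma^2 / 2), implied by w_i/w_j."""
    sigma = sigma_from_percentile_ratio(ratio, i, j)
    return math.exp(-sigma ** 2 / 2)
```

Because sigma rises with the wage ratio and exp(−σ²/2) falls with sigma, a larger 90/10 ratio always implies a lower median relative to the mean, which is the monotonicity the text relies on.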

In theory, any ratio $w_i/w_j$ is an equally good proxy for
the ratio of the median to the mean. Our proxy for the
skewness of the income distribution, Inequality (90/10),
is derived from the ratio of pretax earnings at
the 90th percentile to those at the 10th percentile of the
distribution of wages and salaries. These data are
available for eighteen industrialized countries from
1979-80 through 1995-96 (OECD 1996). Taking the
log of $[(w_{90}/w_{10}) - 1]$ improves the fit slightly. There-
fore, we use

$$\text{inequality (90/10)} = \ln\left(\frac{w_{90} - w_{10}}{w_{10}}\right)$$

as our proxy for the skewness of the income distribu-
tion. All the statistical analyses were redone using the
$w_{50}/w_{10}$ and the $w_{90}/w_{50}$ ratios in place of the $w_{90}/w_{10}$
ratio to check that our findings are robust with respect 
to the choice of proxy. Because wage inequality data 
are not available annually for the entire data set, we 
took the average of all data points for each country in 
the periods 1980-84, 1985-89, and 1990-94, which
yielded (3)(18) = 54 possible data points. After remov- 
ing cases with no inequality data for one or more of the 
five-year periods, we were left with 50 observations.24 
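
A minimal sketch of how the proxy and the period averages could be computed follows. The function names and the sample inputs are hypothetical. It also assumes that the annual ratios are averaged within a period before the log transformation is applied, which the text does not specify.

```python
import math

# The three five-year periods used for the inequality data.
PERIODS = ((1980, 1984), (1985, 1989), (1990, 1994))


def inequality_from_ratio(ratio):
    """ln((w_high - w_low) / w_low), written in terms of ratio = w_high / w_low."""
    if ratio <= 1:
        raise ValueError("the percentile ratio must exceed 1")
    return math.log(ratio - 1.0)


def period_averages(ratios_by_year):
    """Average the available annual ratios within each period.

    Periods with no observations are left out, mirroring the removal of
    country-periods that have no inequality data.
    """
    averages = {}
    for start, end in PERIODS:
        values = [r for year, r in ratios_by_year.items() if start <= year <= end]
        if values:
            averages[(start, end)] = sum(values) / len(values)
    return averages


# Hypothetical country with data in only two of the three periods.
example = period_averages({1980: 3.0, 1982: 3.2, 1991: 3.4})
proxy_by_period = {p: inequality_from_ratio(r) for p, r in example.items()}
```

With eighteen countries and three periods there are 54 possible country-periods; each period that drops out of `period_averages` for lack of data removes one of them.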

Welfare spending today is highly correlated with 
welfare spending in the recent past. Budgets are ad- 
justed up or down from the status quo. For the 
eighteen countries, if one regresses spending on social 
insurance against income loss as a share of GDP in 







See Moene and Wallerstein 2001a for the full set of regression
equations using the $w_{50}/w_{10}$ and the $w_{90}/w_{50}$ wage ratios. In addition,
our results do not depend in a significant way on whether one uses
$[(w_{90}/w_{10}) - 1]$ or $\ln(w_{90}/w_{10})$ or $(w_{90}/w_{10})$ as the proxy.

The excluded cases were Belgium 1980-84, Portugal 1980-84,
1985, 1990, and 1995 on spending as a share of GDP
five years earlier, one obtains 


$$(\text{Spending/GDP})_t = 1.60 + 0.913\,(\text{Spending/GDP})_{t-5},$$


where the standard error of the coefficient in front of
the lagged dependent variable is 0.05, and $R^2$ = 85.5. It
is clear that the influence of past spending on current 
decisions cannot be ignored in empirical work. 
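
For readers who want to reproduce this kind of persistence regression, here is a minimal sketch of ordinary least squares with a single regressor, the five-year lag of spending. The function name is hypothetical and no data from the article are used; the inputs are whatever paired observations the reader supplies.

```python
import math


def ols_on_lag(current, lagged):
    """Regress current spending on spending five years earlier.

    Returns (intercept, slope, standard error of the slope, R-squared).
    Requires at least three observations and some variation in `lagged`.
    """
    n = len(current)
    if n != len(lagged) or n < 3:
        raise ValueError("need at least three paired observations")
    mean_x = sum(lagged) / n
    mean_y = sum(current) / n
    sxx = sum((x - mean_x) ** 2 for x in lagged)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(lagged, current))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ss_res = sum((y - intercept - slope * x) ** 2 for x, y in zip(lagged, current))
    ss_tot = sum((y - mean_y) ** 2 for y in current)
    se_slope = math.sqrt(ss_res / (n - 2) / sxx)
    return intercept, slope, se_slope, 1.0 - ss_res / ss_tot
```

A slope close to one, as in the estimate reported above, means that most of the cross-national variation in spending is inherited from the previous period, which is why the lagged dependent variable is kept in every specification.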

In addition to a Lagged Dependent Variable, we 
control for the rate of unemployment (Unemployment), 
government by conservative parties (Right Govern- 
ment), the turnout for elections to the lower house of 
parliament (Turnout), and the proportion of the pop- 
ulation over age 65 (Percentage Elderly). The rate of 
unemployment is potentially an important determinant 
of spending on unemployment insurance, active labor 
market policies, and even disability insurance (coun- 
tries with high levels of joblessness may classify some of 
the unemployed as disabled) (Pampel and Williamson 
1989). Because our measure of spending on insurance 
against income loss includes survivors’ insurance and 
expenditures on health for those not in the labor 
market, the share of the population who are elderly 
also may affect spending levels. Whether one views the 
fraction of the population who are over age 65 or who 
are unemployed as measuring need or measuring po-
litical influence has a subtle implication for measurement
choice. As an indicator of need, the relevant control is 
the share of each group in the year at which expendi- 
tures are measured. As an indicator of political influ- 
ence, the relevant control is an average of the size of 
each group in the preceding five years, since policy 
changes lag shifts in the electorate. We let the data 
decide this issue. The unemployment rate fits the data 
much better if measured in the same year that we 
measure social insurance benefits. With regard to the 
proportion of elderly, using the average over the
preceding five years fits slightly better, although the
difference is small.

In the literature on partisanship and welfare spend- 
ing, the early emphasis was on the division between 
socialist or social democratic parties and center/Right 
parties (Korpi 1983; Stephens 1979). Like many before 
us, we find that the most important division is between 
the Left/center and the Right (Castles 1982; Esping- 
Andersen 1990). Because the Left versus center/Right 
division was never significant in any of our regressions, 
we only report results based on the classification con- 
servative versus Left/center. Our measure of conserva- 
tive government is the share of cabinet seats held by 
conservative parties (Castles and Mair 1984; Huber 
and Inglehart 1995). 

Turnout also may have an important effect on polit- 
ical support for spending to insure against the risk of 
income loss (Franzese 1998; Lijphart 1997). We include
the average turnout in elections to the lower house of
parliament in the preceding five years. Summary sta- 
tistics and data sources for all the variables are pre- 
sented in the Appendix. 

TABLE 1. Effect of Inequality on Government Spending for Insurance against Loss of Income as

Among the earliest findings of the empirical litera-
ture is that welfare spending is higher as a share of
GDP in countries with a higher level of GDP per capita
(Wilensky 1975). Yet, like most other researchers 
whose data set is limited to countries with relatively 
high GDP per capita, we found that this factor had no 
explanatory power. Therefore, we did not include GDP 
per capita in the set of independent variables. In 
addition, we used no controls for union strength, such 
as density or centralized wage-setting, since these two 
are among the most important determinants of the 
inequality of earnings distribution (Freeman 1988; 
Hibbs and Locking 2000; Rueda and Pontusson 2000; 
Wallerstein 1999). It should be noted that government 
spending has little influence on the inequality of pretax 
wages and salaries after controlling for the effect of 
wage-setting institutions (Wallerstein 1999). Thus, the 
inequality of pretax wages and salaries can be consid- 
ered exogenous with respect to spending on social 
insurance benefits. 


Testing the Model 


Regression results with spending on social insurance
against income loss as a share of GDP as the depen-
dent variable are presented in Table 1.25 The first
column is our basic specification. The estimated effect
of the skewness of the wage distribution on spending as
a share of GDP is strongly negative, as predicted by the
model. Among advanced industrial societies, countries
with a more skewed distribution spend less on insur-
ance against income loss than countries with a more
egalitarian distribution. An increase in inequality (90/10)
by one standard deviation (.378) is estimated to reduce
spending on insurance against income
loss as a share of GDP by (2.17)(.378) ≈ 0.8 percentage




25 The panel-corrected standard errors are estimated assuming that
E(εε′) = Σ ⊗ I, where Σ is a general 18 × 18 variance-covariance
matrix for the error terms associated with the eighteen countries and
I is the 3 × 3 identity matrix for the three time periods. See Beck and
Katz 1995 or Greene [1993] 1997, 651-69 for a discussion of the
estimation of panel-corrected standard errors.
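
To make the error structure in this note concrete, the following sketch computes panel-corrected standard errors for the simplest possible case: a constant and one regressor on a balanced panel. It is an illustration of the general Beck and Katz sandwich formula, not the authors' code; the function names are hypothetical, and the regressions in Table 1 contain several regressors rather than one.

```python
import math


def _matmul2(a, b):
    """Multiply two 2x2 matrices stored as nested lists."""
    return [[sum(a[r][k] * b[k][c] for k in range(2)) for c in range(2)]
            for r in range(2)]


def pcse_bivariate(y, x):
    """Pooled OLS of y on a constant and x, with panel-corrected standard errors.

    y[i][t] and x[i][t] hold unit i in period t (balanced panel).
    Returns (intercept, slope, se_intercept, se_slope).
    """
    n_units, n_periods = len(y), len(y[0])
    xs = [v for row in x for v in row]
    ys = [v for row in y for v in row]
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxx = sum(v * v for v in xs)
    sxy = sum(a * b for a, b in zip(xs, ys))
    det = n * sxx - sx * sx
    # "Bread": (X'X)^-1 for the design matrix X = [1, x].
    bread = [[sxx / det, -sx / det], [-sx / det, n / det]]
    b0 = bread[0][0] * sy + bread[0][1] * sxy
    b1 = bread[1][0] * sy + bread[1][1] * sxy
    resid = [[y[i][t] - b0 - b1 * x[i][t] for t in range(n_periods)]
             for i in range(n_units)]
    # Sigma[i][j]: contemporaneous covariance of the residuals of units i and j.
    sigma = [[sum(resid[i][t] * resid[j][t] for t in range(n_periods)) / n_periods
              for j in range(n_units)] for i in range(n_units)]
    # "Meat": X'(Sigma kron I)X = sum over unit pairs of Sigma[i][j] * X_i'X_j.
    meat = [[0.0, 0.0], [0.0, 0.0]]
    for i in range(n_units):
        for j in range(n_units):
            block = [[n_periods, sum(x[j])],
                     [sum(x[i]), sum(a * b for a, b in zip(x[i], x[j]))]]
            for r in range(2):
                for c in range(2):
                    meat[r][c] += sigma[i][j] * block[r][c]
    cov = _matmul2(_matmul2(bread, meat), bread)
    # Guard against tiny negative values caused by floating-point rounding.
    return b0, b1, math.sqrt(max(cov[0][0], 0.0)), math.sqrt(max(cov[1][1], 0.0))
```

With a single unit the formula collapses to the usual OLS covariance (with the residual variance divided by T rather than T − 2), which gives a convenient sanity check. With eighteen units and three periods, Σ is estimated from very few observations per pair of countries, so the resulting standard errors should be read with that limitation in mind.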


                             1               2               3               4               5
Lagged dependent variable    .761* (.063)    .734* (.073)    .789* (.089)    .778* (.068)    .757* (.083)
Inequality (90/10)          -2.17* (0.33)   -1.93* (0.35)   -2.52* (0.55)   -1.83* (0.39)
Unemployment                 .118 (.073)     .122 (.072)     .117 (.082)     .060 (.071)     .119 (.077)
Turnout                     -.053* (.012)   -.050* (.011)   -.043* (.015)   -.044* (.011)   -.053* (.014)
Rightist government         -.013* (.004)   -.013* (.005)                   -.013* (.004)   -.013* (.005)
Percentage elderly                           .108 (.108)
Inequality (50/10)                                                                          -1.31* (0.53)
Inequality (90/50)                                                                          -1.53 (0.97)
adj. R²                      90.0            90.1            88.3            90.3            89.8
N                            50              50              50              49              50

Note: The table shows OLS estimation. The dependent variable is insurance benefits for loss of income as a share of GDP. Parentheses contain
panel-corrected standard errors. All regressions include a constant. Column 4 excludes Finland 1995. *p ≤ .05.














points in five years. In the long run, such an increase
reduces that spending by 0.8/(1 − .761) ≈ 3.3 percent-
age points. This is a large effect. For example, con-
sider the contrast between average spending on insur-
ance against income loss in the Netherlands (15% of
GDP), ranked highest in this category in the sample,
and the United States (8% of GDP). Given that the
difference for the inequality (90/10) variable between
the Netherlands and the United States averaged 1.44 −
0.43 ≈ 1.0, the long-run difference in their spending as
a share of GDP we can expect on that basis is
(2.17)(1.0)/(1 − .761) ≈ 9 percentage points, compared
to an actual difference of 7 percentage points.
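
The short-run and long-run calculations in this paragraph follow from the geometric accumulation implied by a lagged dependent variable. The sketch below reproduces the arithmetic using the column 1 estimates; the function names are hypothetical, and the per-period path is an added illustration of how the long-run figure is approached.

```python
def long_run_effect(short_run, lag_coefficient):
    """Total effect of a permanent change as time goes to infinity."""
    if not -1.0 < lag_coefficient < 1.0:
        raise ValueError("the lag coefficient must lie strictly between -1 and 1")
    return short_run / (1.0 - lag_coefficient)


def cumulative_effect(short_run, lag_coefficient, periods):
    """Effect accumulated after a number of five-year periods."""
    return short_run * (1.0 - lag_coefficient ** periods) / (1.0 - lag_coefficient)


# Column 1 of Table 1.
BETA_INEQUALITY = -2.17
LAG_COEFFICIENT = 0.761
ONE_STANDARD_DEVIATION = 0.378

short_run = BETA_INEQUALITY * ONE_STANDARD_DEVIATION   # about -0.8 points
long_run = long_run_effect(short_run, LAG_COEFFICIENT)  # about -3.4 points unrounded
# The text rounds the short-run effect to 0.8 first, which gives about 3.3.
netherlands_vs_us = long_run_effect(BETA_INEQUALITY * 1.0, LAG_COEFFICIENT)
```

Calling `cumulative_effect(short_run, LAG_COEFFICIENT, 3)` shows how much of the long-run effect would have accumulated within the fifteen years covered by the sample.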

The estimated effect of turnout on spending as a 
share of GDP is also negative. Higher turnout is 
associated with less spending on insurance against the 
risk of income loss, controlling for the skewness of 
income distribution. The simplest explanation is that 
the propensity to vote is positively correlated with age 
as well as income. Indeed, Franklin (1996, 220), in an 
analysis of survey data from 21 European countries and 
the United States, found that the effect of age exceeds 
that of income.27 When turnout is low, the elderly may
comprise a larger share of voters than when turnout is 
high. Because some of the insurance benefits going to 
those without earnings are disproportionately received 
by the older voters (obvious examples are disability and 





26 The coefficients reported in Table 1 represent the short-run effect
of a unit change in the independent variables on the dependent
variable, where the short run is within five years, since all variables
are measured in five-year intervals. The long-run effect of a unit
change in the independent variables is given by the short-run effect
divided by (1 − .761), where .761 is the estimated coefficient on the
lagged dependent variable. The long run refers to the total cumula-
tive effect of a permanent change in one of the independent variables
as time goes to infinity.

27 Franklin (1996, 220) divided his sample into five age and income
categories. The difference in turnout between the top and bottom age
quintiles was approximately 30 percentage points (from 58.8% to
88.9%); the difference in turnout between the top and bottom
income quintiles was roughly 15 percentage points.

survivors’ insurance), the negative estimated effect of
turnout on spending may reflect the fact that smaller 
electorates contain a relatively larger proportion of 
elderly than do larger electorates. 

As an additional control for the age distribution of 
the electorate, we added the proportion of the popu- 
lation who are elderly to the set of independent 
variables in the second column of Table 1. Note that 
cross-national differences in turnout far exceed cross- 
national differences in the share of the population over 
age 65, as shown in Table A-1. Thus, it is not surprising 
that the standard error of the estimated effect of the 
percentage elderly is much larger than the standard 
error of the estimated effect of turnout. The point 
estimates indicate that both a high share of elderly in 
the population and a low turnout are associated with 
higher benefits. 

There are two ways in which income distribution can 
affect spending for insurance against income loss. The 
first is to induce all parties to shift their platform in the 
direction of the policy preferred by the median income 
group. This is implicit in the first column of Table 1. 
The second mechanism is to alter the likelihood of an 
electoral victory of the parties that are most committed 
to social insurance policies, that is, the parties of the 
Left and center. In this case, government by rightist 
parties is an endogenous variable that reflects the
distribution of income. If income distribution affects 
policy by affecting the likelihood of a conservative 
electoral victory, then inclusion of rightist government 
as a control variable will underestimate the influence of 
income on social insurance benefits as a share of GDP. 
The third column of Table 1 shows that the estimated 
effect of inequality (90/10) does increase in absolute 
value when government by the Right is removed as a 
control, but the difference is not large. 

The last two columns of Table 1 present several 
checks on the robustness of our results. Tests of the 
influence of individual data points revealed that the 
case of Finland in 1995, an outlier with an unemploy- 
ment rate of 17.4%, has a large influence on the 
estimates. Its extraordinarily high rate significantly 
increases the range over which unemployment varies in
the data and allows more precise estimates of the
coefficients. As the fourth column of Table 1 shows,
removing the case of Finland 1995 greatly reduces the
estimated effect of unemployment and slightly reduces
the estimated effect of inequality (90/10).

In column 5, we replace the $w_{90}/w_{10}$ ratio with
$w_{50}/w_{10}$ and $w_{90}/w_{50}$ to test the argument of Kristov,
Lindert, and McClelland (1992) that support for social
insurance depends on the social affinity felt by the
median group toward the poor. Kristov and his col-
leagues argue that such affinity is a negative function of
the distance between the middle and the poor, as

Standard economic theory suggests that unemployment also may
be endogenous, in the sense of being affected by both the distribution







set of controls had little effect on the estimated coefficients of the
variables that remained.

TABLE 2. Effect of Inequality on Government Spending for Insurance against Loss of
Income as a Share of Government Expenditures in Eighteen Countries, 1980-95

                             1               2
Lagged dependent variable    .845* (.105)    .851* (.089)
Inequality (90/10)          -1.33* (0.27)   -1.74* (0.28)
Turnout                     -.080* (.013)   -.064* (.013)
Rightist government         -.008 (.009)    -.008 (.010)
Unemployment                -.042 (0.78)
Percentage elderly           .131 (.101)
adj. R²                      84.9            85.3
N                            46              46

Note: The table shows OLS estimation. The dependent variable is
insurance benefits as a share of government expenditures. Parentheses
contain panel-corrected standard errors. All regressions include a
constant. New Zealand and Portugal 1995 are deleted because of
missing data. *p ≤ .05.



measured by the $w_{50}/w_{10}$ ratio, and a positive function
of the distance between the middle and the rich, as
measured by the $w_{90}/w_{50}$ ratio. Thus, the estimated
coefficient for inequality (50/10) should be negative, 
and the estimated coefficient for inequality (90/50) 
should be positive. According to our model, in contrast, 
both measures are equivalent proxies of income skew- 
ness. Column 5 indicates that the estimated coefficients 
for both are virtually the same, negative number. 

Our model implies that countries with a highly 
skewed distribution of wages and salaries spend less on 
insurance against income loss as a share of either 
government expenditures or GDP. This is tested in 
Table 2. The dependent variable is social insurance 
spending as a share of government spending, defined as 
total outlays by all levels of government minus gross 
capital formation and other capital expenditures. Be- 
cause the signs of the estimated coefficients for both 
unemployment and the share of elderly in column 1 are 
contrary to our prior beliefs, we removed both controls 
in column 2.29 Both specifications in Table 2 show a
strong, negative relationship between spending on in- 
surance against income loss as a share of government 
spending and the skewness of the wage distribution. 
Together, columns 1 and 2 imply that a long-lasting 
increase in inequality (90/10) by one standard deviation 
reduces social insurance spending as a share of govern-
ment outlays by between (1.33)(.378)/(1 − .851) ≈ 3.4
and (1.74)(.378)/(1 − .851) ≈ 4.4 percentage points in
the long run. 

Our model generates weaker predictions regarding 
the relationship between the skewness of income dis- 
tribution and total spending on benefits for both the 
employed and people without earnings. The V-shaped 








29 Since spending on pensions reduces the share of government
spending that is spent on insurance against income loss, we expected
the coefficient on the percentage elderly to be negative.

relationship between skewness and T(t) described in
proposition 2 and illustrated in Figure 3 is compatible 
with any empirical relationship between our measure 
of inequality and T(t) except for an upside-down V. A 
further obstacle to testing the predictions of our model 
regarding T(t) is the difficulty in measuring govern- 
ment spending on benefits for the employed. Defined 
narrowly to be transfer payments received by the 
median group of wage earners, these benefits as a share 
of GDP are close to zero in all advanced industrial 
societies; this is consistent with our model if the income 
distribution in those societies is sufficiently egalitarian 
that a majority prefers transfer payments be targeted to 
those without earnings.30 An alternative approach is to
equate T(t) with total social expenditures as defined by 
the OECD (1999), which includes insurance benefits to 
those without earnings (about 40% of the total), pen- 
sions (about 30% of the total) and other transfer 
payments and in-kind benefits that do not depend on 
employment status.31

Column 1 of Table 3 reveals a strong, negative 
relationship between income inequality and social ex- 
penditures as a share of GDP, as Rodríguez (1998) has 
shown. If one looks for a V-shaped relationship be- 
tween social expenditures and income inequality, it can 
be found. Searching the data for a critical level of 
inequality that generates the strongest V-shaped rela- 
tionship yields the division described in columns 2 and 
3. In those two cases the sample is divided according to 
whether $w_{90}/w_{10}$ < 3.15 (the distribution of income is at
least as equal as in Japan) or $w_{90}/w_{10}$ ≥ 3.15 (the
distribution of income is at least as unequal as in 
France). The point estimates in columns 2 and 3 
indicate a V-shaped relationship between spending and 
inequality, but the F-statistic fails to reject the null 
hypothesis that the coefficients in the two subsamples 
are identical. 
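
The F-test mentioned here compares a pooled regression with separate regressions on the two subsamples (a Chow-type test). The sketch below shows the mechanics for the simplest case of a constant and one regressor. It is an illustration only: the function names are hypothetical, and the regressions in Table 3 include several regressors, so the degrees of freedom there differ from the k = 2 used here.

```python
def ssr_simple_ols(x, y):
    """Sum of squared residuals from regressing y on a constant and x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    slope = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / sxx
    intercept = mean_y - slope * mean_x
    return sum((b - intercept - slope * a) ** 2 for a, b in zip(x, y))


def chow_f_statistic(x_low, y_low, x_high, y_high):
    """F-statistic for the null that both subsamples share the same coefficients.

    k = 2 parameters (constant and slope) are estimated in each regression.
    """
    k = 2
    n = len(x_low) + len(x_high)
    ssr_pooled = ssr_simple_ols(x_low + x_high, y_low + y_high)
    ssr_split = ssr_simple_ols(x_low, y_low) + ssr_simple_ols(x_high, y_high)
    return ((ssr_pooled - ssr_split) / k) / (ssr_split / (n - 2 * k))
```

A large statistic indicates that the two subsamples follow different relationships, as a true V-shape would imply. A small statistic, such as the one reported in Table 3, means the data cannot distinguish the V-shape from a single relationship across the whole sample.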

To summarize the empirical evidence, there is strong 
support for the model’s predictions regarding insur- 
ance against income loss. In advanced industrial soci- 
eties, the more positively skewed the distribution of 
pretax earnings, the lower is government spending on 
insurance against income loss, whether measured as a 
share of GDP or total government spending. There is 
also a strong negative relationship between aggregate 
social expenditures as a share of GDP and income 
inequality. In other words, there is little empirical 
support for a purely redistributive model of welfare 
expenditures. The empirical relationship between in- 
equality and political support for welfare programs, we 
suggest, cannot be adequately understood without con- 
sidering welfare policies as publicly provided insur- 
ance. 


30 Family benefits, the only transfer payments likely to be received by
a voter with median income, average 1.4% of GDP among advanced 
industrial societies (OECD 1999). 

31 Social expenditures include government spending on health and
housing but not on education.

CONCLUSION


We have developed the implications of the view that 
welfare policies are publicly financed insurance that 
pays out benefits relative to contributions in a redis- 
tributive manner. At the extreme ends of the income 
scale, the insurance aspect is dwarfed by the redistrib- 
utive aspect. The poor in our model receive benefits 
and do not contribute at all, and the rich have no need 
for publicly financed insurance. But the middle group 
of voters in our model benefit from both aspects of 
welfare policies. When the redistributive and insurance 
benefits are considered simultaneously, the effect of 
increasing inequality on political support for welfare 
policies depends critically on the way in which benefits 
are targeted. Increased income inequality that is asso- 
ciated with an increased gap between median and 
mean income increases political support for redistrib- 
utive benefits received by the employed but reduces 
support for publicly provided insurance against income 
loss. When the targeting of benefits is endogenous, the 
model continues to imply that support for spending on 
insurance against the risk of income loss declines as the 
gap between the median and the mean increases. 
Regression results indicate that greater inequality is 
associated with lower spending on programs to insure 
against income loss among eighteen advanced indus- 
trial countries from 1980 to 1995, as a share of both 
GDP and total government spending. 

Our approach does not yield a clear prediction 
regarding how support for insurance against the risk of 
income loss varies across individuals with different 
incomes. In our model, the demand for welfare spend- 
ing comes from those who never work and low-wage 
workers who may lose their employment. High-wage 
workers, who, by assumption, face no risk of income 
loss, oppose spending on social insurance to the extent 
that they vote in a self-interested manner. In reality, 
however, the risk of income loss rises gradually as one 
moves up the income scale. Whether self-interested 
workers earning a higher income would support more 
or less spending on insurance against the risk of job 
loss than workers earning lower income depends on 
their relative risk as well as their relative wage.32 Our
conclusion that a more unequal distribution of income 
leads to less support for social insurance is conditional 
upon holding constant the distribution of the risk of 
income loss. 

Theoretically, the largest gap in our approach is the 
absence of a private alternative to publicly provided 
insurance. We have concentrated on the loss of in- 
come, a risk that cannot be insured privately. The 
politics of the demand for insurance when there is a 
private alternative involves different considerations. 
The policies that constitute the welfare state are het- 
erogeneous in their bases of political support. One 
model will not encompass them all. 

Empirically, there is strong support in our sample for
the proposition that countries with more skewed income
distributions spend less on insurance against
income loss. A simple cross-sectional comparison of
wage inequality and spending on social insurance yields
a clear negative relationship. That relationship holds
up surprisingly well using panel data with a lagged
dependent variable; that variable alone explains 85%
of the variance. The results can be destroyed, however,
by removing all the cross-sectional variation with a full
set of country dummy variables, as Devroye (2000)
shows. The variation in wage inequality within coun-
tries is too small over the fifteen years we studied to
provide a reasonable test of the model’s implications.
To test the relationship using a fixed-effects model, we
need measures of wage inequality and spending on
benefits targeted to those without employment over a
longer period. We also need more empirical work on
the categories of welfare expenditures that do not
consist of insurance against income loss. The political
contests over pensions or government spending on
health care may differ significantly from the political
contest over programs that insure against the loss of
income. Finding the level of disaggregation that best
explains the dynamics of political support for welfare
policies should be high on the agenda of future work.

32 Iversen and Soskice (2001) find that support for welfare expendi-
ture declines as survey respondents’ income increases, controlling for
the specificity of skills.

TABLE 3. Effect of Inequality on Social Expenditures as a Share of GDP in Eighteen Countries,
1980-95

                             1 (Entire Sample)     2 (w90/w10 < 3.15)    3 (w90/w10 ≥ 3.15)
Lagged dependent variable    .749* (.065)          .715* (.078)          .730* (.089)
Inequality (90/10)          -2.87* (0.79)         -5.88* (2.59)          4.06 (2.99)
Turnout                     -.069* (.022)         -.117* (.035)          .034 (.036)
Rightist government         -.019* (.007)         -.013 (.010)          -.028* (.008)
Unemployment                 .259* (.089)          .288* (.098)          .765* (.207)
Percentage elderly           .320 (.221)           .112 (.249)           .964* (.340)
adj. R²                      92.2                  91.8                  95.9
N                            50                    34                    16
F                                                  1.55

Note: The table shows OLS estimation. The dependent variable is total social expenditures as a share of GDP. Parentheses contain panel-corrected
standard errors in column 1, OLS standard errors in columns 2 and 3. All regressions include a constant. The F-statistic tests the null hypothesis that the
coefficient vectors in the last two columns are identical. *p ≤ .05.

TABLE A-1. Summary Statistics

Variable
Insurance for income loss/GDP
Insurance for income loss/govt. spending
Social expenditures/GDP
Inequality (90/10)
Inequality (90/50)
Inequality (50/10)


APPENDIX 


Descriptive statistics for all the variables used in the data 
analysis are presented in Table A-1. Insurance against in-
come loss refers to spending on welfare programs targeted to
those without earnings, excluding pensions, as described in
the text. Data are for 1985, 1990, and 1995 in the case of
social insurance benefits, government spending, and unem-
ployment. All the other variables represent the average value
for the periods 1980-84, 1985-89, and 1990-94. See note 20
for the countries in the sample. The source for spending on
social insurance, health care, and pensions is OECD (1999).
Wage inequality (i/j) is $\ln[(w_i - w_j)/w_j]$, as described in the
text. The data on wage inequality are from OECD (1996) 
and, in the case of the United States, OECD (1993). The 
share of government spending, the share of elderly in the 
population, and unemployment figures are from OECD 
(1997). Conservative government is based on Swank (1992), 
updated using recent issues of Keesing’s Record of World 
Events. The classification of parties in terms of Right versus 
center and Left is based on Castles and Mair (1984), updated 
with Huber and Inglehart (1995). Turnout refers to elections 
for the lower house of parliament or for president in the 
United States. The source for turnout is Blais and Dobrzynska
(1998). The data set is available from the authors upon request.


We present a theory of social policy preferences that emphasizes the composition of people’s skills. The key to our argument is that individuals who have made risky investments in skills will demand [...] against such uncertainty. This is not the case for general skills workers, for whom the costs of social protection weigh more prominently. We test the theory on public opinion data for eleven advanced democracies and suggest how differences in educational systems can help explain cross-national differences in the level of social protection.


It is well known that human capital rivals physical capital as a source of personal and national wealth. Indeed, it is the single most important determinant of personal income in advanced industrialized countries. Yet, whereas physical assets (buildings, machinery, goods, and money) have long been recognized as essential for understanding the political interests of their owners, surprisingly little is known about the role of human capital in explaining public policy preferences. With the exception of trade policy,1 only the cognitive aspects of education have received systematic attention in explaining political preferences (Duch and Taylor 1993; Kitschelt 1991; Klingemann 1979).

Following Becker (1964), we conceptualize human skills as an investment and ask how the character of this investment affects workers’ preferences for social protection. We approach this question in a fashion similar to the way transaction cost economics explains the use of nonmarket institutions to overcome market failures (Williamson 1985). In a political version of this logic, endogenous trade theory hypothesizes that investments in physical assets that are specific to a particular location or economic transaction lead firms to lobby the state for protection against uninsurable risks (see


Torben Iversen is Professor of Government, Center for European 


at the Hoover Institution, Stanford University. 
1 The trade policy argument treats skilled labor as a more or less 
abundant factor, and relative abundance is to favor free 
trade. See Rogowski 1987, Frieden 1991, and especially Scheve and 
Slaughter 1999 for different versions of this argument. 


Alt et al. 1999).2 Since it is difficult to withdraw these
assets in response to adverse market conditions, firms
want protection against the effects of such adversity.
We start from the similar idea that investment in skills
that are specific to a particular firm, industry, or
occupation exposes their owners to risks for which they
will seek nonmarket protection. Skills that are portable,
by contrast, do not require extensive nonmarket
protection, just as the exchange of homogeneous goods
does not require elaborate nonmarket governance
structures.3

Our theory does not necessarily contradict a long 
tradition in the literature that emphasizes redistribu- 
tion as a key political motive behind the welfare state 
(e.g., Esping-Andersen 1990; Korpi 1983). Indeed, 
Meltzer and Richard’s (1981) influential work on the 
median voter and government spending, which focuses 
on the redistributive aspect of social protection, 
emerges as a special case in our model. Given a 
particular composition of skills, workers with higher 
income are likely to demand less social protection than 
are workers with lower income. Our argument parts 
ways with the Meltzer-Richard model, however, be- 
cause we explicitly recognize that social protection also 
has an insurance aspect (Moene and Wallerstein 2001; 
Sinn 1995) and that demand for insurance varies 
among workers according to their degree of exposure 
to labor market risks (Baldwin 1992). A critical point is 
that in our model exposure to risk is inversely related
to the portability of skills. We show that this proposi- 
tion has strong support in public opinion data from 
eleven advanced democracies, with important implica- 
tions for explaining differences in the level of social 
protection across countries. 

The remainder of this article is divided into three sections. In the first we present the model and its main empirical implications. In the second we test these implications on public opinion data from eleven countries. In the third we discuss the broader implications of the model for explaining differences in social protection across countries.

increases the incentives of capital owners to oppose
trade and financial mobility. See Frieden 1991 and Bates, Brock, and
Tiefenthaler 1991.


THE MODEL 


Assumptions 


Workers derive their income from skills that can be either general or specific. Specific skills are valuable only to a single firm or a group of firms (whether an industry or a sector), whereas general skills are portable across all firms. We distinguish three employment situations, or states of the world, each associated with distinct levels of income. In State I a worker is employed in a firm that uses both her specific and general skills; in State II the worker is employed in a firm that only uses his general skills; and in State III the worker is unemployed (i.e., none of her skills are being used).

We define by g the market value of a worker’s general skills in State II when his specific skills are not being used. In State I (when the specific skills are being used as well) the worker is paid sg, the value of her combined specific and general skills. If a worker has no specific skills, then s = 1, and she is always employed at the market value of her general skills. The key assumption is that general skills are marketable in all sectors of the economy, whereas specific skills are only marketable in one sector (the size of which is defined by skill specificity).

In addition to market income, workers receive transfer income from the government, including unemployment benefits, health care benefits, pensions, and other forms of nonwage compensation. Although some of these benefits are received by people outside the labor market, what matters to our argument is that they are viewed by workers as part of their compensation (in the neocorporatist literature, sometimes termed the “social wage”). Those most fearful of losing the labor market power of their skills, and hence their ability to secure good health and pension plans through their employer, also will be most concerned about guaranteeing a high level of benefits, even if the benefits are “deferred” to the future.

We assume that transfers come in the form of a flat-rate payment, R, which incorporates the idea in the Meltzer-Richard model that there is a redistributive aspect to social protection.4 Following the terminology in Estevez-Abe, Iversen, and Soskice (2001), one can distinguish between transfers that go to support the income of employed workers, or Wage Protection, and transfers that go to the unemployed, or Unemployment Protection.5 In developing our model we will discuss what happens if R only goes to unemployment protection, but ultimately we assume that all workers receive the same flat-rate subsidy, which may simply be termed Income Protection.

FIGURE 1 The Three States of an Individual in the Labor Market
[Figure not reproduced. Recoverable entries: State III (unemployed), share of labor force γ = p/(p + q), disposable income R; disposable income when only general skills are used, (1 − 2R/w)·g + R.]

Transfers are paid out of a flat-rate tax (t) on all wages. Total per-capita receipts are T, and all receipts are spent on transfers (i.e., we assume balanced budgets). As in the Meltzer-Richard model, taxation is assumed to create work disincentives, captured here by the following simple labor supply function:

$$l(t) = \frac{1}{1 + t}, \tag{1}$$

where l(t) is the number of hours worked or the intensity of effort (the particular form of this function is chosen for mathematical convenience). Define w as average hourly pretax earnings. Then, tax income per capita is

$$T = t \cdot w \cdot l(t) = \frac{t}{1 + t}\, w = R. \tag{2}$$
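
The budget identity can be checked numerically. The following is a minimal sketch, assuming the labor supply function of equation 1 and the balanced budget of equation 2; the function names and the parameter values are illustrative choices, not part of the original article. It confirms that after-tax earnings plus the transfer equal g(1 − 2R/w) + R, the expression the model uses for disposable income.

```python
def labor_supply(t: float) -> float:
    """Hours worked under tax rate t, equation 1: l(t) = 1 / (1 + t)."""
    return 1.0 / (1.0 + t)


def transfer(t: float, w: float) -> float:
    """Per-capita transfer under a balanced budget, equation 2: R = t * w * l(t)."""
    return t * w * labor_supply(t)


def disposable_income(g: float, t: float, w: float) -> float:
    """After-tax earnings of a worker paid g per hour, plus the flat-rate transfer."""
    return (1.0 - t) * g * labor_supply(t) + transfer(t, w)


# Hypothetical values: mean wage w, a worker's wage g, and a tax rate t.
w, g, t = 1.0, 0.8, 0.25
R = transfer(t, w)
# The two expressions for disposable income should coincide.
print(disposable_income(g, t, w), g * (1.0 - 2.0 * R / w) + R)
```

Note that t = 1 corresponds to R = w/2 under this labor supply function, which is why the relevant income threshold once disincentives are present is w/2.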

Figure 1 illustrates the three states of the labor market and shows the disposable (aftertax) income associated with each: I: ŝg, III: R, and II: ĝ. For a given period, there is a probability, p, of losing one’s job and another probability, q, of reemployment. In equilibrium p·e = q·γ, where e is the share of employed workers, and γ is the share of the workforce that is unemployed (e = 1 − γ). This implies that in equilibrium e = q/(p + q). Furthermore, if r is the probability that an employed worker is in State I (i.e., is in a job that uses both general and specific skills), then the equilibrium share of the labor force employed in State I is

$$\alpha = r \cdot e = r \cdot \frac{q}{p + q}. \tag{3}$$

4 This is also a realistic assumption, since aftertax income is distributed significantly more equally than pretax income (see Gottschalk and Smeeding 2000; Huber and Stephens 2001).

5 A third type identified by Estevez-Abe, Iversen, and Soskice (2001) is employment protection, which refers to legal and other barriers to layoffs. This type could be modeled as the probability of keeping a job that fully uses a worker’s skills.


Likewise, the share of the labor force employed in State II is

$$\beta = (1 - r) \cdot e = (1 - r) \cdot \frac{q}{p + q}. \tag{4}$$

The share of the labor force in State III (unemployment) is

$$\gamma = \frac{p}{p + q}. \tag{5}$$
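
As an illustration of equations 3 through 5, the sketch below computes the three equilibrium shares from the job-loss probability p, the reemployment probability q, and the probability r that an employed worker uses her specific skills. The function name and the numerical values are hypothetical.

```python
def state_shares(p: float, q: float, r: float) -> tuple:
    """Equilibrium labor-force shares (alpha, beta, gamma) for States I, II, III."""
    e = q / (p + q)            # share employed
    alpha = r * e              # State I: specific and general skills in use
    beta = (1.0 - r) * e       # State II: only general skills in use
    gamma = p / (p + q)        # State III: unemployed
    return alpha, beta, gamma


# Hypothetical probabilities, chosen only for illustration.
p, q, r = 0.05, 0.45, 0.5
alpha, beta, gamma = state_shares(p, q, r)
print(alpha, beta, gamma)
# Flows out of and into employment balance: p * e equals q * gamma.
print(p * (alpha + beta), q * gamma)
```

The shares sum to one by construction, which is what allows them to be read as lottery probabilities for an individual worker.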


For any individual worker with both specific and general skills, the proportions α, β, and γ can be interpreted as probabilities in a lottery with three possible outcomes. An employed sg worker will therefore seek to maximize the expected utility of income across all three states. If we ignore the discounting of future income (which makes no substantive difference to our results), this is captured by the following utility function:

$$V = \alpha \cdot u(\widehat{sg}) + \beta \cdot u(\hat{g}) + \gamma \cdot u(R), \tag{6}$$

where u(.) is the worker’s utility from net income, which for simplicity we assume is spent on consumption. Using standard assumptions, we impose the following constraints on u:

$$u_c > 0, \quad u_{cc} < 0, \quad \text{and} \quad \lim_{c \to 0} u'(c) = \infty. \tag{7}$$


A number of the results below hold for this general form of utility function (notably the Meltzer-Richard results). Yet, since the insurance function of the social wage will play an important role and since we then need specific conditions on risk aversion, we use the standard assumption of a constant Arrow-Pratt relative risk aversion (RRA) utility function. Specifically,

$$u(c) = \frac{c^{1-a}}{1-a} \quad \forall\, a > 0,\ a \neq 1; \qquad u(c) = \log c \quad \text{for } a = 1. \tag{7a}$$

With these assumptions in mind, we can now determine workers’ utility-maximizing preferences for social protection.
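
To make the constant-RRA assumption concrete, here is a small sketch of the utility function in equation 7a together with a numerical check that its Arrow-Pratt relative risk aversion, −c·u''(c)/u'(c), equals the parameter a. The function names and the finite-difference step are choices made for this illustration.

```python
import math


def crra_utility(c: float, a: float) -> float:
    """Constant relative risk aversion utility; log utility when a = 1."""
    if c <= 0.0 or a <= 0.0:
        raise ValueError("c and a must be positive")
    if math.isclose(a, 1.0):
        return math.log(c)
    return c ** (1.0 - a) / (1.0 - a)


def marginal_utility(c: float, a: float) -> float:
    """u'(c) = c**(-a), valid for every a > 0 including a = 1."""
    return c ** (-a)


def relative_risk_aversion(c: float, a: float, h: float = 1e-5) -> float:
    """Arrow-Pratt RRA, with u'' approximated by a central difference."""
    slope = (marginal_utility(c + h, a) - marginal_utility(c - h, a)) / (2.0 * h)
    return -c * slope / marginal_utility(c, a)


# RRA should come out close to a at any consumption level.
print(relative_risk_aversion(1.5, 2.0), relative_risk_aversion(0.3, 0.5))
```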


Optimizing Social Preferences


We first consider a simple baseline model with no 
insurance effects, no tax disincentives, only general 
skills, and no unemployment (case 1). We then intro- 
duce tax disincentives to get the Meltzer-Richard result 
(case 2) and subsequently add insurance effects (and 
unemployment) to explore the effects of risk aversion 
(case 3). Finally, we show what happens to the demand
for social protection when the composition of skills is 
allowed to vary (case 4). To keep the presentation 
simple, all proofs are in Appendix A. 


Case 1: No Insurance Effects, No Disincentive Effects: The “t = 1” Model. In solving workers’ maximization problem we begin by assuming a labor force with only general skills (s = 1) and no unemployment (e = 1). In the simplest case there are no tax disincentive effects on the number of hours supplied, so that l(t) = 1 rather than l(t) = 1/(1 + t), as we shall subsequently assume.

When s = 1, e = 1, and l(t) = 1, equation 6 reduces to:

$$V = u((1 - t)g + tw) = u(g(1 - R/w) + R) = \frac{1}{1-a}\,(g(1 - R/w) + R)^{1-a}. \tag{8}$$

We want to choose R to maximize V, where R is bounded between R = 0, corresponding to t = 0, and R = w, corresponding to t = 1. Because

$$V_R = (g(1 - R/w) + R)^{-a} \cdot (1 - g/w),$$

and because 0 ≤ R ≤ w, g > w implies that V_R is uniformly negative for all values of R and hence t. Therefore, maximization of V requires t = 0. Thus, voters with income above average will choose a zero tax rate. An analogous argument for g < w implies that voters with income below average will choose the maximum tax rate of 100%. This is the standard result that, in the absence of insurance functions and tax disincentives, voters will want the maximum R (i.e., t = 1) if g < w and a zero R (t = 0) if g > w. If the median voter, M, has an income less than the average income of w, the median voter will always vote for a maximum tax rate. The result is illustrated in Figure 2A.
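
The corner solutions of case 1 can be reproduced with a simple grid search over R. This is a sketch under stated assumptions: utility is the CRRA form with a hypothetical a = 0.5, there are no tax disincentives, and the function name and grid size are illustrative.

```python
def preferred_transfer_case1(g: float, w: float, a: float = 0.5, steps: int = 1000) -> float:
    """Grid-search the R in [0, w] that maximizes V = u(g(1 - R/w) + R)."""
    best_R, best_V = 0.0, float("-inf")
    for i in range(steps + 1):
        R = w * i / steps
        income = g * (1.0 - R / w) + R
        V = income ** (1.0 - a) / (1.0 - a)   # CRRA utility with a != 1
        if V > best_V:
            best_R, best_V = R, V
    return best_R


# A below-average earner picks R = w (t = 1); an above-average earner picks R = 0.
print(preferred_transfer_case1(g=0.7, w=1.0), preferred_transfer_case1(g=1.3, w=1.0))
```

Because disposable income is linear in R with slope 1 − g/w, the maximizer always sits at one end of the interval, which is the all-or-nothing pattern described in the text.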


Case 2: Disincentive Effects, No Insurance Effects: The Meltzer-Richard Model. If we now include the tax disincentive effect that l(t) = 1/(1 + t), we have:

$$V = u\!\left(\frac{g(1 - t)}{1 + t} + \frac{tw}{1 + t}\right) = u(g(1 - 2R/w) + R) = \frac{1}{1-a}\,(g(1 - 2R/w) + R)^{1-a}. \tag{9}$$

This implies

$$V_R = (g(1 - 2R/w) + R)^{-a} \cdot (1 - 2g/w), \tag{10}$$

so in the Meltzer-Richard model, only voters with a g below half the average hourly wage (g < w/2) will vote for a maximum tax rate. As illustrated in Figure 2B, if the median voter has a g above w/2 he will not vote for the maximum tax rate. Because of the simplicity of our tax disincentive function, voters with g below w/2 will vote for t = 1, and voters with g above w/2 will vote for t = 0.6 With the more complex tax disincentive function used by Meltzer and Richard, workers with income in the range [w/2, w] will prefer taxation up to the point at which the benefits to them from redistribution are exactly outweighed by the efficiency costs of tax disincentives. If the median voter is in this range, as the Meltzer-Richard model assumes, then she may vote for a positive tax rate less than 1 (as illustrated in Figure 2).

6 The Meltzer-Richard model has a more general tax disincentive function than that used here. In consequence the tax rate that maximizes tax revenue can be less than 1.
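
The threshold at w/2 follows directly from the sign of equation 10. The sketch below, with illustrative function names and hypothetical values, evaluates V_R and returns the implied corner choice of the tax rate under the simple disincentive function l(t) = 1/(1 + t).

```python
def marginal_value_case2(g: float, R: float, w: float, a: float) -> float:
    """Equation 10: V_R = (g(1 - 2R/w) + R)**(-a) * (1 - 2g/w)."""
    return (g * (1.0 - 2.0 * R / w) + R) ** (-a) * (1.0 - 2.0 * g / w)


def preferred_tax_case2(g: float, w: float):
    """Corner choice implied by the sign of V_R; None when the voter is indifferent."""
    if g < w / 2.0:
        return 1.0     # V_R > 0 everywhere: maximum tax rate
    if g > w / 2.0:
        return 0.0     # V_R < 0 everywhere: zero tax rate
    return None        # g == w/2: V_R is zero for every R


# Hypothetical wages relative to a mean wage of 1.
print(preferred_tax_case2(0.4, 1.0), preferred_tax_case2(0.7, 1.0))
```

The first factor in V_R is always positive, so the sign is set entirely by 1 − 2g/w and does not depend on R or on the risk-aversion parameter.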
One implication of the Meltzer-Richard model is that voter turnout will be positively related to the level of government transfers because nonvoting tends to be concentrated among low-income people (Lijphart 1997). There is some cross-national evidence in support of this proposition (see Franzese 1998), but there is little empirical support for another key implication of [...] evidence). Among advanced countries the relationship is actually the reverse (Bénabou 1996).


Case 3: Disincentive Effects, Insurance Effects: The Insurance Model. Moene and Wallerstein (2001) offer one possible explanation for this puzzle by introducing insurance effects (see also Sinn 1995). For insurance effects to matter, we need at least two states of the world. In contrast to the Moene and Wallerstein (2001) model, in which there are high- and low-wage groups in addition to the unemployed, we now assume that workers can either be employed at a gross wage equal to their “tax-incentivized” skill level g/(1 + t) or be unemployed. There are no specific skills (s = 1), so equation 6 becomes:

$$V = \beta \cdot u(\hat{g}) + \gamma \cdot u(R). \tag{11}$$


FIGURE 2 Four Models of Social Policy Preferences
[Figure not reproduced. Panels: A. The “t = 1” Model; B. The Meltzer-Richard Model; titles of panels C and D not recoverable.]
Note: R = preferred level of social spending; M = income of median voter; w = mean income; RRA = relative risk aversion; s/g = specific to general skills ratio. Arrows indicate preferred level of social spending by the median voter.

In this model, if relative risk aversion is constant and greater than unity, then workers will choose a higher tax rate as they become wealthier. In other words, their aversion to risk outweighs the increased cost to them of insurance as their income increases. Thus, the relationship between income and preferred level of spending is positive, as can be seen in Figure 2C.7

To get this result, we assume that R is only paid to those who are unemployed, so that ĝ = g·(1 − 2R/w). It can then be shown (as we do in Appendix A) that

$$\frac{dR}{dg} < 0 \quad \text{iff} \quad RRA < 1, \tag{12}$$

and

$$\frac{dR}{dg} > 0 \quad \text{iff} \quad RRA > 1. \tag{13}$$
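
The sign pattern in equations 12 and 13 can be illustrated with a closed-form optimum. The sketch below is not the authors' Appendix A proof; it assumes CRRA utility with parameter a, a transfer paid only to the unemployed, and an employed share e = 1 − γ, and it solves the first-order condition γ·R^(−a) = (2eg/w)·(g(1 − 2R/w))^(−a) for R. Function and variable names are illustrative.

```python
def preferred_transfer_case3(g: float, w: float, gamma: float, a: float) -> float:
    """Interior optimum of V = e*u(g(1 - 2R/w)) + gamma*u(R) under CRRA utility."""
    e = 1.0 - gamma
    # The first-order condition fixes the ratio of employed income to R.
    ratio = (2.0 * e * g / (w * gamma)) ** (1.0 / a)
    # Solve g(1 - 2R/w) = ratio * R for R.
    return g / (ratio + 2.0 * g / w)


# Hypothetical values: mean wage 1, unemployment share 0.1.
for a in (0.5, 1.0, 2.0):
    low = preferred_transfer_case3(0.6, 1.0, 0.1, a)
    high = preferred_transfer_case3(0.9, 1.0, 0.1, a)
    print(a, low, high)
```

With these values the preferred R falls with g when a = 0.5, is unchanged when a = 1, and rises with g when a = 2, matching the direction stated in the two conditions.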


A key implication of the result for RRA > 1 is that, contrary to the Meltzer-Richard model, provided that the income distribution is skewed to the right, a mean-preserving increase in inequality will reduce the median voter’s preferred level of social protection. The reason is that a rise in inequality lowers the income of the median voter, and since the insurance motive dominates the redistribution motive (RRA > 1), demand for social protection will decline. In the Meltzer-Richard model there is no insurance motive, so a fall in the income of the median voter always leads to a rise in the demand for social protection. Risk aversion is thus one potential explanation for the empirically puzzling link between income equality and redistributive spending.

7 This differs from Moene and Wallerstein (2001) because in their model workers with high income are assumed not to be exposed to labor market risks and hence prefer zero spending and taxation. Since we do not make any distinctions in the risk exposure of different income groups, we refer to case 3 as the insurance model rather than the Moene-Wallerstein model. In the empirical section we test this insurance model, not the Moene-Wallerstein model (which, like our own argument, is consistent with both positive and negative effects of income when tested across the entire income scale).

Yet, despite the neatness of this result, our econo- 
metric estimations clearly reject its implication that 
people with lower income prefer less redistribution. 
This leaves the negative relationship between redistri- 
bution and inequality as an important unsolved issue 
for comparative political economy. We discuss below 
how our distinction between specific and general skills 
permits an alternative and more plausible interpreta- 
tion. 


Case 4: Disincentive Effects, Insurance Effects, Specific and General Skills: The Asset Model. This is the most general model and requires us to consider all three states in Figure 1. We therefore return to the present value of utility given by equation 6:

$$V = \alpha \cdot u(\widehat{sg}) + \beta \cdot u(\hat{g}) + \gamma \cdot u(R), \tag{6}$$

where

$$\widehat{sg} = sg \cdot (1 - 2R/w) + R,$$

and

$$\hat{g} = g \cdot (1 - 2R/w) + R.$$

In addition, we introduce an important variable, expected hourly income before taxes and transfers, y. This is simply defined as

$$y \equiv \alpha \cdot sg + \beta \cdot g.$$

We then proceed to ask three questions of the model. First, up to what value of y is the chosen R maximal, that is, t = 1? Second, under what RRA conditions does R fall or rise as y rises above this value? Third, what happens to the choice of R as the balance of general and specific skills changes, holding y constant?

In the first result we show that a worker will only choose the maximum tax rate if y ≤ w/2. Stated formally (the proof is in Appendix A):

RESULT 1: Given the assumptions of the asset model, t = 1 if y ≤ w/2.

With this property established, we now consider what happens to R when y increases. As we show in Appendix A, this yields the following result:

RESULT 2: Given the assumptions of the asset model, holding s constant and with y > w/2,

$$\operatorname{sgn} \frac{\partial R}{\partial y} = \operatorname{sgn}\!\left[RRA(sg) - \frac{sg}{sg - w/2}\right].$$

What this equality says is that the direction of the relationship between R and income (the sign, sgn, of ∂R/∂y) depends on the level of risk aversion, just as in the simple insurance model. In order for income to be positively related to support for spending, however, the RRA requirement is more stringent (RRA > sg/(sg − w/2)) than before (RRA > 1). The reason is that R now goes to the employed as well as to the unemployed, and since employed workers in the insurance model only have an insurance incentive in relationship to unemployment, RRA must be higher for the insurance motive to dominate the redistribution motive. This implication is also demonstrated by Moene and Wallerstein (2001).

Now we come to the critical result that differentiates 
our approach from previous ones. Central to our 
argument is the proposition that an increase in specific 
skills relative to general skills, holding constant the 
level of expected income, implies an increase in pre- 
ferred R. Put broadly, workers with specific skills will 
prefer higher taxes and social protection than workers 
with general skills. The following result is also proved 
in Appendix A: 

RESULT 3: Assuming a constant relative risk aversion utility function and RRA > 0, ∂R/∂s > 0, holding y constant.


In other words, as s rises, the preferred level of R 
also rises. The intuition behind this key result is that 
workers with specific skills have more to fear if they 
lose their job than workers with general skills. Workers 
with specific skills who are laid off face the risk of being 
reemployed in a sector in which their skills are not 
needed. If this happens they will lose some of their 
previous income, including employer-provided insur- 
ance against illness and old age. General skill workers 
do not face this problem because they are always 
compensated at the value of their general skills. Hence, 
the more income derived from specific as opposed to 
general skills (that is, the higher the ratio s/g), the
greater is the demand for income protection (R). The 
logic, as illustrated in Figure 2D, implies that the 
median voter’s support for social protection depends 
on the composition of her skills. 
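
The comparative static in result 3 can be explored numerically. The following is a sketch, not the proof in Appendix A: it assumes CRRA utility (so u'(c) = c^(−a)), takes the state shares as given, and finds the preferred R by bisection on V_R, which is decreasing in R because V is concave. Expected income y is held fixed by adjusting g as s changes. All names and parameter values are hypothetical.

```python
def marginal_value(R, s, g, w, shares, a):
    """V_R for the asset model under CRRA utility, where u'(c) = c**(-a)."""
    alpha, beta, gamma = shares
    keep = 1.0 - 2.0 * R / w                  # after-tax earnings per unit of wage
    income_1 = s * g * keep + R               # State I disposable income
    income_2 = g * keep + R                   # State II disposable income
    return (alpha * income_1 ** (-a) * (1.0 - 2.0 * s * g / w)
            + beta * income_2 ** (-a) * (1.0 - 2.0 * g / w)
            + gamma * R ** (-a))              # State III: income is R itself


def preferred_transfer(s, y, w, shares, a, iterations=200):
    """Preferred R for skill ratio s, holding expected income y fixed."""
    alpha, beta, _ = shares
    g = y / (alpha * s + beta)                # from y = alpha*s*g + beta*g
    if y <= w / 2.0:
        return w / 2.0                        # Result 1: t = 1, i.e. R = w/2
    low, high = 1e-12, w / 2.0                # V_R > 0 near zero, < 0 at w/2
    for _ in range(iterations):
        mid = 0.5 * (low + high)
        if marginal_value(mid, s, g, w, shares, a) > 0.0:
            low = mid
        else:
            high = mid
    return 0.5 * (low + high)


shares = (0.45, 0.45, 0.10)   # hypothetical alpha, beta, gamma
general = preferred_transfer(s=1.0, y=0.8, w=1.0, shares=shares, a=1.0)
specific = preferred_transfer(s=2.0, y=0.8, w=1.0, shares=shares, a=1.0)
print(general, specific)
```

With log utility and these values, the worker with s = 2 prefers a higher R than the worker with s = 1 at the same expected income, which is the direction result 3 predicts. The corner case also follows from the code's logic: at R = w/2 every state pays w/2, so V_R has the sign of 1 − 2y/w.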


Summary. With the simplest set of assumptions— 
only one state of the world (employment), only general 
skills, and no tax disincentives—the politics of social 
spending is all about redistribution or class politics. 
Those with a wage below the mean will want a maxi- 
mum rate of taxation (t = 1), whereas those above the 
mean will want zero taxation. If we add tax disincen- 
tives, however, the cost of redistribution may deter 
low-income workers closest to the mean from demand- 
ing confiscatory taxation, and the median voter is likely 
to be among those workers. This is the Meltzer- 
Richard model. 

When an unemployment state is added, an entirely new motive enters into workers’ calculations of their interests: insurance against loss of income. If workers are sufficiently risk averse, and if all transfers go to the unemployed, then rising income may be associated with increased demand for protection, since high-income workers have more to lose than low-income workers. This is the simple insurance model. If some transfers go to the employed, however, then the threshold of risk aversion for which this relationship holds rises, since transfers to the employed only serve redistributive purposes.

Finally, when skill specificity is introduced, which 
requires at least two employment states (State I and 
State II in our model), the insurance motive plays a
crucial role—even when workers are only moderately 
risk averse (0 < RRA < 1) and even when transfers are 
distributed to both employed and unemployed work- 
ers. The reason is that employed workers risk losing the 
income from their specific skills, regardless of their 
exposure to unemployment. This coupling between 
skills and demand for insurance thus transforms the 
relationship between income and social policy prefer- 
ences. The next section explores whether this proposi- 
tion is supported by empirical evidence from public 
opinion surveys in eleven advanced democracies. 


TESTING THE MODEL 


Statistical Model 


In our asset model the relationship between the “preferred” level of R and the two exogenous variables, y (expected income) and s (skill specificity), is given by two implicit equations:

$$V_R(R, s, g) = 0, \quad \text{and} \quad y = \alpha \cdot sg + \beta \cdot g. \tag{14}$$

From equation 14 we derived result 2,

$$\frac{\partial R}{\partial y} < 0 \quad \text{if} \quad 0 < RRA < \frac{sg}{sg - w/2},$$

and result 3,

$$\frac{\partial R}{\partial s} > 0 \quad \text{if} \quad 0 < RRA.$$

We show in Appendix B that

[...]

is the first-order Taylor expansion of equation 14. Thus, our regressions take the form

$$R = k + b \cdot y + c \cdot s. \tag{15}$$

By implication, if our estimate of b is significantly different from zero and negative, we can infer that 0 < RRA < sg/(sg − w/2).


independent of cyclical variations in the unemployment rate. This implication can be tested through multilevel modeling, as discussed

More generally, the asset model encompasses the other three. Hence, we can test for these models as well. The first model (Meltzer-Richard without tax disincentives) implies that b = c = 0. The Meltzer-Richard model, with tax disincentives, implies that b < 0 and c = 0. The insurance model, with

$$RRA > \frac{sg}{sg - w/2},$$

implies that b > 0 and c = 0.9
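
The mapping from estimated coefficients to models can be written out as a small lookup. This is only an illustrative sketch of the sign patterns stated in the text; the function name and the convention of coding each coefficient as −1, 0, or +1 (significantly negative, not distinguishable from zero, significantly positive) are assumptions made here.

```python
def consistent_model(b_sign: int, c_sign: int) -> str:
    """Name the model whose predicted signs match the estimates of b and c."""
    if c_sign > 0:
        # Result 3: skill specificity raises preferred R; the sign of b
        # then depends on the level of relative risk aversion.
        return "asset model"
    if c_sign == 0:
        if b_sign == 0:
            return "Meltzer-Richard without tax disincentives"
        if b_sign < 0:
            return "Meltzer-Richard with tax disincentives"
        return "insurance model with RRA > sg/(sg - w/2)"
    return "none of the four models"


# Example: income enters negatively and skill specificity positively.
print(consistent_model(b_sign=-1, c_sign=1))
```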


The Data 


We use data from eleven advanced democracies ob- 
tained from two sets of national mass surveys con- 
ducted under the auspices of the International Social 
Survey Program (ISSP), one in 1996 and the other in 
1997 (ISSP 1999, 2000).10 These surveys offer by far the
best individual-level data on skills and preferences for 
social protection. We supplement this information with 
economywide unemployment data. The next two sec- 
tions describe the operationalization of the dependent 
and independent variables. 


Dependent Variables. The 1996 survey contains four
spending questions that closely match our theoretical 
emphasis on income protection (R). Three appear in a 
cluster that asks whether the respondent would like to 
see more or less government spending on unemploy- 
ment benefits, health care, and pensions (see Appendix 
C for details). Pertinent to an assumption in our model 
is the fact that respondents were warned that more 
spending may require higher taxes. The fourth variable 
is based on an item that asks whether the respondent 
favors government spending on declining industries for 
the purpose of protecting jobs (see Appendix C for 
details). This question is as much about job security as 
it is about income security, but the two are obviously 
closely related, and we expect those with specific skills 
to be more concerned than those with general skills 
about keeping their present job and income. Moreover, 
although respondents were not explicitly told about the 
potential costs of government subsidies, such subsidies 
are widely acknowledged to be problematic for eco- 
nomic efficiency. 

The survey also asked about more or less spending on “culture and the arts” and “the environment.” These policy areas are clearly unrelated to social protection, but they are nevertheless relevant to our argument. It is often claimed that better general education is associated with increased support for spending on “postmaterialist” activities, whereas our theory says that it is related to reduced support for spending in the social policy area (cf. Kitschelt 1991).11 Because one might object that our findings for skills reflect general ideological opposition to government spending among those with more years of formal education, it is useful to show that the relationship between skills and support for spending varies by policy area.

9 Any general insurance model is, of course, consistent with ours. The hypothesis about the insurance model we are testing is the particular version (case 3) together with the assumption of RRA > sg/(sg − w/2). This solves the inequality-spending puzzle by implying a positive relationship between income and support for redistributive spending.

10 The countries are: Australia, Britain, Canada, France, Germany, Ireland, Italy, the Netherlands, Norway, Sweden, and the United States. Japan, covered in both ISSP surveys, could not be included because of missing data on a key occupational variable (explained below).

For economy of presentation we use confirmatory 
factor analysis (CFA) to create two indexes: one for 
social spending and one for postmaterialist spending 
(both constructed to have a standard deviation of one). 
The adjusted goodness of fit index of the CFA is 0.94 
and varies little by country (the range is 0.90-0.98). 


Independent Variables. Our two different approaches 
to the measurement of skill specificity reflect different 
aspects of the theoretical model. The first classifies 
workers’ skills, or the skills required to perform certain 
jobs, according to their degree of specialization or 
specificity. This is an attempt to gauge s directly. The 
second starts from the model assumption that the 
difficulty of finding a job in which one’s skills are 
needed is proportional to their specificity. This is an 
attempt to gauge s indirectly through rq: the probability 
of reemployment into State I. 

The first approach is based on the latest detailed classification of occupations by the International Labor Office: the International Standard Classification of Occupations (ISCO-88). ISCO-88 bases classification on two criteria: the level of skills required for an occupation and the degree of specialization of those skills. A distinction is made among four broad skill levels, which are a function of “the range and complexity of the tasks involved” and explicitly depend on informal as well as formal training (ILO 1999, 6). Skill level thus corresponds to (s + g) in our model. All other distinctions are based on the specialization of skills required to carry out particular jobs, reflecting “the type of knowledge applied, tools and equipment used, materials worked on, or with, and the nature of the goods and services produced” (p. 6). Guided by this logic, the subdivision of skills proceeds through four levels of aggregation until a high degree of skill homogeneity is reached within each group.13 At the most disaggregated level, called the unit level, there are 390 occupational categories (unit groups) with highly specific job descriptions.

11 Duch and Taylor (1993) make a similar argument concerning postmaterialist attitudes (although they do not directly discuss spending).

12 We used LISREL version 8.5 for the CFA and the resulting factor [...] indicators of two latent spending variables: social and postmaterialist spending. The factor loadings for the latent variable are as follows: social spending: .52 (subsidies to protect jobs), .48 (health insurance), .58 (pensions), and .55 (unemployment insurance); postmaterialist spending: .58 (environment) and .51 (culture and the arts). Alternatively, the indexes can be constructed from the results of fitting CFA to the covariance matrixes for individual countries, but the regression results are only marginally affected. These results, the results for each spending area separately, and the country-level CFA results are available from the authors.

13 We are not claiming that homogeneity is equivalent in every unit group. Yet, skills that are clearly distinct from one another are unlikely to be in the same group at the most disaggregated level, and major groups with a highly diverse skill structure will tend to have more minor and unit groups.

Since the occupation of every respondent in the ISSP 
surveys was classified according to ISCO-88 at either 
the most detailed or second most detailed level (for 
exceptions, see Appendix C), we can exploit the skill- 
based hierarchical structure of ISCO-88 to capture the 
specialization of workers’ skills. We accomplish this by 
comparing the share of unit groups in any higher level 
class to the share of the workforce in that class. The 
logic is that the number of unit groups in any higher 
level class will be a function of (1) the size of the labor 
market segment captured by that class and (2) the 
degree of skill specialization of occupations found in 
that particular labor market segment. For example, 8% 
of the workforce across our countries is classified as 
“plant and machine operators and assemblers” (major 
group 8), but this group accounts for 70 of the 390 unit 
groups, or 18%. If occupations at the unit level are, on 
average, equally homogeneous in terms of skills, then 
the disproportionate share of unit groups in major 
group 8 will reflect a high degree of skill specialization 
within that major group. By dividing the share of unit 
groups (.18) by the share of the labor force (.08), we 
can generate a measure of the average skill specializa- 
tion within that particular major group (2.1). This 
calculation also can be done at the lower submajor 
level, and we have used the mean of these calculations 
to get a proxy for s.15 The resulting variable has 27
values, ranging from 0.4 to 4.7. 
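The major group 8 calculation can be sketched in a few lines of Python. This is only an illustration of the ratio described above, not the authors' code; the function name is hypothetical, and the inputs are the rounded figures quoted in the text. With the rounded 8% labor force share the ratio comes out at roughly 2.2 rather than the 2.1 reported above, presumably because the reported value rests on unrounded shares.

```python
# Rounded figures for ISCO-88 major group 8 as quoted in the text.
TOTAL_UNIT_GROUPS = 390


def skill_specialization(unit_groups_in_class, total_unit_groups, labor_force_share):
    """Share of unit groups in a class divided by that class's labor force share."""
    unit_group_share = unit_groups_in_class / total_unit_groups
    return unit_group_share / labor_force_share


# 70 of 390 unit groups (about 18%) against 8% of the workforce.
s_major_8 = skill_specialization(70, TOTAL_UNIT_GROUPS, 0.08)
print(round(s_major_8, 2))
```

A value above 1 indicates that the class contains more unit groups than its share of the workforce would suggest, which the text reads as higher average skill specialization.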

Because the theoretical concept of skill specificity is 
a relative variable, the final step is to divide the 
absolute skill specialization measure, s, by the ISCO 
measure of the level of skills.16 This gives us a proxy for
s/(s+g) which we will refer to as Skill Specificity 1 or s1.
Alternatively, we can divide s by a proxy for people’s
general skills, g, which gives us a measure for s/g. We 
call this alternative measure Skill Specificity 2 or s2. The 
proxy for g that we use is the respondent’s highest 
academic degree as recorded by the respondent (see 
Appendix C for details). 

The second approach to measuring skill specificity is 
based on the observation that the probability of moving 
from any particular job into one that makes use of a 
worker's skills (State I) is rq for specific skills workers 
and q for general skills workers, where r < 1. If we 
conceive of rq as an element in the continuum (0, q], r 
would then be a measure of the asset-specificity of a 


14 Unit group 3144, for example, represents “air traffic controllers,” 
within the minor group “ship and aircraft controllers and technicians,”
itself one of five categories in the major group “technicians
and associate professionals.” 

15 The sensitivity of s to small differences in the number of unit 
groups assigned to each higher level group is greater at lower levels 
of aggregation, and these differences may not accurately reflect 
differences in skill specificity. This source of error is minimized at the
highest level of aggregation. Yet, the greater variance of the measure 
at lower levels of aggregation helps reduce the standard error on the 
estimated parameter for the skill variable. 

16 An absolute measure of s generates results that are downward 
biased. At the limit, if the (unknown) correlation between s and g is 
1, s will have no effect on preferences. It is therefore important to 
develop relative measures. 
TABLE 1.
Variable   Definition
s1         (Share of ISCO-88 level 4 groups)/(share of labor force) divided by ISCO level of skillsᵃ
s2         (Share of ISCO-88 level 4 groups)/(share of labor force) divided by level of general educationᵃ

worker’s skills. At the heart of the concept of job
specificity is the idea that outside options are more
limited for workers with specific skills than for workers
with general skills.

The 1997 ISSP survey contains a question that
precisely taps the assessment of outside options: “If
you were looking actively, how easy or difficult do you
think it would be for you to find an acceptable job?”
The response choices were: “very easy,” “fairly easy,”
“neither easy nor difficult,” “fairly difficult,” or “very
difficult.” The difficulty of finding an acceptable job is
likely to be related to skill portability. High specificity
means that relatively few jobs use those skills, and the
number of openings is also likely to be small because
asset-specific investments by employers and employees
tend to lengthen tenure and limit turnover. In addition,
the probability of finding an appropriate job close to
home, which is also a likely component of what people
consider “acceptable,” depends on the number of
openings in a given geographical area.17 All else equal,
the item about finding an acceptable job is thus likely



















about individual work histories and employment conditions
in particular labor market niches, the question
is about as good a measure of rq as is possible. We refer
to it as Skill Specificity 3 or s3.

There is an ambiguity in the relationship of s3 to the
theoretical concept of s. We cannot know for certain
whether survey responses reflect a person’s absolute
level of specific skills or the relative share of his skills
that are specific. To make sure that the skill measure is
a relative measure, as required by the theoretical
model, we can divide s3 by g. We call this alternative
measure Skill Specificity 4 or s4. If s3 is already a relative


17 Scheve and Slaughter (2001) argue, and
show empirically, that home ownership can be treated as a


Summary of Independent Skill Variables
Intercorrelations

s3         Response to question about difficulty of finding an acceptable job     .38   .53   –
s4         s3 divided by level of general education


ᵃ Shares are calculated at both the first and second ISCO-88 level and then
ᵇ Before calculating the intercorrelations, the number of categories for s3 and s4 were reduced to the same number as for s1 and s2.
Not clear whether this is a measure
of absolute or relative share of
skills


Assumes that s3 measures
absolute skills (although s4 will
always be a relative measure)












measure, we simply get another relative measure that 
should also be positively related to preferences for 
social spending. 

The different skill measures and their intercorrela- 
tions are listed in Table 1. Not surprisingly, the corre- 
lations are higher between measures that use either the 
survey question or the ISCO classification. The lowest 
correlations are between s3 and either s1 or s2. To some
extent this may reflect that s3 is an absolute rather than
a relative measure, but the main reason is simply that s3
is influenced by a number of factors (such as how much 
people like their current coworkers) that are unrelated 
to either skills or policy preferences. These 
factors will wash out in the regression, but they reduce 
the correlation with the other measures. To facilitate 
comparison of the effects of the different variables in 
the subsequent regression analysis, all proxies for s 
have been divided by their standard deviation. 
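The rescaling step can be illustrated with a short sketch. It is a minimal example under stated assumptions: the input values are hypothetical, and the use of the sample standard deviation is our assumption, since the text does not say which estimator was used.

```python
from statistics import stdev


def scale_by_sd(values):
    """Divide every value by the sample standard deviation of the series."""
    sd = stdev(values)
    return [v / sd for v in values]


# Hypothetical raw values of one skill proxy (not taken from the survey data).
raw = [0.4, 1.2, 2.1, 4.7]
scaled = scale_by_sd(raw)
print(round(stdev(scaled), 6))
```

After rescaling, each proxy has a standard deviation of one, so a regression coefficient can be read as the effect of a one standard deviation change in that proxy.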

One final methodological issue needs to be addressed.
Because the item used as the basis for s3 and
s4 was asked only in the 1997 survey, whereas all the
questions about spending were asked only in the 1996
survey, it was necessary to “translate” the 1997 information
on s3 so it could be used in the 1996 survey (s4
can always be calculated from s3). For this purpose we
calculated averages for s3 at the three-digit ISCO-88
level in the 1997 survey and then assigned these values
to individuals in the 1996 survey based on their three-digit
ISCO classification in that survey.18 Since the
classification of occupations is based on the skills
required, it is reasonable to expect that the original 
information about s is preserved to a considerable 
extent in this translation. Moreover, because the 1996 
and 1997 samples are drawn from the same populations,
we show in Appendix D that s3, averaged by


18 There are 116 unique groups at the three-digit level. The more 
fine-grained four-digit level is not available for some countries and 
sometimes contains a large number of empty categories. 

19 With the minor exception that people who turned 18 in the
interim were not eligible for the 1996 survey. 
ISCO three-digit groups, is an unbiased estimator for
the original variable. 
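The "translation" of s3 from the 1997 survey to the 1996 survey can be sketched as a group-mean lookup. The code below is an illustration only: the function names, the toy records, and the choice to return None for an occupation code with no 1997 respondents are all assumptions, not a description of the actual procedure.

```python
from collections import defaultdict


def group_means(records):
    """Average s3 within each three-digit ISCO-88 code (1997 respondents)."""
    totals = defaultdict(lambda: [0.0, 0])
    for isco3, s3 in records:
        totals[isco3][0] += s3
        totals[isco3][1] += 1
    return {code: total / n for code, (total, n) in totals.items()}


def assign_group_means(isco_codes, means):
    """Give each 1996 respondent the 1997 mean for their code (None if unseen)."""
    return [means.get(code) for code in isco_codes]


# Hypothetical (three-digit code, s3 response) pairs from the 1997 survey.
survey_1997 = [("314", 4), ("314", 5), ("241", 2), ("241", 1), ("241", 3)]
# Hypothetical three-digit codes of respondents in the 1996 survey.
survey_1996_codes = ["314", "241", "999"]

means_1997 = group_means(survey_1997)
s3_1996 = assign_group_means(survey_1996_codes, means_1997)
print(s3_1996)
```

Every 1996 respondent in the same three-digit group receives the same value, so within-group variation in s3 is lost while between-group variation is carried over.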

In addition to the skill variables (s1-s4), we used
self-reported pretax and pretransfer income as a proxy 
for y (converted into dollars at 1996 exchange rates and 
square root transformed), as well as the following set of 
controls. 


Age. Older workers are likely to be more concerned 
with job security and income than are younger workers 
because their time to retirement is shorter, and their 
ability to find new employment is likely to be more 
limited. 

Gender. As argued by Orloff (1993) and Estevez-Abe, 
Iversen, and Soskice (2001), women may demand more 
protection than men in comparable jobs because they 
need to not only leave but also return to the labor 
market for the purpose of child rearing. 


Union Membership. A main function of unions is to 
insure members against labor market risks, so it is 
reasonable to expect that union members are particu- 
larly concerned with social protection (see, e.g., Korpi 
1989). 


Part-Time Employment. On the one hand, part-time 
employees are often in vulnerable labor market posi- 
tions, and this may cause particular concern for job 
security and income protection. On the other hand, 
they depend more on flexible labor markets to generate 
nonstandard jobs, which suggests a countervailing ef- 
fect. 


Nonemployed. Esping-Andersen (1999) argues that 
some outsiders may share an interest in social and 
economic policies that maximize their ability to enter 
employment, but this is an extremely heterogeneous
group that may not have common policy preferences. 
We need to include the variable to control for the 
possibility that the nonemployed have very different 
attitudes from the employed. 


Unemployed. The obvious expectation is that the un- 
employed, relying as they do on transfers, will support 
a high level of income protection. 


Self-Employment. The self-employed are expected to 
favor free markets and a low level of social protection 
because they depend on flexible labor markets and 
often on relatively low-paid workers. 


Information. It is conceivable that better information 
about the economy yields particular views on the 
desirability of social spending. There was an intense 
public debate about the proper role of the state in the 
1990s, and it can be argued that better informed people 
may have adopted the predominant view, which tended 
to see budget cuts as necessary on efficiency grounds
(corresponding to a higher cost of distortionary taxa- 
tion in our model). Information is measured by 
respondents’ subjective understanding of politics (see 
Appendix C for details). 


20 This idea was suggested by an anonymous reviewer.
Left-Right Position. Attitudes toward social protection
may partly reflect ideological predisposition or perhaps 
the socializing effects of political parties.21 We control 
for this possibility by including positions on a Left- 
Right scale based on the respondent's declared party 
support (see Appendix C for details). 


National Unemployment. Although our theory implies 
that people discount cyclical unemployment, it still may 
affect individual-level social preferences. Testing this 
requires a multilevel modeling procedure, with coun- 
tries as level 1 and individuals as level 2. Collapsing 
both levels into a single equation (as shown in Appen- 
dix E) implies the inclusion of the product variables 
Uj · y and Uj · s in the regression model, where Uj is
the rate of unemployment in country j (see Appendix C 
for details on measurement). 


Findings 


We estimated the regression model in equation 15 on 
all countries (technically speaking, as a single-stage 
multilevel procedure to incorporate the possible effect 
of national macroeconomic conditions).22 To cope with
problems of missing observations we used a multiple
imputation technique developed by Honaker and others
(1999). This strategy is superior to the traditional
approach of listwise deletion, which is both inefficient
and potentially biased (King et al. 2001).23 The following
presentation is divided into a section on the key
results and a section on their robustness, which also 
discusses potential objections to the way we interpret 
the results. 


The Basic Results. To give a sense of the central 
tendency of the estimates, Table 2 shows the results 
from a pooled analysis, including a full set of country 
dummies. Because the Italian survey was conducted in 
1990 and lacks information on several of the control 
variables, it was not included in the calculation. In the 
next section we show that the results for Italy are 
consistent with those presented in Table 2. 

The model in column 1 uses the average of the 
four measures of skills, Skill Specificity Composite or
s_composite, as a summary variable for skill composition.
The next four columns show the results for each of the
component measures (s1-s4). Model 6 is identical to 1
except that the regression now includes union membership
as an independent variable. Since union membership
was not recorded in Australia, the estimation
of model 6 excludes this country.

In interpreting the results, first note that the parameters
for income, y, and the four measures of skill, s1-s4,
are in the predicted direction and highly statistically 
significant. The negative effect of income implies that 


21 This was suggested by an anonymous reviewer. We note that party
support may in part be related to skills. If so, the effect of skills
will be underestimated by the parameter for s.

22 All data analysis was done using Stata 6.0 for Windows. 

23 In practice, however, our results are very similar to those obtained
by using listwise deletion. The effects of our theoretical variables 
tend to be slightly stronger when we use that method, but the 
standard errors are also larger. 
TABLE 2. Support for Social Spending among the Publics of Ten Countries, 1996

                               (1)         (2)         (3)         (4)ᵃ        (5)ᵃ        (6)ᵇ
Income                      -0.0033**   -0.0036**   -0.0038**   -0.0044**   -0.0035**   -0.0036**
                            (0.0002)    (0.0002)    (0.0002)    (0.0002)    (0.0002)    (0.0002)
s_composite                  0.233**       –           –           –           –         0.219**
                            (0.014)                                                     (0.013)
Skill Specificity 1 (s1)       –         0.148**       –           –           –           –
                                        (0.010)
Skill Specificity 2 (s2)       –           –         0.150**       –           –           –
                                                    (0.010)
Skill Specificity 3 (s3)       –           –           –         0.105**       –           –
                                                                (0.013)
Skill Specificity 4 (s4)       –           –           –           –         0.218**       –
                                                                            (0.014)
Age                          0.0029**    0.0043**    0.0034**    0.0042**    0.0018**    0.0027**
                            (0.0006)    (0.0006)    (0.0005)    (0.0006)    (0.0006)    (0.0006)
Gender (female)              0.215**     0.208**     0.205**     0.124**     0.148**     0.198**
                            (0.018)     (0.018)     (0.018)     (0.019)     (0.019)     (0.019)
Union membership               –           –           –           –           –         0.185**
                                                                                        (0.023)
Part-time employment        -0.029      -0.041      -0.033      -0.076*     -0.058      -0.031
                            (0.028)     (0.028)     (0.028)     (0.031)     (0.031)     (0.029)
Unemployed                   0.293**     0.313**     0.311**     0.320**     0.309**     0.325**
                            (0.041)     (0.041)     (0.042)     (0.047)     (0.046)     (0.043)
Nonemployed                 -0.079**    -0.081**    -0.086**    -0.080**    -0.074**    -0.038
                            (0.025)     (0.025)     (0.025)     (0.026)     (0.026)     (0.026)
Self-employed               -0.232**    -0.235**    -0.250**    -0.222**    -0.201**    -0.184**
                            (0.029)     (0.028)     (0.028)     (0.028)     (0.029)     (0.027)
Informed                    -0.041**    -0.045**    -0.047**    -0.069**    -0.050**    -0.043**
                            (0.008)     (0.008)     (0.008)     (0.010)     (0.010)     (0.009)
Left-Right party support    -0.050**    -0.051**    -0.050**    -0.047**    -0.047**    -0.041**
                            (0.004)     (0.004)     (0.004)     (0.005)     (0.005)     (0.005)
Uj · y                      -0.0002**   -0.0002**   -0.0002*    -0.0003*    -0.0004*    -0.0003**
                            (0.0001)    (0.0001)    (0.0001)    (0.0001)    (0.0001)    (0.0001)
Uj · s                      -0.008      -0.004      -0.002      -0.008      -0.012*     -0.012*
                            (0.005)     (0.004)     (0.004)     (0.005)     (0.005)     (0.004)
Adj. R²                      0.21        0.20        0.20        0.18        0.20        0.22
N                           14,101      14,101      14,101      10,956      10,958      11,950

Note: All regressions include a full set of country dummies (not shown). Standard errors are in parentheses. *p < .05, **p < .01.
ᵃ Excludes Australia, Ireland, and Italy, for which data are not available.
ᵇ Excludes Australia, for which union membership data are not available.


risk aversion is not sufficiently high to make public
demand for transfers rise with income. Technically
speaking, RRA < 5g/(sg − w/2), which means that the
Meltzer-Richard redistribution logic dominates the
insurance logic. As expected, the relationship is little
affected by differences in national unemployment rates,
despite their considerable variation in the survey year.
Thus, an increase of one standard deviation in unemployment
would only change the parameter on y from
.0033 to .0038.

Yet, for our purposes the key finding is the positive
effect of specific skills on preferences for spending







(which implies that RRA > 0). Each of the four 
(standardized) skill variables is associated with signifi- 
cantly higher support for spending, and three of the 
four measures exhibit similar magnitudes of effects. 
Again, these relationships hold for all levels of unemployment,
as can be seen from the negligible parameters
for Uj · s.24 The parameter for s3 is lower than for
the other measures, which is not entirely unexpected, 
because this variable may capture absolute rather than 


24 For example, an increase of one standard deviation in Uj only
reduces the effect of s_composite from .23 to .22.
25 In fact, the correlation between s3 and a measure of g based on
general education is close to 0 in our data, which implies that the
estimated effect of s3 is half the “true” effect of skills.


TABLE 3. Estimated Magnitude of the Effects of Independent Variables

                            Proportion of Explained Variance (%)ᵃ    Effect of One SD Change (95% Confidence Interval)
                            Lower Bound      Upper Bound             Lower Bound      Upper Bound
Income                          11               51                    -0.22            -0.19
s_composite                     26               38                     0.19             0.22
Age                              1                2                     0.03             0.05
Gender (female)                  6               17                     0.08             0.11
Union membership                 1                4                     0.07             0.08
Part-time employment             0                0                    -0.02            -0.00
Unemployed                       3                9                     0.06             0.08
Nonemployed                      0                8                    -0.03            -0.01
Self-employed                    2                8                    -0.07            -0.05
Informed                         1                8                    -0.05             0.04
Left-Right party support         5                5                    -0.09            -0.07
Uj · y                           1               13
Uj · s                           0                8
Income + s_composite            38               73                     0.38             0.44
All controls combined           27               52                     0.36             0.47

ᵃ Increase in explained variance by each variable as a proportion of the total explained variance of all (nondummy) variables (based on model 6 in Table 2).


relative endowment of specific skills (or a combination
of both). For all correlations between s and g that are
greater than −1, absolute measures of s will yield lower
parameter estimates than relative measures.25

Considering the very different approaches to measuring
skills, it is reassuring that the results are consistent
across definitions. Yet, statistically significant effects
do not necessarily imply large substantive effects.
In Table 3 we estimate the portion of the explained
variance for each independent variable as well as the
effect on preferences of a change in each variable of
one standard deviation. The estimates are based on the
results of model 6 in Table 2, which includes all the
relevant variables.

Although we cannot precisely attribute the proportion
of explained variance to each independent variable,
we can calculate the likely ranges. The upper
bound is found by recording the increase in explained
variance (measured as a percentage of the total) when
a variable is included as the first predictor (apart from
the country dummies). This number encompasses every
direct, indirect, and spurious effect of the variable.
The lower bound is calculated as the increase in
explained variance (as a percentage of the total) when
a variable is entered as the last predictor. This procedure
eliminates all hypothesized individual-level spurious
effects of the variable, but it also discounts all
possible indirect effects. The true explanatory power of 
any variable is likely to be somewhere in between these 
bounds. 
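The bounding logic can be sketched as follows. This is a simplified illustration rather than the procedure actually used: it fits ordinary least squares with an intercept only (no country dummies), reports raw R² increments instead of percentages of total explained variance, and uses hypothetical data and function names throughout.

```python
def r_squared(y, columns):
    """R^2 from an OLS fit of y on an intercept plus the given predictor columns."""
    n = len(y)
    X = [[1.0] + [col[i] for col in columns] for i in range(n)]
    k = len(X[0])
    # Augmented normal equations: (X'X | X'y)
    A = [[sum(X[r][i] * X[r][j] for r in range(n)) for j in range(k)]
         + [sum(X[r][i] * y[r] for r in range(n))] for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting
    for c in range(k):
        p = max(range(c, k), key=lambda r: abs(A[r][c]))
        A[c], A[p] = A[p], A[c]
        for r in range(k):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    beta = [A[i][k] / A[i][i] for i in range(k)]
    fitted = [sum(b * x for b, x in zip(beta, row)) for row in X]
    mean_y = sum(y) / n
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


def variance_share_bounds(y, predictors, name):
    """Return (lower, upper): R^2 gained when `name` enters last versus first."""
    others = [col for key, col in predictors.items() if key != name]
    full = r_squared(y, list(predictors.values()))
    upper = r_squared(y, [predictors[name]])  # entered first (baseline R^2 is 0)
    lower = full - r_squared(y, others)       # entered last
    return lower, upper


# Hypothetical, mutually uncorrelated predictors and outcome.
predictors = {"x1": [1, -1, 1, -1], "x2": [1, 1, -1, -1]}
y = [3.5, -1.5, 0.5, -2.5]
lower, upper = variance_share_bounds(y, predictors, "x1")
print(round(lower, 4), round(upper, 4))
```

In this toy case the predictors are uncorrelated, so the two bounds coincide. With correlated predictors, as with income and skills, the bounds diverge, which is why the text reports a range.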

Using this method, Table 3 shows that income and 
skills are unambiguously the most important factors in 
explaining social policy preferences among the vari- 
ables included in this analysis. Income accounts for 
between 11% and 51% of the total explained variance, 
and skills account for between 26% and 38%. Jointly, 
they capture between 38% and 73%, with the rest 
accounted for by the controls. 

The key role of income and skills is confirmed when 
we consider the effect of a change of one standard 
deviation in these variables (column 3). Such a change 
in either variable is associated with about 20% of a 
standard deviation change in preferences (since the 
dependent variable is standardized, the recorded ef- 
fects can be interpreted directly in terms of standard 
deviations). Together, the influence of income and 
skills is as great as the joint effects of a standard 
deviation change in all controls simultaneously. Note 
also that the effects of both variables are estimated very
precisely, varying in a narrow range between (−)0.19
and (−)0.22 (95% confidence interval).

The results for the controls also generally confirm 
our expectations. Individuals who are particularly exposed
to labor market risks (the unemployed, women,
and older workers) are more favorably disposed to an
increase in social spending than are others. The same is 
the case for union members, whereas the self-em- 
ployed are more likely to oppose social spending. 
Those who consider themselves well informed about 
politics are also likely to oppose spending, which may 
reflect a political environment at the time that was 
hostile to the welfare state. Supporters of rightist 
parties, not surprisingly, also express less support for 
social spending than supporters of leftist parties. Fi- 
nally, the attitudes of part-time employees and those 
outside the labor market are indistinct from the atti- 
tudes of others. These groups are evidently too heter- 
ogeneous to share any common interest in social 
policies. 

Gender stands out among the control variables, 
accounting for between 8% and 17% of the total 
explained variance, and it has the greatest effect among 
the controls. As argued by Estevez-Abe, Iversen, and 
Soskice (2001), women require more protection than 
men in comparable jobs because of maternity leave, 
but almost half of this effect disappears if the skill 
variable is removed from the equation. The reason, we 
believe, is closely related to our theoretical argument. 
Because women know they are likely to leave their job 
before they can reap the full returns on specific skill 
investments, they are dissuaded from making such 
investments (Estevez-Abe, Iversen, and Soskice 2001). 
This shows up in our data as a negative effect of 
(female) gender on s. Thus, our skill specificity variable 
is 0.27 standard deviations lower for women than for 
men (t = 14). In other words, when women invest in
specific skills they are more likely than men to support 
a high level of social protection, but they are somewhat 
less likely to invest in these skills in the first place. 


Robustness Tests. In this section we test the robustness 
of the results and address some potential objections to 
our interpretation. We first note that the findings for y 
and s stand up to any combination of the controls we 
included, and they are robust to the inclusion of any 
other variable used in the survey (region, public sector 
employment, urbanization, and supervisory position) in 
any combination.26 Although income and skills are
powerful explanatory variables in the pooled analysis, 
pooling can disguise considerable cross-national varia- 
tion in the strength of the results, and estimated 
parameters may even reverse in particular cases. In 
addition, pooling usually yields exaggerated t-scores
compared to those found for individual countries.27
Therefore, we ran our regressions on each of the 
countries individually. The results for the theoretical 
variables are shown in Table 4. 





26 None of these was used in every survey, so rather than clutter the
presentation with several additional columns, we omitted them from
the main analysis.

27 The reason is that the standard error has the form (s.e. of equation
error)/(s.e. of variable). Since the denominator is the square root of
the sum of squares of the explanatory variable divided by N, this
normally increases with N, because a squared term is added on the
top, and 1 is added to the bottom (although it does not have to be so).
Note that every regression yields results that are
consistent with the pooled analysis; each of the 60 
parameters records the correct sign, and most are 
significant at the .01 level or better. The composite skill 
variable is always significant at .01 or better, and for 
nine of the countries the parameter estimates for s vary 
in a fairly narrow range between 0.16 and 0.29 (the 
parameter in the pooled analysis is .23). Only Ireland 
and Italy fall slightly out of the pattern with parameters 
just below .12. Yet, the effects for these countries are 
still statistically highly significant, and it should be 
noted that s_composite in both cases is based on only two
proxies for s. In the case of Italy, these proxies also use 
a crude occupational variable that maps rather poorly 
onto ISCO-88, which potentially dilutes the skill dis- 
tinctions among categories. 

As in the pooled analysis, we also note that the 
results for s3 are somewhat weaker across all cases than
for the other skill measures, but only in one instance 
(the United States) is there a statistically insignificant 
result. Given the variety of countries and the differ- 
ences in measurements, the combination of results is 
clear support for our theory. 

Another objection that can be raised to our findings 
for skills is that they may capture an ideological 
aversion to government spending among those with 
higher education. Two measures of s have formal 
education in the denominator, and the other two 
implicitly assume it. In quantitative terms, general 
education accounts for roughly one-third of the vari- 
ance in s . It is therefore conceivable that the 
proxies for skills may partly capture an ideological 
effect of higher education. For example, much of the 
economic theory taught to university students during 
the 1990s emphasized the efficiency of free markets 
over state intervention. 

To some extent we control for this possibility
through the variables for self-assessed level of informa- 
tion and party support. If the highly educated consider 
themselves better informed about the costs of generous 
social spending, this is likely to show up in the infor- 
mation variable. Likewise, those who are ideologically 
committed to a small welfare state are presumably 
more likely to support rightist parties. The fact that a 
large effect of skills persists after we control for these 
variables suggests that our conception of skills as assets 
is correct. 

There still may be unmeasured aspects of formal 
education that somehow confound the effects of our 
skill variable. One way to address this issue is simply to 
include general education as a separate variable, which 
means that s will only pick up the effects of
specific skills. In this setup we would expect formal 
education to have the opposite effect of the specific 
skills variable, and the separating out of general skills 
will necessarily weaken the effect of the original vari- 
able if general education is indeed a measure of 
general skills. We can be certain, however, that what- 


28 In terms of the formal model, this can be captured by different
assessments of the distortionary effects of taxation. 
TABLE 4.


0.156" 
(0.027) 
0.219" 
(0.042) 
0.219" 
(0.053) 
0.235" 
(0.038) 
0.255" 
(0.030) 
0.116 
(0.028) 
0.105" 
(0.037) 
0.257" 
(0.033) 
0.176" 
(0.038) 
0.282** 
(0.039) 
0.294" 
(0.052) 


New Zealand 


Sweden 


United States 


0.129" 
(0.023) 
0.105" 
(0.026) 
0.102* 
(0.042) 
0.158" 
(0.035) 
0.182“ 
(0.028) 
0.092" 
(0.029) 
0.104* 
(0.040) 
0.130" 
(0.024) 
0.125“ 
(0.030) 
0.130“ 
(0.030) 


0.192" 
(0.030) 
0.127"
(0.026) 
0.121 
(0.027) 
0.140" 
(0.045) 
0.147" 
(0.028) 
0.155" 
(0.023) 
0.122" 
(0.027) 
0.095** 
(0.034) 
0.165" 
(0.027) 
0.094" 
(0.030) 
0.140" 
(0.028) 
0.189" 
(0.037) 


0.135" 
(0.039) 


0.087* 
(0.041) 


0.250" 
(0.030) 
0.131“ 
(0.034) 
0.278" 
(0.030) 


0.272 
(0.058) 


Note: All regressions include the same set of controls as in Table 2, column 1. Standard errors are in parentheses. *p < .05, **p < .01.
ᵃ The effect of income is only shown for s_composite but varies little across the four measures of s.
Parameter 


>Data are not available to estimate thes 


ever effect remains of s, it cannot be attributed to 
general education. 

The first column of Table 5 shows the results of 
reestimating model 1 in Table 2, using formal educa- 
tion as a separate independent variable. Formal edu- 
cation has a strong negative effect on support for social 
protection, which is consistent with the skill asset 
argument. More important, the parameter on the
specific skills variable remains positive and statistically 
significant. It is not surprising that the effect of s falls 
from 0.23 to 0.14, but this is still a very considerable 
influence. Even if we completely discount the effect of 
general education as a measure of general skills, therefore,
the results lend unambiguous support to our
argument.

Yet, it would be a mistake to treat general education 
as a proxy for unmeasured ideological effects, and we 
can support this claim with results for the postmateri- 
alist spending index. Surely, if highly educated individ- 
uals believe in the efficiency of free markets and the 
waste of government spending, then they also should 
oppose public spending on the environment, culture, 
and the arts.29 But the exact opposite is true as shown 


29 “The environment” may be conceived as a collective good that
improves overall welfare (it is a little harder to argue this with respect
to fine arts subsidies), but by the same token social protection may be
conceived as welfare-improving insurance. The point is not that the
highly educated are more informed about what is “good” and “bad”
spending (we already control for this); rather, they may have 





in column 2 of Table 5. People with high general 
education are much more likely to support government 
spending in these areas than are others. Conversely, if 
we use our composite measure of specific skills (column
3), the effect of skills is reversed: Specific skill
workers want less postmaterialist spending, even 
though they support more social spending. Evidently, 
people prefer government spending in areas that are 
particularly conducive to their personal welfare. Work- 
ers with general skills demand little social protection 
but are enthusiastic consumers of a clean environment 
and state-subsidized culture. Workers with specific 
skills are deeply concerned about social protection but 
not enthusiastic about state subsidization for the envi- 
ronment and the arts. There is no blanket support for, 
or opposition to, government spending among any 
particular group of workers. 


CONCLUSION 


It is well known that a substantial portion of both 
national and personal income can be attributed to 
human capital, broadly conceived. It is therefore not 
surprising that the asset specificity of this capital mat- 
ters a great deal for the amount of social insurance 
demanded by individual workers. Like physical capital, 


internalized a general aversion to government spending through 
their educational experience. 
TABLE 5. Formal Education and Support for Two Types of Spending in Ten Countries, 1996

                           Support for           Support for Postmaterialist Spending
                           Social Spending
                           (1)                   (2)                   (3)
Formal education           −0.105** (0.008)      0.130** (0.007)       –
s (composite measure)      0.143** (0.015)       –                     −0.097** (0.014)
Income                     −0.0027** (0.0002)    −0.0004 (0.0002)      0.0004 (0.0002)
Age                        0.0015* (0.0006)      −0.0064** (0.0006)    −0.009** (0.001)
Gender (female)            0.203** (0.018)       0.082** (0.018)       0.087** (0.019)
Part-time employment       −0.025 (0.028)        0.104** (0.029)       0.112** (0.030)
Unemployed                 0.303** (0.040)       0.085* (0.043)        0.089* (0.044)
Nonemployed                −0.067** (0.025)      0.077** (0.024)       0.083** (0.025)
Self-employed              −0.243* (0.028)       0.021 (0.025)         0.011 (0.026)
Informed                   −0.031** (0.008)      0.069** (0.009)       −0.083** (0.009)
Left-Right party support   −0.049** (0.004)      −0.060** (0.005)      −0.059** (0.005)
U · y                      −0.0002 (0.0001)      0.0000 (0.0001)       0.0001 (0.0001)
U · s                      −0.006 (0.005)        0.008 (0.005)         0.007 (0.006)
Adj. R²                    0.22                  0.09                  0.07
N                          14,101                14,101                14,101

Note: Regressions include a full set of country dummies. Standard errors are in parentheses. *p < .05, **p < .01.








FIGURE 3. Vocational Training Intensity and Government Transfers in Twenty Countries

human capital can be more or less mobile, and workers
who have made heavy investments in asset-specific 
skills stand a greater risk of losing a substantial portion 
of their income than do workers who have portable 
skills. For this reason, workers with specific skills have 
a strong incentive to support policies and institutions 
that protect their jobs and income. 

Because social protection tends to benefit low-in- 
come people more than high-income people, position 
in the income distribution also divides public opinion. 
At any given level of income, however, workers with 
specific skills are more inclined to support a high level 
of protection than are those with general skills. This 
tendency can help us understand cross-national vari- 
ance in social protection, because the profile of skills is 
likely to vary in accord with the structure of a nation’s 
educational system. Some countries provide much 
more training than others in vocational skills that are 
specific to particular jobs, firms, or industries. In 
Germany, for instance, more than one-third of any age 
cohort goes through extensive vocational training (3-5 
years), whereas in the United States the comparable 
figure is only a few percentage points (even counting 
those in the junior college system). If these differences 
are reflected in political preferences, either through 
the electoral system or the system of interest represen- 
tation, then a new explanation of the welfare state 
based on differences in national skill profiles may be 
appropriate. 

The empirical plausibility of such an explanation is 
suggested by Figure 3, which uses the vocationally 
trained share of an age cohort as an indicator for the
composition of labor force skills, and government 
transfers as a share of GDP as the proxy for the income 
transfer variable (R). As expected, there is a strong 
positive association between the two (r = .82), al- 
though future research is needed to eliminate the 
effects of potentially confounding variables. 

Cross-national differences in skill composition also 
may explain why income equality is linked to higher 
social spending in cross-national comparisons. Since 
the extent of vocational training is strongly positively
related to pretax income equality (the correlation
coefficient is .73, using as a measure of equality the
earnings of a worker in the bottom decile of the earnings
distribution divided by the earnings of a worker in the
top decile, that is, d1/d9 ratios; OECD n.d.), income
equality and social spending tend to go hand in hand
across countries. In the pure Meltzer-Richard model this
is ruled out because the pressure for redistribution is
always greatest in countries with the most skewed income
distribution.

Finally, our model points to an important source of 
cross-time variance in support for social protection: 
unanticipated shocks to the occupational structure. 
When workers invest heavily in skills that are not fully 
transferable, an increase in the risk of having to move
across a skill boundary in the economy raises the level 
of demand for social insurance. This helps explain why 
the dramatic decline of industrial employment in many 
countries over the past three decades is a very good 
predictor of welfare state expansion (Iversen and Cusack
2000). More generally, changes over time in the exposure
to risks, in the training system, and in the international
division of labor all affect the political demand for social
protection. Modeling these dynamics and testing them
empirically are important tasks for future research.


APPENDIX A: MATHEMATICAL PROOFS 


Derivation of Equations 12 and 13 

The choice of the optimal R requires that: 
Vr = 08 -u’(®) * 2g/w = y'u’ (R). 

Totally differentiating both sides we get: 


2 

rR ew @u'@l 

dg FON. a. O 

E p (2) Otrum 

Since the denominator is negative,

$$\frac{dR}{dg} > 0 \quad \text{if} \quad \left[ g \cdot u''(g) + u'(g) \right] < 0,$$

which implies

$$\mathrm{RRA}(g) = -\frac{g \cdot u''(g)}{u'(g)} > 1,$$

where RRA(x) is the Arrow-Pratt definition of relative risk 


aversion defined at c = x. The inequality conditions specified 
in equations 12 and 13 follow directly. 
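
As a reading aid, the step from a bracketed condition of the form x·u''(x) + u'(x) < 0 to a statement about relative risk aversion can be written out from the Arrow-Pratt definition alone. The constant-relative-risk-aversion utility function in the second display is an illustrative assumption, not part of the original appendix.

```latex
% Arrow-Pratt relative risk aversion at consumption level x:
%   RRA(x) = -x u''(x) / u'(x),  with u'(x) > 0.
\[
x\,u''(x) + u'(x) < 0
\;\Longleftrightarrow\;
x\,u''(x) < -u'(x)
\;\Longleftrightarrow\;
-\frac{x\,u''(x)}{u'(x)} > 1
\;\Longleftrightarrow\;
\mathrm{RRA}(x) > 1 .
\]
% Illustrative special case (an assumption for this sketch): CRRA utility
\[
u(c) = \frac{c^{1-\rho}}{1-\rho}, \quad \rho > 0,\ \rho \neq 1
\;\Longrightarrow\;
u'(c) = c^{-\rho}, \quad u''(c) = -\rho\,c^{-\rho-1}, \quad
\mathrm{RRA}(c) = \rho ,
\]
% so the bracketed condition holds exactly when rho > 1.
```

Dividing by u'(x) preserves the direction of the inequality because marginal utility is positive.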


Proof for Result 1 


Note first that t = 1 maximizes t/(1 + t) when 0 ≤ t ≤ 1.
Also, if t = 1, R = w/2.
From equation 6 the necessary condition for optimal R is 


aug (1-2) +p-w@-(1- 3) +y-v@eo. 


(A-1) 
If R = w/2, sg = g = R; hence the maximum combination 
syd cm. assuming it exists, requires that 


this condition hold with equality and that u'(sg) = u ) = u'G) = 
u'(R). Sa aaa Set Bg = 


@tptyyay=5. 


Proof for Results 2 and 3 of the Asset 
Model 


The necessary condition for optimal choice is $V_R(R, s, g) = 0$.
This is given by A-1. Totally differentiating $V_R$ gives:


a fw (2-1) (1-F) ets =| +as 
2 2
-1) +B wE Z1) + yeh a 
(A-2) 
Note that the term in braces on the right-hand side, which we 
will call B, is negative. Also, we can write (5g — w/2)+ (1 — 
2R/w) = sg — w/2. Furthermore, 
[u" (s8) « (SR — W/2) + u'(3g)] 


a5 pee! gee gate WE 
= u' (sg) £ RRA T 


far: A 





| = u’ (sg) - L(38), 
(A-3) 
so equation A-2 can be written: 
u' (38) L8) ' a'g: ds + u' (5g) L(38)' a-s dg 
+ u’ (8) : LẸ): Bdg = (w/2) -B dR. (A-4) 


Since $dy = \alpha \cdot g \cdot ds + \alpha \cdot s \cdot dg + \beta \cdot dg$, we can further rewrite
equation A-4 as


_ 2a-B-g [u (sg) L(g) — u'&)-L@) 
aR = wB | as+B -as 
2 [u’ (sg) - L8) as +u’) LE- B 
+= | as +B |- 
(A-5) 


To prove results 3 and 4, note that in terms of equation A-5
$\partial R/\partial y = dR/dy$ and $\partial R/\partial s = dR/ds$. We show first that
$L(sg) < L(g)$. From the definition in equation A-3, this follows if
$s > 1$ (as is the case apart from purely general skills) and if
$\mathrm{RRA} > 0$. Result 3 is that $\operatorname{sgn}\, \partial R/\partial y < 0$ if $\mathrm{RRA} < sg/(sg - w/2)$.
Since $B < 0$, $L(sg) < L(g)$, and $u'(x) > 0$, this
follows from equation A-5 if $L(sg) > 0$. This requires that
RRA < sg/(sg — w/2). This is a sufficient condition; a 
necessary and sufficient condition is that the numerator of the 
second term in square brackets on the right-hand side of 
equation A-5 is 

Result 4 is that $\operatorname{sgn}\, \partial R/\partial s > 0$. Since $B < 0$, this requires
that the numerator in the first square bracket on the right-hand
side of equation A-5 is negative. Since $u'(sg) < u'(g)$
from diminishing marginal utility, a sufficient condition is that
$L(sg) < L(g)$, which is true so long as $\mathrm{RRA} > 0$ and $s > 1$.
So result 4 follows from the existence of risk aversion and 
specific skills. 


APPENDIX B: DERIVING THE ESTIMATING 
EQUATION 


We show that the estimating equation we use, 


$$R = k + b \cdot y + c \cdot s, \tag{B-1}$$

is equal to 
Rate fel 2 
= ay ap N (B-2) 


where equation B-2 is a first-order Taylor expansion of
$V_R(R, s, g) = 0$ and $y = \alpha \cdot s \cdot g + \beta \cdot g$ evaluated around
$(R, s, g) = (\bar{R}, \bar{s}, \bar{g})$.


Proof: The first-order Taylor expansion of Vg is given by: 


Vrs Frs 
Rs Beye vee : l (B-3) 
In terms of equation A-5:


Vua) uE) L) ag 
V WB 





(B-4a) 
and 


Vr) _ 


Vre) 


cine -B 
(B-4b) 
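The first-order Taylor expansion of $y$ is:

$$y = k' + [\alpha \cdot \bar{s} + \beta] \cdot g + [\alpha \cdot \bar{g}] \cdot s. \tag{B-5}$$

Rewrite equation B-5:

$$g = \frac{y - k' - \alpha \cdot \bar{g} \cdot s}{\alpha \cdot \bar{s} + \beta},$$

and substitute into equation B-3, using equations B-4a and
4b. This yields equation B-2.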


APPENDIX C: DETAILED INFORMATION 
ABOUT VARIABLES 


Dependent variables 


The spending variable, R, is measured by four issue items in
the ISSP surveys. The first three are based on the following
question: “Listed below are various areas of government
spending. Please show whether you would like to see more or
less government spending in each area. Remember that if you
say ‘much more,’ it might require a tax increase to pay for it.”
The respondent is then presented with the different spending
areas (unemployment, health, retirement) and the following
range of possible responses: (1) spend much more; (2) spend
more; (3) spend the same as now; (4) spend less; (5) spend
much less; (8) can’t choose, don’t know.
The fourth variable is based on the following question:
“Here are some things the government might do for the
economy. Please show which actions you are in favor of and
which you are against. Please tick one box in each line.” One
of the actions is: “Support for declining industries to protect
jobs.” The options are: (1) strongly in favor of; (2) in favor of;
(3) neither in favor of nor against; (4) against; (5) strongly
against; (8) can’t choose, don’t know; (9) NA, refused.


Independent Variables 
$s_1$ and $s_2$. In some countries an earlier version (ISCO-68)
of occupational classifications was used, but these can be 
translated into ISCO-88 with considerable i 
using a coding scheme developed by Harry Ganzeboom at 
Utrecht University (see Ganzeboom and Treiman 1996 and 
http://www.fss.uu.nl/soc/hg/ismf for details). The Swedish occupational
classification is based on an amended version of
an older edition of ISCO. Statistiska Centralbyrån (Statistics 
Sweden) provided us with a conversion table to translate 
these codes into ISCO-88 in a reasonably consistent manner. 
Britain uses its own national classification system, but it is 
closely related to ISCO-88 and likewise uses skills as the 
basis. We received the British translation codes from UK 
National Statistics. The only problematic case is Italy, where 
the few broad categories used in the 1996 ISSP survey are 
completely unrelated to the ISCO-88 categories. We went 
back to an earlier 1990 ISSP study (ISSP 1993), which 
contains a somewhat more detailed occupational variable for 
Italy. Using this variable in conjunction with information on 
TABLE C-1. Party Classification for Ten Countries

Soziale Union. Party affiliation is not available for Italy.


education levels enabled us to map the Italian codes to the
one-digit ISCO-88 level in a fairly consistent manner. Yet,
because of the lack of direct correspondence, the results for
Italy must be viewed with caution.


General Skills (used in the denominator of $s_1$ and $s_2$). The
variable is used as a proxy for g and has five levels: (1)
completed primary degree or lower; (2) incomplete secondary;
(3) completed secondary; (4) incomplete and completed
semi-higher degree, or incomplete university degree; and (5)
completed university degree (in some countries a distinction
is made between incomplete and complete primary education,
but we do not use this distinction). Alternatively we
could have used years of formal schooling as a measure of g,
but the results are very similar.


Information. An item asked the degree of agreement with the 
following statement: “I feel that I have a pretty good under- 
standing of the important political issues facing our country.” 
The options were: (1) strongly agree; (2) agree; (3) neither
agree nor disagree; (4) disagree; (5) strongly disagree. The
coding was reversed so that higher values represent more 
information. 
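
The reversal described for this item can be sketched in a few lines. This is only an illustration of reverse-coding a five-point scale; the function name `reverse_code` and its default bounds are hypothetical and do not come from the ISSP files or the authors' code.

```python
def reverse_code(response: int, low: int = 1, high: int = 5) -> int:
    """Reverse a Likert response so that higher values mean more agreement.

    With low=1 and high=5, 1 ("strongly agree") becomes 5 and
    5 ("strongly disagree") becomes 1; the midpoint 3 is unchanged.
    """
    if not low <= response <= high:
        # Codes outside the scale (e.g. missing-value codes) are not recoded here.
        raise ValueError(f"response {response} is outside {low}..{high}")
    return low + high - response


if __name__ == "__main__":
    for raw in range(1, 6):
        print(raw, "->", reverse_code(raw))
```

Missing-value codes would need to be handled before applying such a recode; the sketch simply rejects them.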


Left-Right Position. This variable is based on the classification 
of parties from Left to Right developed by the International 
Social Survey Program to facilitate comparison of party 
support across countries. Parties are classified as shown in 
Table C-1 (data are not available for Italy). 


National Unemployment. The standardized rate of unemploy- 
ment at the time of the national surveys (1996 unless noted 
otherwise) minus the overall OECD rate of unemployment at 
that time (the subtraction eliminates problems of multicollinearity
while leaving the substantive results unaltered). Source:
OECD 2000. Unemployment rates were: Australia, 8.5;
Britain, 8.2; Canada, 9.6; France (1997), 12.3; Germany, 8.9;
Ireland, 11.6; Italy, 11.7; Norway, 4.9; New Zealand (1997),
6.7; Sweden, 9.6; United States, 5.4.


APPENDIX D: STATISTICAL APPENDIX 


A problem arises in our use of $s_1$ and $s_2$ as explanatory
variables. (Because it is the same in both cases, we will simply
refer to s.) The question used as the basis for s was asked only
in the 1997 survey, whereas all the questions about spending
were asked only in the 1996 survey, so it was necessary to
“translate” the 1997 information on s for use in the 1996
survey. We did this by averaging s within each occupation at
the three-digit ISCO level in the 1997 survey and then assigning
these averages to respondents in the 1996 survey based on
their three-digit ISCO classification in that survey. We show
here that the estimated coefficient on s is consistent but has
a small sample bias that biases the estimated
coefficient downward toward zero if c > 0 and upward
toward zero if c < 0.
The structural model is 


$$R^{96}_{ij} = k + b \cdot y^{96}_{ij} + c \cdot s^{96}_{ij} + e^{96}_{ij}, \tag{D-1}$$


where each observation is drawn from the 1996 survey, and
where i indexes the ith individual in the jth ISCO three-digit
occupation group. We do not have data on $s^{96}_{ij}$. Assume
it is generated by the process

$$s^{96}_{ij} = s_j + \eta^{96}_{ij},$$


where $s_j$ is exogenous. Although $s_j$ itself is unobservable, we
have data from the 1997 survey generated by the same
process:

$$s^{97}_{ij} = s_j + \eta^{97}_{ij}; \tag{D-2}$$
En = 0 VLj,x; 
a =O0=Ene Yir,j;
=02; Eè =h (D-3) 


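that is, $\eta^{96}_{ij}$ and $\eta^{97}_{ij}$ are drawn
from the same distribution. We now run the regression

$$R^{96}_{ij} = k + b \cdot y^{96}_{ij} + c \cdot \bar{s}_j + e^{96}_{ij} + u_{ij},$$

where

$$\bar{s}_j = \frac{\sum_i s^{97}_{ij}}{N_j}$$

and

$$u_{ij} = c \left[ s^{96}_{ij} - \bar{s}_j \right], \tag{D-4}$$

and $N_j$ is the number of individuals in ISCO category $j$ in the
1997 survey. From equations D-3 and D-4,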


Pee i 
EWTN Ga 


We can simplify the exposition considerably by assuming no
correlation between $s$ and $y$. This implies:
Eb =a, 


2 5,7 
Et=c ME ya — |. 
5) 


i 


If we make the appropriate probability limit assumptions, it is 
not difficult to show that ĉ is a consistent estimator of c. We 
can get a better insight from the expectation of the exact 
first-order Taylor expansion of 


Dn 
Po 


around the expected values of numerator and denominator. 


(D-6) 


espe ae apr ONAL 
EX in 


Let there be J ISCO categories, and assume for convenience
that $N_j = N\ \forall j$. This approximation produces:


ot 
Eé=c Vr . (D-7) 
ot +N, 2 : 


J 


Since J is constant, equation D-7 tells us first that as N 
increases the approximate bias goes to zero. More important, 
it implies that for a small sample 


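$$c < E\hat{c} < 0 \quad \text{if} \quad c < 0; \qquad c > E\hat{c} > 0 \quad \text{if} \quad c > 0. \tag{D-8}$$

Finally, it implies that

$$\text{if } c = 0 \Rightarrow E\hat{c} = 0, \tag{D-9}$$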


and the standard significance tests hold. 
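
The direction of the bias described in this appendix can be checked with a small simulation. What follows is a minimal sketch, not the authors' procedure: it assumes normally distributed errors, drops the income term and the intercept (in line with the simplifying assumption of no correlation between s and y), and uses hypothetical names and parameter values throughout. It regresses a simulated 1996 outcome on occupation means computed from a simulated 1997 sample and reports the slope. With these settings the textbook errors-in-variables benchmark is c · σ_s² / (σ_s² + σ_η²/N); that benchmark is a standard result offered for comparison and is not a reconstruction of equation D-7.

```python
import random
import statistics


def simulate_slope(n_1997, c=1.0, n_groups=400, n_1996=20,
                   sd_s=1.0, sd_eta=2.0, sd_e=1.0, seed=0):
    """OLS slope from regressing a 1996 outcome on 1997 occupation means.

    All names and parameter values are hypothetical.
    n_1997: respondents per occupation group in the 1997 sample (N).
    """
    rng = random.Random(seed)
    xs, ys = [], []
    for _ in range(n_groups):
        s_j = rng.gauss(0.0, sd_s)  # unobserved occupation-level skill specificity
        # 1997 group mean: the proxy that is actually used as the regressor
        s_bar = statistics.fmean(
            s_j + rng.gauss(0.0, sd_eta) for _ in range(n_1997)
        )
        for _ in range(n_1996):
            s_96 = s_j + rng.gauss(0.0, sd_eta)     # true individual value, unobserved
            r = c * s_96 + rng.gauss(0.0, sd_e)     # structural equation without y or k
            xs.append(s_bar)
            ys.append(r)
    mean_x, mean_y = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    return cov / var


if __name__ == "__main__":
    for n in (2, 20, 200):
        benchmark = 1.0 / (1.0 + 4.0 / n)  # sigma_s^2 = 1, sigma_eta^2 = 4, c = 1
        print(f"N = {n:3d}: slope = {simulate_slope(n):.3f}, benchmark = {benchmark:.3f}")
```

With few 1997 respondents per occupation the slope is pulled toward zero while keeping the sign of c, and the attenuation fades as N grows, which is the pattern summarized in D-8.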
APPENDIX E: MULTILEVEL MODEL


Write the level 2 observation on individual i in economy j as
R, = y+ WwW, + ws, + X8 + e, where X, is the vector 
of controls, and define level 4 by the fixed effects model n = 
7,U, and p = 2. + hU, The single-stage regression model is 
derived by substituting the a 1 model into level 2: R, 
yt ny, + NYU, + bs, + wy U, + X’ B+ e. If we make 
the assumption that all U ae o d tei 
nonrandom effects model, then the implication is that the 
single-stage multilevel equation conforms to the standard 
ordinary least-squares conditions. 
survey.


democratic transitions has been portrayed as a 

serious threat to incumbent governments that 
could complicate the consolidation of political reforms 
(Diamond 1996; Diamond and Linz 1989; Finifter and 
Mickiewicz 1992; Haggard and Kaufman 1995; Linz
1978; Przeworski 1991, 1995). For the most part these 
arguments are founded on a sanctioning model of 
economic voting (Downs 1957; Fiorina 1978, 1981): 
Citizens respond to economic dislocation in a fairly 
homogeneous fashion by punishing the incumbent 
government. Yet, evidence suggests considerable het- 
erogeneity in the political response of citizens to 
economic shocks. This essay proposes a developmental 
model of economic voting that accounts for this heter- 
ogeneity. It is hypothesized that as citizens become 
more informed about democratic processes they en- 
gage in higher levels of economic voting. Similarly, as 
they develop more trust in nascent democratic institu- 
tions, they are more likely to anticipate a responsive 
government and will be more likely to engage in 
economic voting. 

To test these propositions, I draw on a survey of 
Hungarian public opinion that I conducted in Decem- 
ber 1997. Hungary is an appropriate context because 
recent elections there in some sense typify the paradox 
of economic voting in postcommunist societies. The 
country experienced serious economic problems in 
1994, and the defeat of the incumbent government, led 
by the Hungarian Democratic Forum (Magyar Demokrata
Fórum, MDF), conforms nicely to the economic
voting model. The 1998 elections conform less well to
the standard model: The incumbent Hungarian Socialist
Party (Magyar Szocialista Párt, MSZP) coalition
was defeated despite an improving economic situation.
The Hungarian analysis was replicated in Poland employing
the Polish Public Opinion Research Center
Election Study of June 1997.


THE ECONOMIC VOTE IN CONSOLIDATING 
DEMOCRACY 


The standard economic voting model posits that citi- 
zens make electoral decisions based on their percep- 
tions of economic performance. Voters punish incum- 
bents who fail to deliver acceptable economic 
outcomes and reward governments that meet their 
expectations. This model assumes an agency relation- 
ship between voters and politicians. Voters are the 
principal and have the power to sanction or reward 
their agents. Politicians are the agent and are subject to 
voter oversight and sanctions. Elections are essentially 
a referendum on the economic performance of the 
incumbent government. Typically, support for the 
model rests on evidence of a correlation between 
economic outcomes, or perceptions of them, and vot- 
ing behavior; that is, positive outcomes lead to support 
for the incumbent, and negative outcomes favor the 
challenger.1 

For the most part, aggregate-level models of eco- 
nomic voting treat the voters as homogeneous in their 
reaction to economic performance (Goodhart and 
Bhansali 1970; Haller and Norpoth 1994; Mackuen, 
Erikson, and Stimson 1992; Stigler 1973). Even much 
of the individual-level research treats the population as 
rather homogeneous in this regard (Fiorina 1978, 1981; 
Kinder and Kiewiet 1979, 1981; Lewis-Beck 1988; also 
see Lanoue 1994; Nannestad and Paldam 1994; Nor- 
poth 1996). An important consequence of such treat- 
ment is the expectation that citizens in a wide variety of 


1 As Powell and Whitten (1993) and others point out, there is
considerable variation in the ability to identify clearly the incumbent.
In a two-party parliamentary system, as in Britain, the identity of the
incumbent is usually obvious. In the case of coalition governments, 
certain parties may bear more responsibility for economic outcomes 
than others. In semipresidential systems, as in France and Poland, 
responsibility for economic outcomes can shift from the president to 
the prime minister, depending upon who controls the legislature. 
national contexts will respond to economic fluctuations
in a fashion consistent with this sanctioning model of 
economic voting (Paldam 1991; Whitten and Palmer 
1999). 

Building on the notion that economic voting can be 
widely generalized, researchers have extensively ap- 
plied the model to the study of citizen responses to the 
economy at varying stages of democratic development. 
The literature on postcommunist regimes typically 
assumes a traditional economic voting specification. 
There is support in this research for the notion that 
perceptions of economic performance condition eval- 
uations of, or likelihood to vote for, the incumbent 
government (Duch 1993, 1995; Mishler and Rose 1994, 
1996; Przeworski 1991). The success of former commu- 
nist opposition parties in Hungary in 1994 and in 
Poland in 1993, for example, tends to support the 
referendum model of economic voting (Markowski and 
Toka 1998). Apparently, voters responded to serious 
economic dislocation by rejecting the incumbent government.
The notion that the classic economic voting
model applies to countries at varying levels of democratization
is also supported by Remmer’s (1991) work
on Latin America. Her cross-national analysis of elec- 
tion results during the 1980s suggests that economic 
voting is not sensitive to a country’s democratic matu- 
rity. 

A number of recent works on nascent democracies 
question the appropriateness of the conventional eco- 
nomic voting model. Przeworski’s (1996) analysis of 
support for incumbents and for economic reform in 
Poland yields results that are at odds with the conven- 
tional model: Although unemployment reduced sup- 
port for the incumbent, higher inflation favored the 
incumbent, which is exactly the opposite of what the model predicts.
Reinforcing this finding is the Powers and Cox (1997) 
analysis of a mass public survey conducted immediately 
after the September 1993 parliamentary elections in 
Poland. At best, they argue, economic evaluations 
(change in living standards) had a weak effect on vote 
choice. A much more important factor was whether 
voters attributed responsibility for declining living stan- 
dards to state socialism or to policies adopted by the 
reformers. Tucker (1999a, 1999b) tested the model in 
five East and Central European nations and concludes 
that voters do not so much reward or punish incumbent 
parties as vote for (or against) parties with reform or 
antireform policy credentials. Dominguez and McCann 
(1996) find little support for the economic voting 
model in Mexico. These studies suggest that voters in a 
transition context may not respond to economic out- 
comes according to the conventional model, that is, by 
voting for the incumbent when times are good and vice 
versa. 

Other studies suggest that individuals in both mature 
and nascent democracies respond to the economy in a 
heterogeneous and nuanced fashion. Hibbs (1982) 
demonstrates that class or occupational groupings re- 
spond differently to fluctuations in consumer prices 
and unemployment indicators. Mackuen and Mouw 
(1995), Krause (1997), Rivers (1998), Duch, Palmer, 
and Anderson (2000), and Smith (1998) show the 
extent to which responsiveness to economic perfor-
mance varies significantly among different subgroups of 
the U.S. electorate. Recent efforts to understand public 
support for structural reform in Mexico, Peru, and 
Poland (Buendia 1996; Kaufman and Zuckermann 
1998; Przeworski 1996; Stokes 1996) indicate the
heterogeneity of group responses to the economy. It 
appears the model must recognize that various seg- 
ments of the population respond differently to eco- 
nomic outcomes. The question is why. 

I argue that information and trust in nascent demo- 
cratic institutions are two important sources of heter- 
ogeneity in economic voting in transition democracies. 
Economic voting develops among postcommunist elec- 
torates as ambiguity regarding the link between gov- 
ernment policy and economic outcomes declines. The 
link becomes less ambiguous as citizens become more 
informed about how democratic institutions function 
and gain increasing confidence or trust in the respon- 
siveness of these institutions to public preferences. In 
the early period of democratization the conditions 
necessary for an effective agency relationship between 
voter and incumbent are not yet fully developed. 


Information 


The referendum model presumes that voters, or at 
least a preponderance of them, are reasonably well 
informed. If these principals are poorly informed about 
politics and the economy as well as how their political 
choices affect economic outcomes, then they cannot 
effectively monitor their agents, incumbent govern- 
ments. Without effective monitoring they cannot sanc- 
tion or reward incumbents for economic outcomes. 
The importance of this information assumption is 
widely recognized by scholars. In summing up the 
literature on democracy and accountability, Manin, 
Przeworski, and Stokes (1999, 23) note: 


Thus, although we end up with a vague sensation that what 

governments do has some connection to what citizens 

want, the modality of this connection remains opaque. ... 

Yet there are some things we have learned. Perhaps 

foremost is the importance of information—a theme per- 

vasive in most contributions. The main difficulty both in 
instructing governments what to do and in judging what 
they have done is that we, citizens, just do not know 
enough. 
This is reiterated by Fearon (1999), who formally 
demonstrates the extent to which the information 
requirements of the referendum model of democratic 
accountability undermine its applicability for explain- 
ing voting decisions. 

There is considerable empirical evidence, particu- 
larly from mature democracies, that political informa- 
tion varies significantly within populations and has 
important implications for political behavior in general 
and for economic voting in particular. Zaller (1992) 
and others (Delli Carpini and Keeter 1996) persua- 
sively demonstrate that the reception of messages 
regarding policy outcomes (and their causes), such as 
economic performance, is highly contingent upon the 
level of political information, which varies considerably 
within the population around a fairly low mean value.
Krause (1997, 1192) finds that the uninformed are less 
likely to employ evaluations of economic performance 
in their vote decision, although Zaller (2001) argues 
exactly the opposite. 

Moreover, there is a tendency for the less informed 
to rely more on personal financial circumstances as 
opposed to overall economic conditions in their vote 
decision (Conover, Feldman, and Knight 1987; Delli 
Carpini and Keeter 1996; Krause 1997; Weatherford 
1983; but see Hetherington 1996). They also are less 
likely than the informed to perceive correctly the actual 
trends in economic performance (Conover, Feldman, 
and Knight 1987; Krause 1997). Finally, Suzuki (1992) 
suggests that the informed segments of the electorate 
are affected by the political business cycle differently 
from the uninformed. Thus, a significant body of 
evidence suggests that information affects the level and 
nature of economic voting. 

Information is significant for economic voting in new 
democracies because the development of an informed 
citizenry is a critical part of the process. During this 
formative stage, the populace adapts to the novel 
information opportunities and demands of a demo- 
cratic polity. Inglehart (1977) points out that a princi- 
pal correlate of democratization is what he labels 
cognitive mobilization, which essentially is defined as 
the growing political awareness of citizens. It reflects 
the increasing supply of information and messages in 
democratic contexts and also the rising mass demand 
for information that is critical for the fulfillment of 
citizen responsibilities in a democratic polity. In the 
early stages of democratization, political information 
levels are likely to be less well developed than in more 
mature democracies. 

In former communist regimes, voters are particularly 
affected by the novel information demands and oppor- 
tunities associated with democracy. They are exposed 
to increasingly numerous, heterogeneous, and conflict- 
ing messages regarding both the economy and politics. 
They also face a much broader set of political choices 
for which this information is relevant. During the 
transition period, they learn how to employ the mes- 
sages they receive, such as those about the economy, in 
translating preferences into political choices. Within 
postcommunist regimes there are significant differ- 
ences in the level of political information, particularly 
between major urban areas and rural settings. Jones, 
Willerton, and Sobel (1998) point to this as one 
explanation for the fact that the major urban centers of 
Russia (Moscow and St. Petersburg) respond to mac- 
roeconomic crisis, whereas the response is much more 
muted in the more rural parts of the country. 

In successive elections, these information barriers 
are overcome as voters become better informed about 
politics and the economy. This results from observation 
and experience as well as improved media coverage 
(see Goidel and Langley 1995; Hetherington 1996 
discusses the effect of media coverage on economic 
evaluations). As voters become more knowledgeable 
about the political process, ambiguities regarding the 
link between government policy and economic out-
comes decline, and their level of economic voting rises. 


Trust 


A second element of the economic voting model that is 
particularly relevant to new democracies is trust in 
political actors and governmental institutions. Political 
trust is typically conceptualized as an indicator of the 
basic ethical qualities of public officials, the efficiency 
of government, and the correctness of its decisions 
(Hetherington 1998, 791). Distrust in government can 
be viewed as expecting a relatively high level of shirking 
or rent seeking on the part of elected officials. Heth- 
erington (1998, 1999) demonstrates that trust, or lack 
of it, shapes voter attitudes toward incumbents and 
basic democratic institutions. Voters with a relatively 
high level of trust are likely to believe that rent seeking 
by politicians can be controlled through the threat of 
electoral defeat (Ferejohn 1986). They expect that 
replacing incumbents who perform poorly can lead to 
the implementation of better economic policies. 

Voters with a low level of trust in political actors and 
governmental institutions believe that a pure sanction- 
ing strategy may result in the election of agents with a 
penchant for rent seeking or shirking regardless of 
whether they are incumbents or challengers. Replacing 
the incumbent has a high probability of producing just 
as much if not more rent seeking, but reelecting 
incumbents only rewards those who are already shirk- 
ing. To the extent that these distrustful voters are able 
to discern political actors who engage in less rent- 
seeking behavior, they may opt for nonsanctioning 
strategies, such as the selection of “good” (or less bad) 
representative types, as argued by Fearon (1999). This 
implies that economic voting, which is a pure sanction- 
ing strategy, will be more prevalent among citizens with 
a higher, as opposed to a lower, level of political trust. 

In nascent democratic regimes, a low level of polit- 
ical trust may inhibit voters from employing sanction- 
ing strategies that are critical for the economic voting 
model. If trust is low, voters who supported an incum- 
bent party in the previous election may be unwilling to 
elect the opposition, even in the case of poor economic 
outcomes. Although the sanctioning model predicts 
that voters will abandon the incumbent, the risk asso- 
ciated with voting for the opposition may simply out- 
weigh the pay-offs associated with sanctioning the 
incumbent (Przeworski 1991). Morgenstern and Zech- 
meister (2001) demonstrate that risk-averse individuals 
in Mexico are less likely to sanction the incumbent for 
poor economic performance, whereas risk-acceptant 
individuals are significantly more likely to engage in 
economic voting. They note that voter uncertainty with 
the opposition parties declines over time, which re- 
duces the risk of employing a sanctioning strategy. 

A key characteristic of the democratization process 
is the development of trust in political actors. In the 
early stages we expect low levels of political trust, due 
to a relatively high level of uncertainty as well as 
unfamiliarity with different political parties and with 
the process itself. An extensive empirical literature 
establishes that this is the case. Mishler and Rose 
(1997) and Rose, Mishler, and Haerpfer (1997) docu- 
ment the extent to which political trust is underdevel- 
oped in the postcommunist regimes of East and Cen- 
tral Europe. Inglehart (1990) demonstrates the extent 
to which trust is strongly correlated with democratiza- 
tion and is higher in more mature democracies. 

In sum, in nascent democracies there is likely to be a 
high percentage of low-trust individuals, and the cor- 
relation between economic performance and voting 
behavior will be weaker than in democracies with 
relatively high trust levels. As trust increases and the 
expectation of rent-seeking behavior by politicians 
declines, citizens are more likely to engage in economic 
voting. 


Reform Policies 


A simple sanctioning model may not be appropriate for 
postcommunist regimes, which often implement pain- 
ful economic structural reforms. Sanctioning incum- 
bents for short-term fluctuations in economic out- 
comes may not be a dominant strategy. Information 
deficiencies may make it virtually impossible for voters 
to determine the extent to which short-term economic 
outcomes are the fault of policy initiatives as opposed 
to macroeconomic forces beyond the control of the 
national government (such as International Monetary 
Fund or European Union dictates). The rational strat- 
egy may be for voters to entrust politicians, who have 
policy perspectives close to theirs, with the mandate of 
making correct decisions. Given the complexity of the 
macroeconomy and information asymmetries, some 
argue that voters can do no better than adopt this 
equilibrium strategy (Fearon 1999). 

There is strong evidence that reform policy prefer- 
ences shape the manner in which voters in transitional 
postcommunist regimes react to the macroeconomy. 
Tucker (1999a, 1999b) argues, for example, that anti- 
reform parties are likely to benefit from poor economic 
performance, whereas proreform parties will do well 
during periods of good economic performance. The 
aggregate-level evidence from Tucker (1999b) and 
Pacek (1994) suggests this is likely to be the case. 
Powers and Cox (1997) also find that including controls 
for blame attribution (the “first-wave” reformers versus 
the antireform postcommunist parties) significantly 
reduces the effect on vote choice of economic evalua- 
tions. 

These studies make it quite clear that the economic 
calculus that shapes postcommunist voting decisions 
likely includes concerns regarding short-term fluctua- 
tions in the economy as well as the party’s stance on 
economic reforms. It is critical that the model correctly 
capture this economic calculus. A decision to support 
the incumbent, which happens to be an antireform 
party, despite a negative economic evaluation could 
have two interpretations. It may simply reflect an 
absence of economic voting. Alternatively, it may re- 
flect the fact that antireform parties benefit from a bad 
economy because respondents blame economic re- 
forms for bad economic outcomes and express their 
dissatisfaction by supporting parties with antireform 
credentials (in this case the incumbent). The appropri- 
ate model specification should include both an evalu- 
ation of economic outcomes and a measure that con- 
trols for the extent to which the voter agrees with the 
incumbent party’s stances on economic reform. 

In the Hungarian case this specification is compli- 
cated by the fact that the parties’ traditional positions 
regarding economic reform shifted considerably, par- 
ticularly as the 1998 election approached. The MSZP, 
in spite of its communist heritage, moved increasingly 
in a proreform direction, particularly during its 
1994-98 tenure in government. During that tenure, the 
MSZP essentially championed virtually all the eco- 
nomic reform policies advocated by the International 
Monetary Fund and to some extent imposed on Hun- 
gary by the European Union, especially following the 
formal invitation to begin accession talks in 1997. 

In contrast, the leading opposition parties of the 
center and center-Right were increasingly critical of 
many of the economic reforms enacted by the social- 
ists. In its populist campaign in 1998, for instance, the 
Federation of Young Democrats-Hungarian Civic 
Party (Fiatal Demokraták Szövetsége-Magyar Polgári 
Párt—FIDESZ-HCP) often took what seemed to be a 
more traditionally leftist position than the MSZP on 
several issues. On nationalist issues, the coalition of the 
FIDESZ-HCP and the Smallholders’ Party remained 
more traditionally rightist than the MSZP. But with 
regard to several well-politicized economic issues— 
such as the socialist-imposed university fees, higher 
pensions, and fiscal support for families—the FIDESZ 
was more redistributive than the MSZP. This compli- 
cates the specification of an economic reform measure 
in a model of support for the incumbent. Given the 
parties’ shifting positions, voters could have very dif- 
ferent perceptions of where the parties stood on the 
reform issue. An individually calibrated measure to 
deal with this problem is described below. 


DATA AND MEASURES 


Economic Voting in Hungary 


Hungary fits two essential criteria for testing the dif- 
ferent perspectives on economic voting. First, it is a 
young postcommunist democracy that held two sets of 
democratic elections (1990 and 1994) before the survey 
employed in this analysis. This is a context in which we 
would expect meaningful variation in attitudes toward 
nascent democratic institutions and in levels of knowl- 
edge about politics under the new regime. 

Second, given the mixed economic signals experi- 
enced by Hungarians in 1997, we can expect consider- 
able variation in economic evaluations. In 1993, the 
overall economic indicators in Hungary began to re- 
cover from the negative economic shock that accom- 
panied postcommunist reforms. Real GDP essentially 
stopped declining, and both industrial production and 
gross investment experienced real positive growth 
rates. Yet, indicators that matter for the average 
household continued to suggest serious economic dis- 
location. Consumer prices rose at about 20% per year 
up to the period during which the survey was con- 
ducted, and the unemployment rate remained in the 
10% range (compared to 1.9% in 1990). Between 1990 
and 1995 there was only one year of positive real 
growth in net wages. In 1996, personal consumption 
continued to decline at a rate of 10%, and real 
disposable income did not begin expanding until 1997. 
Added to this was the decrease in government subsi- 
dies and social benefits proposed as part of the “Bokros 
Package” of liberal economic reforms (Morlang 1999; 
Racz 2000). 

There is, in fact, considerable variation in Hungari- 
ans’ evaluations of economic performance as revealed 
in the December 1997 survey. It clearly indicates a 
response to improvements in the economy. The pro- 
portions expressing negative retrospective evaluations 
of both personal finances and the national economy 
(respectively, 48% and 37%) are about twice the 
proportions expressing similar prospective assessments 
(respectively, 26% and 18%). Moreover, those with 
positive expectations about their future financial situ- 
ation (26%) are double those who indicated their 
personal financial situation had improved over the past 
12 months (12%). Although about one-third of the 
respondents expect overall economic conditions to 
improve, most are relatively unsure or expect things to 
remain basically the same; almost one-fifth of respon- 
dents expect the situation to worsen. With respect to 
personal economic conditions, the level of optimism is 
even lower: Approximately half the sample believes 
things will remain the same, 25% expect an improve- 
ment, and an equal proportion expects a deterioration. 

The statistical analysis employs individual-level data 
from the 1997 Hungarian Public Opinion survey con- 
ducted as part of the Markets and Democracy Project. 
Appendix A explains the sampling and interviewing 
strategies. Appendix B provides details on explanatory 
variables, which are discussed below. 


Measures 


The standard prospective and retrospective economic 
evaluations of personal finances and the national econ- 
omy, summarized above, are employed in the Hungar- 
ian economic voting model. The two variables captur- 
ing level of democratic development at the individual 
level are Political Trust (generated from a factor anal- 
ysis of four trust items) and a measure of Political 
Knowledge (generated from a factor analysis of re- 
sponses to three factual questions about Hungarian 
politics). 

To tap attitudes toward market reform, I use a 
measure that captures the policy distance between the 
respondent and the incumbent party based on the 
respondent’s location of both the incumbent party and 
herself in the economic reform issue space. Specifi- 
cally, respondents were asked to locate the political 
parties on the issue of state control of industry (0 = 
support for strong state control, 10 = strong opposition 
to state control). The market reform variable (Distance 
from MSZP on Market) in the incumbent support 
model is constructed by taking the absolute value of the 
difference between the respondent’s placement of self 
and the respondent’s placement of the MSZP. 
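The construction of the distance measure can be illustrated with a minimal sketch. The function name, the sample placements, and the treatment of missing placements (returned as `None`) are assumptions made for illustration; the text specifies only the 0-10 scale and the absolute difference.

```python
def distance_from_incumbent(self_placement, party_placement):
    """Absolute distance between a respondent's own position and the position
    the respondent assigns to the incumbent party, on the 0-10 state-control scale.

    Returns None when either placement is missing (an assumed convention).
    """
    if self_placement is None or party_placement is None:
        return None
    for value in (self_placement, party_placement):
        if not 0 <= value <= 10:
            raise ValueError("placements must lie on the 0-10 scale")
    return abs(self_placement - party_placement)


# Hypothetical respondents: (own position, perceived MSZP position)
respondents = [(7, 4), (2, 5), (5, 5), (None, 6)]
distances = [distance_from_incumbent(own, mszp) for own, mszp in respondents]
print(distances)
```

Because each respondent supplies both placements, the measure is calibrated individually: two respondents with the same position can have different distances if they perceive the MSZP differently.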

The Workers Party scored highest on state control (a 
mean value of 3.53). The members of the governing 
coalition scored next highest; the MSZP mean was 
4.30, and the Alliance of Free Democrats (Szabad 
Demokraták Szövetsége—SZDSZ) received a mean 
value of 4.68. The centrist parties were given middling 
scores: the Independent Smallholders Party (Független 
Kisgazda-, Földmunkás- és Polgári Párt—FKgP), 4.93; 
the Christian Democratic People’s Party (Keresztény- 
demokrata Néppárt-KDNP), 5.01; the MDF, 5.04; and 
FIDESZ, 5.12. The right-wing nationalist Hungarian 
Party of Trust and Justice (Magyar Igazság és Élet 
Pártja—MIEP) received a mean score of 4.77, placing 
it slightly to the right of the socialists but clearly to the 
left of the centrist parties. Even with the shifting party 
positions on economic reform, public perceptions of 
stances on state control conform reasonably well to 
historical patterns. 

A second variable, Democratic Satisfaction, measures 
a more general sentiment toward the functioning of 
democratic institutions in Hungary (1 = not satisfied, 
4 = very satisfied). This controls for the possibility that 
a respondent’s enthusiasm for democratic political 
reforms may be confounded with assessments of eco- 
nomic performance. 

It is important that the sanctioning of an incumbent’s 
economic performance not be confounded with the 
collective economic situation of particular groups 
within the electorate. For example, the constituents of 
the incumbent parties may be a favored group relative 
to others (say, white-collar workers as opposed to 
pensioners). Controlling for socioeconomic status re- 
duces the possibility of confusing the intelligentsia’s 
favored economic status with an economic vote that 
responds to short-term economic fluctuations. Efforts 
to explain vote choice in postcommunist regimes as a 
function of socioeconomic cleavages have yielded 
mixed results (Evans and Whitefield 1993; Toka 1996). 
Nevertheless, there is a sense that such voting occurs to 
varying degrees in East and Central Europe and is 
increasing (Duch 1998; Mateju, Rehakova, and Evans 
1999; Shabad 1999). 


Intention to Vote for Incumbent Parties 


The dependent variable has three values. A 0 indicates 
an intention to vote for any one of the opposition 
parties; 1 indicates an uncertain response (don’t know) 
or the intention not to vote; and 2 indicates an inten- 
tion to vote for one of the two incumbents (MSZP or 
SZDSZ). Uncertain voters and nonvoters are treated 
as a middle category between support for incumbent 
and support for challenger.2 According to the eco- 
nomic voting model, as perceptions of economic per- 
formance improved in Hungary, respondents should be 
more likely to indicate an intention to vote for the 
governing coalition, so these two parties are predicted 
to have a higher score on the incumbent vote measure. 
Ordered probit models (for an exposition of this 
method, see Greene 1999) of the respondents’ vote 
intention are employed here. 


RESULTS 


The developmental model of economic voting ad- 
vances the following propositions. First, economic eval- 
uations (of the overall economy and of personal fi- 
nances) are correlated with voting behavior. Positive 
evaluations lead to support for the incumbent, and 
negative assessments lead to support for the chal- 
lenger. Second, there is a positive interaction between 
trust in government and economic evaluations. That is, 
economic voting is contingent upon level of trust, and 
a higher level of trust generates a stronger relationship 
between the economy and vote choice. Third, informa- 
tion interacts with economic evaluations. A higher level 
of information generates a stronger relationship be- 
tween the economy and vote choice. Two alternative 
explanations are also evaluated. Economic voting may 
result from preferences regarding economic reform 
policies or from socioeconomic cleavages. 

The economic voting model tested here is a pure 
sanctioning model: To what extent is the voter’s deci- 
sion to oppose (or support) the incumbent government 
motivated by an evaluation of economic performance? 
A pure referendum model of economic voting predicts 
that people with pessimistic views of the economy are 
much less likely to support the current government. My 
model suggests that those who are better informed and 
those who are more trusting of government are more 
likely to engage in this sanctioning behavior. 

2 This categorization assumes that uncertain respondents are indif- 
ferent between voting for an incumbent or an opposition party. I 
[...] election, compared to 86% of those who were not “uncer- 
tain” (coded 0 or 2). There is little indication that uncertainty 
represents the likelihood of not voting. 

Table 1 presents the bivariate relationship between 
retrospective evaluations of the national economy and 
intention to vote for the incumbent parties (i.e., the 
MSZP or SZDSZ) versus a vote for one of the 
challenging parties. The traditional model predicts that 
as we move from the left side (bad economic percep- 
tions) to the right side (good economic perceptions) of 
the table, the percentage of support for incumbents 
should rise and the percentage of support for challeng- 
ers should drop. At the extreme left there should be a 
large net difference in favor of the challenger; at the 
extreme right, a large net difference in favor of the 
incumbent. 

The developmental model of economic voting pre- 
dicts that this relationship will be significant for those 
with high levels of information and trust in government 
but will be much more muted for low information and 
trust types. Table 1 compares the effect of economic 
perceptions on intended vote choice (incumbent versus 
challengers) for the high versus low information and 
high versus low trust types. First, there is clear evidence 
that, regardless of information/trust type, respondents 
with negative economic assessments were unlikely to 
vote for either incumbent party. Among the 37% who 
expressed a negative (a lot or somewhat worse) retro- 
spective assessment of the economy, 87% indicated 
they would not vote for incumbents. Second, high 
information and trust types respond much more to 
economic conditions than do low information and trust 
types. In Table 1, the increase in support for incum- 
bents as economic perceptions go from bad (left side) 
to good (right side) is considerably steeper when 
information or trust is high rather than low. For 
example, among low information voters, 89% of the 
economically dissatisfied indicated a preference for a 
challenging party, compared to 60% among the eco- 
nomically most satisfied. For those with a high level of 
political knowledge, the difference in challenger sup- 
port between the economically most dissatisfied and 
most satisfied is considerably greater: 89% versus 45%. 
Among those with a low level of political trust, 90% of 
the most economically dissatisfied will vote for a chal- 
lenging party, compared to 70% of the most econom- 
ically satisfied. For those with a high level of political 
trust, the split is 87% versus 36% between the negative 
and positive economic perception groups. 
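The contrasts quoted in this paragraph can be reduced to a single number per group: the percentage-point drop in challenger support between the most dissatisfied and the most satisfied respondents. The sketch below uses only the percentages reported in the text; the dictionary layout and function name are illustrative assumptions.

```python
# Percent supporting a challenging party, as quoted in the text for Table 1:
# (economically most dissatisfied, economically most satisfied)
challenger_support = {
    "low information": (89, 60),
    "high information": (89, 45),
    "low trust": (90, 70),
    "high trust": (87, 36),
}


def swing(dissatisfied, satisfied):
    """Percentage-point drop in challenger support from worst to best evaluation."""
    return dissatisfied - satisfied


swings = {group: swing(*pcts) for group, pcts in challenger_support.items()}
for group, points in swings.items():
    print(f"{group}: {points} points")
```

The drop is 29 points for low information versus 44 for high information respondents, and 20 points for low trust versus 51 for high trust respondents, which is the steeper response the developmental model predicts.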

Note that when information or trust is high, those in 
the most positive economic evaluation category 
strongly favor incumbents over challengers, which is 
what the economic voting model predicts. But this is 
not the case when information or trust is low: At the 
most positive extreme of the economic evaluation 
scale, support for challengers continues to outweigh 
support for incumbents. In other words, for high 
information and trust individuals, positive economic 
evaluations are more likely to lead to support for the 
incumbent; low information and trust types are less 
likely to reward incumbents for good economic perfor- 
mance. 


TABLE 2. Ordered Probit Models of Support for Governing Coalition, Hungary 1997 

Note: Standard errors are in parentheses. See Appendix B for measurement of variables. *p < .05, **p < .01. 

A final point regarding Table 1 is worth mentioning. At 
the very negative extremes of perceived economic 
performance there is little response to variation in 
economic perceptions, compared to the positive ex- 
tremes of economic performance. As economic per- 
ceptions improve from “a lot worse” to “somewhat 
worse” to “remained the same,” there is relatively little 
increase in incumbent support. Support for the incum- 
bent responds much more aggressively to a movement 
from “remained the same” to “somewhat/a lot better.” 
There appears to be a certain asymmetry in how 
economic perceptions shape vote choice. Moreover, 
this applies to both information and trust types. These 
are simply bivariate relationships, but they indicate that 
levels of information and trust in democratic institu- 
tions condition the extent to which voters reward 
incumbents for good economic performance. The next 
section explores this argument using multivariate spec- 
ifications. 

Table 2 presents the ordered probit regression re- 
sults. The dependent variable is the trichotomous 
measure of intended support for either of the two 
ruling coalition parties in 1997. The first column pre- 
sents the results for the standard economic voting 
model, which predicts positive and statistically signifi- 
cant coefficients on the economic perception variables. 
Both retrospective and prospective national economic 
evaluations are significant in the equation. The per- 
sonal finance terms are not statistically significant. This 
is consistent with many of the individual-level eco- 
nomic voting models. These typically find that the 
perceptions of overall economic performance shape 
voting decisions, but personal finances tend not to be as 
important (Kinder and Kiewiet 1979). 
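For readers less familiar with the ordered probit setup, the following sketch shows how a linear index and two cutpoints translate into probabilities for the three vote-intention categories (0 = opposition, 1 = uncertain or nonvoter, 2 = incumbent). The index values and cutpoints are hypothetical and are not estimates from Table 2; the sketch assumes the standard formulation with a standard normal error term.

```python
from math import erf, sqrt

# Coding of the dependent variable as described in the text.
OPPOSITION, UNCERTAIN, INCUMBENT = 0, 1, 2


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))


def ordered_probit_probs(index, cut1, cut2):
    """Category probabilities for latent y* = index + e, with e ~ N(0, 1).

    y = 0 if y* <= cut1, y = 1 if cut1 < y* <= cut2, y = 2 if y* > cut2.
    """
    if cut1 >= cut2:
        raise ValueError("cutpoints must be strictly increasing")
    p_low = normal_cdf(cut1 - index)
    p_mid = normal_cdf(cut2 - index) - p_low
    p_high = 1.0 - normal_cdf(cut2 - index)
    return {OPPOSITION: p_low, UNCERTAIN: p_mid, INCUMBENT: p_high}


# Hypothetical values for illustration only.
baseline = ordered_probit_probs(index=0.0, cut1=0.0, cut2=1.0)
improved = ordered_probit_probs(index=0.5, cut1=0.0, cut2=1.0)
print(round(improved[INCUMBENT] - baseline[INCUMBENT], 3))
```

A positive coefficient on an economic evaluation raises the index, which shifts probability mass away from the opposition category and toward the incumbent category.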

The coefficients on the economic perception vari- 
ables establish that the conventional sanctioning model 
of economic voting applies in the Hungarian context. 
In order to distinguish this sanctioning behavior from 
voting decisions motivated by preferences regarding 
economic reform policies, I measured the extent to 
which the policy preferences of the respondent differ 
from the perceived MSZP’s position on economic 
reform. The coefficient of -.11 indicates that as the 
respondent’s position diverges from that of the MSZP, 
the likelihood of voting for the incumbent parties 
declines. The significant coefficient for the economic 
reform variable suggests that the policy positions of the 
incumbents and challengers played an important role 
in the voters’ decision calculus. 

The equation includes terms designed to calibrate 
the respondents’ engagement in the nascent demo- 
cratic process in Hungary: political trust, democratic 
satisfaction, and political knowledge. These control 
variables ensure that sanctioning for economic perfor- 
mance is not confounded with enthusiasm for, or 
engagement in, the democratic process. Two of the 
variables, satisfaction and trust, are correlated with 
support for the incumbent parties. 

Table 2 suggests that the incumbent government was 
favored by white-collar segments of the electorate, 
which many argue the MSZP went to some lengths to 
cultivate (Morlang 1999), and also by older voters, 
many of whom may have supported the predemocratic 
incarnation of the MSZP. 

The developmental economic vote model proposes 
that the economic voting effect, identified in equation 
1, is depressed among those with a lower level of 
political information and trust in government. I test 
this proposition in equation 2 by including interaction 
terms that are the product of political information and 
trust, on the one hand, and the retrospective and 
prospective overall economic assessments, on the other 
(see Appendix B for measurement discussion). In 
constructing these interaction terms, I dichotomized 
information and trust (1 = high level, 0 = low level). 
Significant positive coefficients would suggest that the 
economic voting effect is higher for those with high 
levels of information and trust. Both interactions (trust 
and political information) with retrospective evalua- 
tions of the national economy are positive and statisti- 
cally significant. In addition, the interaction between 
trust and prospective national economic evaluations is 
positive and significant. Note that the likelihood ratio 
test statistic, which evaluates the constrained model 
(without the interaction terms) against the saturated 
model (including the interaction terms), is statistically 
significant. The significant interaction terms, the posi- 
tive signs on the coefficients, plus the likelihood ratio 
test lend strong support to the developmental eco- 
nomic voting model.3 
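The two steps described above, building the interaction terms and comparing the constrained and unconstrained models, can be sketched as follows. Several things here are assumptions rather than details from the article: the median split used to dichotomize, the sample scores, the log-likelihood values, and the four degrees of freedom (one per interaction term: trust and information, each with retrospective and prospective evaluations). The chi-square tail probability uses a closed form that holds only for even degrees of freedom.

```python
from math import exp, factorial
from statistics import median


def dichotomize(scores):
    """1 = high (above the sample median), 0 = low. Median split is assumed."""
    cutoff = median(scores)
    return [1 if s > cutoff else 0 for s in scores]


def interaction(indicator, evaluation):
    """Element-wise product of a 0/1 indicator and an economic evaluation."""
    return [d * e for d, e in zip(indicator, evaluation)]


def likelihood_ratio(loglik_restricted, loglik_full):
    """LR statistic comparing a restricted model with the fuller model."""
    return 2.0 * (loglik_full - loglik_restricted)


def chi2_sf_even_df(x, df):
    """Upper-tail chi-square probability; closed form valid for even df only."""
    if df <= 0 or df % 2:
        raise ValueError("closed form requires a positive even df")
    half = x / 2.0
    return exp(-half) * sum(half ** k / factorial(k) for k in range(df // 2))


# Hypothetical factor scores and evaluations for four respondents.
trust_scores = [-1.2, 0.3, 0.8, -0.4]
retrospective = [1, 3, 2, 2]
high_trust = dichotomize(trust_scores)
trust_x_retro = interaction(high_trust, retrospective)

# Hypothetical log-likelihoods, not the values reported in Table 2.
lr = likelihood_ratio(loglik_restricted=-1000.0, loglik_full=-990.0)
p_value = chi2_sf_even_df(lr, df=4)
print(high_trust, trust_x_retro, lr, p_value)
```

With a 0/1 indicator, the interaction coefficient is the additional effect of the economic evaluation among high trust (or high information) respondents, so a significant positive coefficient means a steeper economic vote in that group.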

Table 1 suggests an asymmetry in the relationship 
between the economic evaluation terms and voting 
behavior: There are only small differences among those 
with neutral or negative economic evaluations but large 
differences between those with neutral versus positive 
retrospective evaluations. This possible asymmetry is 
tested in equation 3 of Table 2 by including separate 
interaction terms for positive versus negative retro- 
spective and prospective evaluations of the economy. 
The Negative Retrospective National Economy and the 
Negative Prospective National Economy variables have 
values of -2 and -1 for those responding, respectively, 
“a lot worse” or “somewhat worse,” and 0 otherwise. 
The Positive Retrospective National Economy and the 
Positive Prospective National Economy variables have 
values of 2 and 1 for those responding, respectively, 
“somewhat better/a lot better” or “about the same,” 0 
otherwise. Equation 3 of Table 2 confirms that there is 
asymmetry in the interaction effects. The only interac- 
tion terms statistically significant in this specification 
are those involving positive economic evaluations. This 


3 Some argue that if those least supportive of democratic institutions 
do not vote, the results I identify may be relatively insignificant in an 
actual election because this would mean the more informed and 
trusting citizens would shape election outcomes. If actual voters have 
a higher level of information and trust and hence are likely to engage 
in economic voting, then incumbents in effect will be rewarded or 
punished for their management of the economy. A simple strategy 
for empirically testing the argument is to estimate the model using 
only likely voters. If the criticism outlined above is correct, then the 
significant interaction of trust and information with economic eval- 
uations (shown in Table 2) should not be statistically significant when 
the sample excludes likely nonvoters. The Hungary 1997 survey 
included a question that asked respondents to indicate the likelihood 
they would vote if an election were held next Sunday. The develop- 
mental economic model from Table 2 was estimated on a subsample 
that excluded likely nonvoters. The results are essentially the 
same when estimated with (N = 1,498) and without (N = 1,172) 
likely nonvoters. Most important, the interaction effects hold up 
when the likely nonvoters are dropped from the analysis. 


FIGURE 1. Change in Incumbent Vote Probabilities, by Variation in Retrospective Economic 
Evaluations for High Information Respondents, Hungary 1997 


is the case with political knowledge and retrospective 
evaluations and with trust and prospective assessments 
of the economy. The trust and positive retrospective 
interaction term is not quite significant at the .05 level. 
Note that none of the negative interaction terms is 
statistically significant. 
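The split of each economic evaluation into separate negative and positive components can be sketched as below. The mapping follows the coding as worded in the text (negative component: -2 and -1; positive component: 2 and 1; 0 otherwise); the response labels used as dictionary keys and the function name are illustrative assumptions.

```python
# Component values as stated in the text; every other response scores 0.
NEGATIVE_CODES = {"a lot worse": -2, "somewhat worse": -1}
POSITIVE_CODES = {"somewhat better/a lot better": 2, "about the same": 1}


def split_evaluation(response):
    """Return (negative component, positive component) for one response."""
    return NEGATIVE_CODES.get(response, 0), POSITIVE_CODES.get(response, 0)


for answer in ("a lot worse", "somewhat worse", "about the same",
               "somewhat better/a lot better"):
    print(answer, split_evaluation(answer))
```

Entering the two components separately, each with its own interaction term, lets the slope differ on either side of the scale, which is how the asymmetry is tested.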

These results, combined with the patterns revealed 
in Table 1, suggest two asymmetries in the reaction of 
voters to the economy. First, in general, if respondents 
have negative perceptions of economic performance 
they are likely to vote for challengers, and their vote is 
unresponsive to incremental improvements in the 
economy. Second, and this essentially follows from the 
first point, the interaction effect with information and 
trust is more evident among those with positive eco- 
nomic evaluations. Thus, political information and 
trust primarily condition the voting behavior of those 
with relatively positive economic perceptions. Those 
with positive economic evaluations and high political 
information or trust are much more likely than any 
other group to reward incumbents with their vote. 
These asymmetric results indicate that in Hungary 
there is a high economic performance threshold re- 
quired for incumbents to reap the rewards of incre- 
mental improvements in the economy. Below this level, 
voters are so discouraged with the performance of 
incumbents that they overwhelmingly support challeng- 
ers, and this support does not vary by relatively small 
perceived differences (at the negative end of the con- 
tinuum) in economic outcomes. 

Figure 1 illustrates the effect on the probability of 
voting for the incumbent of incremental differences in 
national retrospective economic assessments by re- 
spondents who score high on the political information 
measure (similar results obtain when the trust interac- 
tion is modeled). 

The difference between thinking the economic situ- 
ation “stayed the same” versus “improved/improved a 
lot” is an increase of almost 8% in the probability of 
voting for the incumbent parties. This contrasts with no 
significant differences in vote probabilities between low 
information respondents who fall in these two evalua- 
tion categories (not shown). The confidence intervals 
on the predicted probabilities give us strong assurances 
that the predictions are significantly different from zero 
and suggest how they would likely range in re- 
peated sampling. 


Calibrating the Electoral Consequences of 
the Economy, Information, and Trust 


The developmental model of economic voting suggests 
that as democratic values mature the electoral fortunes 
of incumbents will increasingly correlate with economic 
performance. In 1998, however, after a widely noted 
improvement in economic performance, the incumbent 
government in Hungary was defeated. This may seem 
at odds with the maturity argument, but the survey data 
show that a majority of respondents did not perceive 
any improvement. 

As Table 1 makes clear, those with negative percep- 
tions of economic outcomes, regardless of their infor- 
mation/trust type, are overwhelmingly likely to support 
challengers. The multivariate results in Table 2 
strongly suggest that negative economic sentiment in 
Hungary contributed to the declining electoral fortunes 
of the two incumbent parties, whose vote share 
dropped from 53% in 1994 to 40% in 1998. The 
standard economic voting model suggests that this 
decline could have been moderated by a more positive 
perception of macroeconomic performance. In my 
model, any electoral pay-offs for incumbents from 
positive economic sentiment will be disproportionately 
higher among the high trust and information types, so 
their distribution in the population will affect the 
outcome. 

We can calibrate the magnitude of this “develop- 
mental” effect with a hypothetical example based on a 
set of population assumptions similar to those em- 
ployed in Figure 1. I assume that before the 1998 
elections the incumbent parties were able to improve 
the mean evaluation of the Hungarian economy by 
about a standard deviation, from a score of 2 (no 
change) to a score of 3 (some improvement and a lot of 
improvement). The actual mean value for the retro- 
spective national economy variable was 1.8. The esti- 
mates in Table 2 are based on a construct that catego- 
rizes half the sample as having a high level of trust in 
democratic institutions. If we maintain the distribution 
at 50%, the hypothetical improvement in economic 
evaluations would add approximately 4 percentage 
points to the incumbent coalition’s popular vote, rais- 
ing it to 44%. If the proportion of “trusting” respon- 
dents were at 75% of the population, the incumbents’ 
share of the popular vote would have stood at 46%. 
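
The arithmetic behind this hypothetical can be sketched as follows. The sketch assumes, as a simplification that the text does not state, that the improved evaluation raises incumbent support by about 8 percentage points among high-trust respondents and not at all among low-trust respondents. These two values are back-solved from the 44% and 46% figures above; they are not estimates taken from Table 2, and the function name is hypothetical.

```python
# Illustrative arithmetic only: the per-group gains are assumptions
# back-solved from the figures in the text, not estimates from Table 2.
BASELINE_SHARE = 40.0    # incumbent vote share in 1998 (percent)
GAIN_HIGH_TRUST = 8.0    # assumed gain among high-trust respondents (points)
GAIN_LOW_TRUST = 0.0     # assumed gain among low-trust respondents (points)


def projected_share(high_trust_fraction: float) -> float:
    """Incumbent vote share after the hypothetical improvement in evaluations."""
    if not 0.0 <= high_trust_fraction <= 1.0:
        raise ValueError("high_trust_fraction must lie between 0 and 1")
    # Weight each group's assumed gain by its share of the population.
    gain = (high_trust_fraction * GAIN_HIGH_TRUST
            + (1.0 - high_trust_fraction) * GAIN_LOW_TRUST)
    return BASELINE_SHARE + gain


for fraction in (0.50, 0.75):
    print(f"{fraction:.0%} high trust: {projected_share(fraction):.0f}% of the vote")
```

Under these assumptions the aggregate gain is simply the high-trust share of the population times the gain within that group, which is why the distribution of types matters for the outcome.
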
Other factors played a role in the defeat of the 
MSZP-SZDSZ government. As Morlang (1999) points 
out, the Bokros Package of liberal economic reforms 
adopted by the government alienated key MSZP constituents,
who abandoned the party in the 1998 election.
The results in Table 2 suggest that the coalition
parties were not favored by the working class (note the 
positive coefficient for white-collar respondents). In 
addition, the significant negative coefficient on the 
market reform policy measure suggests that dissatisfac- 
tion with reform initiatives played a major role in 
undermining support for the government. Neverthe- 
less, some of its loss in vote share can be attributed to 
its inability to translate real economic improvements
into votes.4 Both a low level of trust and the individual-level
finding that most respondents did not perceive
economic improvements were important factors in the 
MSZP-SZDSZ defeat. 


A Comparative Perspective on Economic
Voting in Postcommunist Regimes


I argued earlier that the interaction between trust and 
information, on the one hand, and economic voting, on 
the other, is likely to have significant implications for 
postcommunist democracies because a relatively low 
level of trust and information marks the early periods 
of transition. If so, then results similar to those in 
Hungary should be found in other postcommunist 
democracies. As a step in that direction, I examined a 
similar period in Poland, just before the 1997 legisla- 
tive elections. My “out-of-sample” replication employs 
the Polish Public Opinion Research Center Election 
Study of June 1997. 

The items in the Polish survey permit the estimation 
of a model that is reasonably similar to the Hungarian 
model (details are presented in Appendix C). The 
dependent variable is intention to vote for one of the 
two incumbent coalition parties, the Polish Peasants 
Party (Polskie Stronnictwo Ludowe, PSL) and the
Democratic Left Alliance (Sojusz Lewicy Demokratycznej, SLD),
coded 2; those who indicated they would
not vote or were uncertain were coded 1, and those 
who indicated they would vote for any of the challeng- 
ers were coded 0. A direct measure of trust was not 
available in the survey, but Confidence in the Political 
System serves as a proxy. This measure is based on a 
question that asked respondents whether the political 
system is good and requires no change or is bad and 
requires significant change. To construct the confi- 
dence interaction term, this variable was dichotomized 
(1 = good; 0 = bad) and multiplied by the retrospec- 
tive and prospective national economic evaluation vari- 
ables. A measure of political information also was not 
available in the Polish survey. Education is employed as 
a proxy in creating an information interaction term for 
the retrospective and prospective national economic 
evaluation variables. The other variables in the equa- 
tions are similar to the Hungarian specification (see 
Appendix C). 
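
As a minimal sketch of how such interaction terms might be built for a single respondent, consider the following. The field names are hypothetical, since the survey's variable names are not given here, and multiplying the raw education level by each evaluation is an assumption about how the information proxy enters the term.

```python
# Hypothetical field names; the construction of the education term is an assumption.
def build_interactions(resp: dict) -> dict:
    """Return confidence and information interaction terms for one respondent."""
    # Dichotomized confidence in the political system: 1 = good, 0 = bad.
    confident = 1 if resp["system_is_good"] else 0
    terms = {}
    for horizon in ("retrospective", "prospective"):
        evaluation = resp[f"{horizon}_national"]  # national economic evaluation score
        terms[f"confidence_x_{horizon}"] = confident * evaluation
        terms[f"education_x_{horizon}"] = resp["education"] * evaluation
    return terms


example = {
    "system_is_good": True,
    "retrospective_national": 1,
    "prospective_national": 3,
    "education": 5,
}
print(build_interactions(example))
```

Because the confidence variable is a 0/1 dummy, its interaction terms equal the evaluation score for confident respondents and zero for everyone else.
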

The Polish economic voting model estimates are 
presented in equation 4 of Table 3. The extent of 
economic voting in 1997 is indicated by the coefficients 
on the four economic evaluation variables. None is
statistically significant. The developmental model proposes
that an absence of economic voting is confined to
respondents with low levels of trust and political infor- 
mation, but in this case both the confidence and the 
information interaction terms are not statistically sig- 
nificant. As we might expect, confidence in the political 
system on its own has a positive effect on the likelihood 
of supporting the incumbents, but democratic satisfaction
(an assessment of the implementation of democratic
reforms) does not appear to have an influence. In
terms of demographic variables, white-collar workers
and people with less education are more likely to
support the government.


4 For a detailed account of factors in the socialist defeat in 1998, see
Racz 2000.

Equation 5 proposes an alternative estimation that 
identifies the subgroup of the population with a high 
level of both confidence and information. This involves 
creating a dummy variable that assumes a value of 1 for 
this type, 0 for all other respondents. The economic 
interaction terms are the product of this dummy vari- 
able and the retrospective and prospective national 
economic evaluation measures. This specification gen- 
erates support for my developmental model since the 
coefficient for prospective economic assessments inter- 
acted with the political confidence and education dum- 
mies is positive and statistically significant. Those with 
a high level of combined information and confidence 
employ prospective economic evaluations in their vot- 
ing decision. The magnitude of this economic voting 
effect can be illustrated once again using the method 
described earlier. All the variables were set to their 
median value, and a change in the prospective eco- 
nomic interaction term was simulated, raising it from a 
value of 2 to a value of 3 (one standard deviation). This 
generated an increase of .07 in the probability of 
supporting the incumbent parties (with 95% confi- 
dence intervals going from .003 to .16). 

In the Polish case, a reasonably high combined level 
of confidence in the political system and education 
seems to increase the extent to which economic eval- 
uations guide voter choice. These results provide ten- 
tative confirmation of the Hungarian findings and 
suggest more generally that levels of trust and information
in postcommunist democracies may shape the
extent to which economic evaluations are employed in 
voting decisions. 

The overall importance of this interaction effect is 
predicated on the extent to which transition democra- 
cies have a significant number of low trust or low 
information types in the population. In postcommunist 
Europe, aggregate trends in attitudes toward nascent 
democratic institutions suggest that the numbers were 
sufficiently high to moderate economic voting in these 
regimes during their transition to democracy. The 1990 
East and Central European Transition to Dem 
Survey (Bruszt, Simon, and Wessels 1993) and the 1997 
Hungarian surveys include two similar questions re- 
garding trust in government, that is, whether the 
government is run for the interests of a few or for the 
benefit of all.5 In the Hungary portion of the 1990 
regional survey, 35% of the sample indicated it was run 
for the interests of a few and in the 1997 Hungary 
survey 59% of the sample agreed. A second item asks 
respondents how much they can trust the government 
to do what is right, and the results suggest very stable
levels of trust. In both surveys, 16% indicated that the
government can never be trusted, and 56% believe that
it can be trusted sometimes. At best, trust in government
did not increase in Hungary during the transition
period and may have eroded.6


5 The wording of the 1997 Hungarian question is: “In your opinion
the government ... is run by a small number of people who are
looking out for themselves, or is run for the benefit of the whole
nation?” The 1990 regional question is: “Generally speaking, would
you say that this country is run by a few big interests looking out for
themselves or that it is run for the benefit of all the people?”

In Poland and most other postcommunist countries 
the trends in institutional confidence are similar to 
those in Hungary. Attitudes can be gleaned from the 
1990 and 1995-97 World Values surveys, which ask a 
series of questions about the degree of confidence in a 
set of social and political institutions. (Details on the 
surveys can be found in Inglehart 2000.) Since the 
political trust item was not asked in the 1990 World 
Values survey, expressions of confidence in parliament 
and the civil service are substituted as measures of 
support for nascent democratic institutions. Over this 
relatively short time, confidence in parliament declined 
by more than 40% in Poland and Lithuania, by 12% in 
Slovenia, by 22% in Russia, and by 6% in Bulgaria. 
Confidence in the civil service decreased by 12% in 
Slovenia and Bulgaria, by 15% in Lithuania, and by 
40% in Poland; it remained constant in Russia. Clearly, 
faith in these central institutional features of democ- 
racy is faltering, which is consistent with Toka’s (1996) 
empirical findings from the region. These transition 
regimes have a large number of low-trust individuals, 
and the individual-level results of this study suggest 
that low trust tends to moderate economic voting. 


CONCLUSION 


The results reported here speak to the importance of 
understanding the heterogeneity of economic voting in 
consolidating democracies. There are systematic differ- 
ences in how voters respond to the economy, and these 
can have important aggregate-level political implica- 
tions. For new democracies in particular, limited infor- 
mation and skepticism about democratic institutions 
undermine economic voting. 

It is widely accepted that even in new democracies 
there is a correlation between economic perceptions 
and political outcomes. Many observers view this as a 
potential barrier to the adoption of painful economic 
reforms considered critical to the long-term success of
these regimes. It is widely expected that governments 
will face serious electoral penalties when voters are 
unhappy with the economic dislocation caused by 
reform. This presumes that political accountability in 
new democracies follows the classic principal-agent 
relationship, with voters sanctioning incumbents for 
economic outcomes. The results of this study suggest 
that there may be significant divergences from the 
standard model, at least for large segments of postcom- 
munist societies. People with limited information and 
little faith in democratic institutions respond to eco- 
nomic performance in a fashion somewhat different 
from those who are knowledgeable and trusting. The
insufficiently informed and the democratic “cynics” are less
likely to reward incumbents for good economic outcomes.
Consistent with democratic theory, as people
gain more information and greater trust in political
institutions, their evaluations of incumbents may be
more responsive to the performance of government.


6 Toka (1994) finds levels of dissatisfaction with democracy in
Hungary to be high compared to other democracies. He also points
to a variety of explanatory factors, in addition to economic concerns.

Another intriguing finding is that the economic
effects on support for incumbents are asymmetric. The 
Hungarian evidence suggests that among those with 
generally negative economic assessments, small differ- 
ences in perception have no significant influence on 
their vote. To the extent that the economy matters in 
new democracies, it matters most for individuals with 
relatively positive views about economic performance 
(in addition to the information and trust criteria). 
Attitudes toward democracy and levels of political 
information moderate economic voting only if citizens 
have relatively positive assessments of the economy. As 
Table 1 makes clear, moving from feeling “a lot worse 
off” to “somewhat worse off” has little effect on likely 
support for incumbents. In transition democracies, 
particularly postcommunist societies, expectations or
hopes may be fairly high, and unless clear improvements
are seen, people do not respond positively to the
economic efforts of incumbent governments. 

These results confirm the essence of economic vot- 
ing theory and the work on heterogeneity in economic 
voting. Kramer’s (1971) important theoretical contri- 
bution was to highlight the importance of linking vote 
preferences to economic outcomes that were unambig- 
uously associated with government policy. This notion 
is perfectly consistent with the results reported here. 
Economic voting emerges in postcommunist elector- 
ates as ambiguity regarding the link between govern- 
ment policy and economic outcomes declines. The link 
becomes less ambiguous as citizens gain information 
about how democratic institutions function and de- 
velop confidence or trust in the responsiveness of these 
institutions to public preferences. 

Recently, a number of contributions to the literature 
focus on the heterogeneity of economic voting (Duch, 
Palmer, and Anderson 2000; Krause 1997; Mackuen 
and Mouw 1995; Rivers 1998; Smith 1998; Zaller 
2001). These build on Kramer’s notion that an ambig- 
uous link between economic outcomes and govern- 
ment policy reduces economic voting. The work on 
heterogeneity identifies segments of the population for 
whom this ambiguity is relatively high or low and points 
to self-interest and information costs as important 
factors. In the case of transition regimes, ambiguity is 
determined by the level of knowledge about nascent 
democratic institutions and the extent to which they are 
viewed as responsive. 

These findings highlight key challenges confronting 
many regimes in transition: Promote confidence in 
democratic institutions and disseminate information 
about the political process as well as economic perfor- 
mance. These steps are important for consolidating 
nascent democracies. Consistent with theory, as the 
democratic character of the population grows, eco- 
nomic voting will increase, and incumbents will be 
increasingly rewarded for good economic performance. 

Although there have been a number of successful 
consolidations, many nations either are in the early 
stages or have not yet begun the transition to democracy.
Those that choose to implement democratic
reforms are likely to face economic shocks associated 
either with internal policy initiatives or with trends in 
the global economy. Efforts to understand the political 
implications of these shocks are typically based on a 
very simplistic generalization of the classic notion that 
voters sanction incumbents for bad economic out- 
comes. The results reported here suggest that the 
relationship between the economy and voting also 
depends on political attitudes. 

An important implication is that high-risk strategies 
that promise large positive economic pay-offs are a 
dominant choice in these circumstances. Although the 
political cost of failure is high, it should be kept in mind
that very large numbers of citizens already have a 
negative economic evaluation; my results suggest that 
incremental improvements that might make individuals 
slightly less negative than the next person do not have 
any appreciable effect on support for incumbents. If the 
risky policies yield a very positive pay-off, however, a 
substantial number of people may alter their negative
assessment, which significantly benefits incumbents, 
especially if accompanied by a higher level of trust and 
political knowledge. This may partially explain the 
receptiveness of some postcommunist governments to 
the fairly draconian economic policies advocated by 
various international organizations. 

In order for incumbent governments in new democ- 
racies to reap the political benefits of an improved 
economy, they simultaneously need to promote the 
development of democratic aptitudes. A sequential 
strategy of first improving the economy and then 
attending to aptitudes is clearly suboptimal. Resources 
should be devoted to accelerating an education in 
democracy. Of course, this strategy has a potential 
political risk. If efforts to improve the economy fail but 
levels of trust and information rise, then incumbents 
will be punished. 

The unpleasant irony is that nascent democracies 
with the least sound economic policies tend to be 
regimes in which information and trust are least devel- 
oped, and incumbents are the least likely to garner 
political rewards for any small improvements. Exam- 
ples are some of the new regimes in Africa, Latin 
America, and Asia in which education levels are low 
and democratic aptitudes are still underdeveloped. The 
absence of rewards for small (but possible) economic 
improvements may encourage rent seeking on the part 
of incumbents rather than sound economic policies. 


APPENDIX A: THE 1997 HUNGARIAN 
PUBLIC OPINION SURVEY 


The 1997 Hungarian Public Opinion survey interviewed 1,544
respondents between November 26 and December 8. The 
questionnaire was designed by Raymond Duch, translated 
and administered by the Social Research Informatics Center 
of Budapest (TARKI), and funded by NSF Grant # SBR 
9600306. The survey codebook and data can be accessed at
www.uh.edu/democracy. TARKI (www.tarki.hu), a consortium
of nine academic organizations, has been conducting
and archiving survey data for more than ten years. The
probability sample was selected in multiple stages. In the first 
stage, localities were divided into eight strata, and 71 locali- 
ties were selected from these strata with simple probability 
sampling (without proportional stratification). In the second 
stage, the number of individuals to be sampled from each 
stratum was set according to the proportion of the adult 
population in that stratum. These proportions were modified 
according to the projected dropout rate. The presumed 
dropout rate for each stratum was determined on the basis of 
1996 Census characteristics. The names and addresses of 
individuals were acquired from the Central Registration and 
Election Office with simple probability sampling. 

The original sample was designed so that, after dropouts, 
the final sample size would be 1,700, but the dropout rate was 
higher than expected. The overall response rate was 55.6% 
(i.e., 2,775 addresses were visited). The average interview
length was 58 minutes. 

The demographic characteristics of the survey sample 
match those of the adult population relatively closely. 
Women constituted 54.7% of the sample, compared to 53.1% 
in the 1996 Census, and 20.5% of respondents reside in 
Budapest and 43.0% reside in towns (the rest in villages), 
compared to 19.4% and 43.7% in the 1996 Census. The 
sample was slightly older and less educated, however, than 
the adult population. In terms of age, 33.5% of respondents
were 18-39 years old and 45.7% were 50 or older (39.9% and 
40.1%, respectively, in the 1996 Census). Similarly, 11.8% of 
respondents are college or university educated, and 13.9% 
have less than an elementary school education (13.0% and
11.7%, respectively, for the adult population). 


APPENDIX B: CODING OF EXPLANATORY 
VARIABLES FROM THE 1997 HUNGARY 
SURVEY 

Education is the respondent’s highest education qualification. 
Coding is as follows: 0 for did not attend school and less than
elementary school, 1 for elementary school, 2 for vocational 
training, 3 for secondary school with a final examination, 4 for 
college degree, and 5 for university degree. Age cohort is 
coded as follows: 1 for 18 through 25 years, 2 for 26 through 
40, 3 for 41 through 70, and 4 for more than 71. Income is
self-reported after-tax household income (either actual
amounts in Forints or one of six income categories), and the
analysis combines these two responses into a single six-category
indicator. Occupation consists of four dummy variables:
unemployed, pensioners, self-employed, and white collar
(managers and nonmanual). All other occupations are cap- 
tured in the intercept. 

Intention to vote for incumbent parties was measured by: “If
the general election was held on the upcoming Sunday, which 
party would you vote for?” Responses were coded 2 for the 
two government parties from 1994-97, the MSZP and 
SZDSZ; 1 if the respondent was unsure; and 0 if the
respondent named any challenger party or did not plan to 
vote. 

The fact that the survey was conducted six months before 
the election could affect the response of the average voter. As 
the following table indicates, however, vote preferences in 
the survey are reasonably close to the actual outcome, 
although support for the incumbents (MSZP and SZDSZ) is
higher and support for the winner (FIDESZ) is somewhat 
understated. 

National economic evaluation was based on two items,
retrospective (“How do you think the economic situation of the
country has changed in these last 12 months?”) and prospective
(“In the next twelve months to come, do you expect the
economic situation of the country will improve a lot, improve
somewhat, remain the same, worsen somewhat, or worsen a
lot?”). Responses were coded from 0 for “worsen a lot” to 3 for
“improve somewhat” and “improve a lot” (the two categories
were collapsed because of the small number of observations
in the highest category); “don’t know” responses were coded
2 (i.e., the same as “remained the same”).

Personal financial evaluation, both retrospective and prospective,
was measured by “How did your and your family’s
financial situation change in the last 12 months?” and “Think
of your future. Will your and your family’s situation a year
from now improve a lot, improve somewhat, remain the
same, worsen somewhat, or worsen a lot?” Responses were
coded from 0 (“worsen a lot”) to 3 (“improve somewhat” and
“improve a lot”; these two categories were collapsed because
of the small number of observations in the highest
category); “don’t know” responses were coded 2 (i.e., the
same as “remained the same”).
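
The four-point coding described for the national and personal evaluation items can be sketched as a lookup table. The response labels below are paraphrased from the question wording, and the function name is hypothetical; the values assigned to the two intermediate categories follow from the stated 0 to 3 range and from the treatment of “don’t know” as equivalent to “remained the same.”

```python
# Response labels are paraphrased from the question wording above.
EVALUATION_CODES = {
    "worsen a lot": 0,
    "worsen somewhat": 1,
    "remain the same": 2,
    "improve somewhat": 3,
    "improve a lot": 3,   # collapsed with "improve somewhat"
    "don't know": 2,      # treated the same as "remain the same"
}


def code_evaluation(response: str) -> int:
    """Map a survey response label to its 0-3 evaluation code."""
    return EVALUATION_CODES[response.strip().lower()]


print(code_evaluation("Improve a lot"))    # 3
print(code_evaluation("Don't know"))       # 2
```
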

The Political knowledge variable is based on the scores 
resulting from the factor analysis of three variables that tap 
the respondent’s factual knowledge about politics. Political
sophistication is the interviewer's assessment of the respon- 
dent’s general level of information about politics (0 = “very 
low,” 4 = “very high”; “Cannot decide” was coded 2). 
Knowledge of the composition of the governing coalition was 
coded 2 if the respondent named either the MSZP or the 
SZDSZ; 1 if the respondent named any other Hungarian 
political party, and 0 if no party was named or for “Don’t 
know.” Knowledge of the party responsible for unemployment 
policy in the coalition was coded 2 if the respondent named 
either the MSZP or the SZDSZ; 1 if the respondent was able 
to identify any of the other Hungarian political parties; coded 
0 if no party was named or for “Don’t know.” Political
knowledge is represented as a dummy variable; factor scores 
less than 0 were coded 0, and those greater than 0 were coded 
1. 

Political trust is based on the scores resulting from the
factor analysis of four questions. (1) “How often do you feel 
that you can trust the government to do the right thing?” (2) 
“How do you think the people working for the government 
use the money of the taxpayers?” (3) “In your opinion the 
government. . .is run by a small number of people who are 
looking out for themselves, or is run for the benefit of the 
whole nation?” (4) “In your opinion the government in- 
cludes. . .a lot of dishonest people, some dishonest people, or 
no dishonest people at all?” For the analysis reported here, 
the variable is represented as a dummy variable; factor scores 
less than 0 were coded 0, and those greater than 0 were coded 
1. 

Distance from MSZP on market economy is the perceived 
distance (on an eleven-point scale) between the respondent's 
issue position on state control of the market economy and his
or her placement of the major governing party, the MSZP. At 
the low end of the continuum the statement reads: “The state 
should play an important role in controlling the market 
economy.” The high end of the continuum reads: “The state 
should not control the market economy.” Respondents who 
did not place themselves and/or did not place the MSZP on 
the issue continuum were given the mean distance score for 
the sample, 2.15. 
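
A minimal sketch of this distance measure with mean imputation follows. It assumes that the distance is the absolute difference between the two placements, which the description implies but does not state outright; the function name and the use of `None` for a missing placement are hypothetical.

```python
SAMPLE_MEAN_DISTANCE = 2.15  # mean distance reported for the sample


def distance_from_mszp(self_placement, mszp_placement):
    """Perceived distance on the eleven-point scale; None marks a missing placement."""
    if self_placement is None or mszp_placement is None:
        # Respondents missing either placement receive the sample mean.
        return SAMPLE_MEAN_DISTANCE
    # Assumption: distance is the absolute difference between placements.
    return float(abs(self_placement - mszp_placement))


print(distance_from_mszp(3, 7))      # 4.0
print(distance_from_mszp(None, 7))   # 2.15
```
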

Democratic satisfaction is measured by responses to “How
satisfied are you about the way the Hungarian democracy is
working?” Coding ranged from 1 (“not satisfied”) to 4 (“very
satisfied”), with “Don’t know” categorized as the sample
mean of 2.34. In this analysis, democratic satisfaction is
recoded into a dichotomous variable: values of 1 and 2 and
“Don’t know” were coded 0, and 3 and 4 were coded 1.

Asymmetric interaction terms include positive retrospective
and prospective national economic evaluations coded 0 for
the two negative categories (0 and 1), 1 for the neutral
category, and 2 for the positive category. Similarly, for
negative retrospective and prospective national economic
evaluations, the coding was: 0 for the neutral and positive
categories, -1 for the “worse” category, and -2 for the
“much worse” category.
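
The asymmetric recoding can be sketched as a split of the 0 to 3 evaluation score into a positive and a negative component. The function name is hypothetical, and the sketch covers only the recoding step described above, not any subsequent multiplication by the trust or information dummies.

```python
def asymmetric_terms(evaluation: int) -> tuple:
    """Split a 0-3 national economic evaluation into (positive, negative) terms."""
    if evaluation not in (0, 1, 2, 3):
        raise ValueError("evaluation must be coded 0, 1, 2, or 3")
    # Positive term: 0 for both negative categories, 1 neutral, 2 positive.
    positive = {0: 0, 1: 0, 2: 1, 3: 2}[evaluation]
    # Negative term: -2 for "much worse", -1 for "worse", 0 otherwise.
    negative = {0: -2, 1: -1, 2: 0, 3: 0}[evaluation]
    return positive, negative


for score in range(4):
    print(score, asymmetric_terms(score))
```

Splitting the scale this way lets the positive and negative ranges of economic sentiment carry separate coefficients, so a move from 0 to 1 need not have the same effect as a move from 2 to 3.
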


APPENDIX C: POLISH PUBLIC OPINION 
RESEARCH CENTER ELECTION STUDY, 
JUNE 1997 


The Polish study in June 1997 is based on a stratified 
representative random sample of 1,500 adult respondents. 
The response rate was 74%. Details on the sample design and 
fieldwork are available at http://www.cbos.pl/ENGLISH/cbos_en.htm.

Incumbent vote is based on: “If a parliamentary election 
were held next Sunday would you vote? Which party would 
you vote for?” Responses were coded 2 for one of the 
incumbent governing coalition parties (the Polish Peasants 
Party or the Alliance of Democratic Left), 1 for “Don’t 
know” or “Uncertain,” and 0 for a challenger. 

Retrospective national economic evaluation is based on:
“How do you evaluate the economic situation in Poland?”
Prospective national economic evaluation is measured by: “In
the next year will the economic situation improve or deteriorate?”
Responses ranged from strongly deteriorate (0) to
strongly improve (4).

Retrospective personal financial evaluation is based on: 
“How do you evaluate your and your family’s living condi- 
tions?” Responses ranged from very bad (0) to very good (4). 
Prospective personal financial evaluation is measured by: “Do 
you think that a year from now you and your family will be 
doing. . . ?” Responses ranged from much worse (0) to much 
better (4). 

Confidence in political system is based on items that ranged 
from “political system is bad and needs changing” (0) to “the 
political system is good” (3). 

Education ranged across eight categories, from “less
than primary” to “college.” 

The Combined confidence and education dummy was coded 
1 for those with both a high level of confidence and informa- 
tion, 0 for all other respondents. 

The income question asks for per-capita household in- 
come, and occupation consists of dummy variables for pen- 
sioner, skilled worker, white-collar employee, self-employed, 
and unemployed. 
committee to be representative of the chamber on all its policy dimensions simultaneously. In addition, with
regard to distributive issues salient to members, we find that both committees are homogeneous,
high-demand outliers. Our study has important implications for distributive, informational, and partisan
theories of legislative behavior and organization.


The agriculture policy domain should be understood as a 
useful set of programs that at any one time meets the needs 
of numerous public policy players. 

—Browne 1995, 39 


There has been substantial debate among students
of Congress concerning the theoretical and empirical
applicability of various theories of legislative
behavior and organization. Some scholars contend
that one perspective is a sufficient explanation of
legislative organization and decision making (Krehbiel
1991), but others argue that various theories apply at
varying times and under differing circumstances.
“Rather than seek a single, universal account of congressional
politics—whether distributive, informational,
partisan, or anything else—we should recognize
that all of these considerations operate in varying
degrees and that the variation is systematic and predictable”
(Rohde 1995b, 134).1 This study is an effort to
extend and apply the latter view.
We want to assist in the effort to see that these
varying legislative theories are “knitted together”
(Shepsle and Weingast 1995a, 6). To that end, we
analyze agriculture policymaking in the 104th House. 
The House Agriculture Committee is considered a 
classic constituency committee. Assignments to it are 
dominated by constituency interests (Deering and 
Smith 1997; Jones 1961), and its policies fit the distributive
politics mold (Ripley and Franklin 1991).2 With a
new Republican majority for the first time in 40 years, 
however, the 104th House exhibited some of the most 
partisan behavior in this century (Aldrich and Rohde 
1996; Fenno 1997). Moreover, a number of legislative 
initiatives, generally supported by the new House lead- 
ership, sought to eliminate certain entrenched agricul- 
ture programs, which posed a potential conflict be- 
tween the partisan and distributive inclinations of some 
members. 

We use a number of roll call votes on agriculture 
policy in the 104th Congress to analyze the behavior of 
members of both the House Agriculture Committee 
and the Agriculture and Rural Development Subcom- 
mittee of the House Appropriations Committee in 
comparison to the remainder of the House. Our pur- 
pose is not to support one theoretical view over 
another. Rather, evidence indicates that various perspectives
apply simultaneously, and each perspective
does not account for some empirical patterns.


THEORIES OF LEGISLATIVE BEHAVIOR 
AND ORGANIZATION 


The three theories of legislative behavior and organi- 
zation of interest here are addressed in detail else- 
where (see, e.g., the articles in Shepsle and Weingast 
1995b), and we will discuss them only briefly. The 
distributive perspective posits that legislative gains 
from trade result from exchange of votes and influence 
across multiple jurisdictions. (Early works include 
Ferejohn 1974 and Rundquist and Ferejohn 1975. 
Among more recent works are Rundquist, Lee, and
Rhee 1996 and Adler and Lapinski 1997.) This process
is reinforced by the committee system, both because of
differing preferences between committees and the floor
and through the use of conference committees (see
Fiorina 1987; Shepsle and Weingast 1987; Weingast
and Marshall 1988).


2 Indeed, Lowi (1964) specifically cites agriculture policy as an
example of distributive policy in his original classification.

Krehbiel (1991, 1998) contends that instead of log- 
rolling to maximize distributive gains, legislators act in 
order to obtain and distribute information. His model 
posits a unidimensional policy space, at least within the 
jurisdiction of individual committees (see, e.g., Gilligan 
and Krehbiel 1987). According to informational the- 
ory, committees generally are representative of the 
parent body and provide it with information on the 
merits of legislation within the committee’s jurisdic- 
tion. From this perspective, “legislative choices in 
salient policy domains are median choices” (Krehbiel 
1991, 263). 

Partisan theories contend that parties can exert an 
independent influence on behavior and outcomes (see 
Aldrich 1995; Cox and McCubbins 1993). Rohde’s 
(1991) “conditional party government” theory argues 
that a changing environment in the electorate resulted 
in greater homogeneity within the parties in the House, 
and when certain issues exhibit strong party congru- 
ence, members of the majority party will provide the 
leadership with authority to exert influence to ensure 
desired partisan results (see Aldrich and Rohde 
2000b). The theory is labeled “conditional” because it 
applies only in specified circumstances. Moreover, 
Rohde (1995a, 4) contends that each of these perspec- 
tives on legislative organization has applicability: 

The complex environment I have been discussing cannot 
be captured by any single-motivation, universal perspec- 
tive, and we should not seek to demonstrate that one of the 
rationales above is superior to the others across all con- 
gressional activity. Our challenge, as students of congres- 
sional politics, is instead to specify the conditions under 
which each rationale is relevant to behavior and outcomes 
of interest, and in particular the ways in which the sets of 
multiple motivations interact to affect results. 


We share with informational theory the view that the 
median floor preference is a powerful force in major- 
ity-rule institutions such as the House. Yet, under some 
conditions, either distributive or partisan influences 
can move the legislative outcome away from the center 
of the floor. That is, House members may desire 
committees to provide information regarding legisla- 
tion within their jurisdiction, but that is not necessarily 
the only thing that parties and members want from 
committees, so it is not the only structuring influence 
(see Jones, Baumgartner, and Talbert 1993). 

Under what conditions would we expect the legisla- 
tive outcome to shift from the median House prefer- 
ence? From a distributive viewpoint, when certain 
issues within the jurisdiction of a particular committee 
are of great import to its members but not to the rest 
of the House, the committee will use its influence in 
various ways to ensure that the outcome on the floor 
accords with its policy preferences (Shepsle and Wein- 
gast 1987). Preferences on these distributive issues will 
cut across party lines on the basis of geographic or 
economic interests, and the committee may be able to 
affect the legislative outcome if the preferences of its 
members are both sufficiently strong and homoge- 
neous. As Adler and Lapinski explain (1997, 899), this 
condition includes the traditional notion of a commit- 
tee, such as House Agriculture, as a supplier of “pri- 
vate goods” to its constituents. We discuss this perspec- 
tive in greater detail in the section on preference 
outliers. 

From a partisan viewpoint, member preferences on 
partisan issues will be shaped by conflicts along party 
lines among rank-and-file voters or among party activ- 
ists (Jacobson 2000). When the majority party is rela- 
tively (but not perfectly) homogeneous in its beliefs on 
an issue salient to its members, and those beliefs are 
different from the floor median, the majority will use its 
influence over the rules and structures of the body to 
attempt to steer the outcome away from the center of 
the floor and toward the median of the majority party 
(see, e.g., the discussion of the Clinton impeachment 
resolution in Aldrich and Rohde 2000a). For example, 
the majority party may populate certain committees 
with party loyalists and offer positive or negative incen- 
tives to influence the preferences of some House 
members (Aldrich and Rohde 1997; Binder, Lawrence, 
and Maltzman 1999; Cox and McCubbins 1993), or 
through rules and in other ways it may set the agenda 
of the House (Aldrich and Rohde 2000b; Binder 1997). 

In general, when an issue is not highly salient to a 
constituency committee or the majority party, or when 
the membership of either is not sufficiently homoge- 
neous on a salient issue, we would expect no distribu- 
tive or partisan influence. Instead, the preference of 
the median member should dictate the outcome, as 
informational theory posits. We do not imply that 
distributive issues have no partisan content or vice 
versa.3 Certainly, these analytic categories have real-world 
overlap, and we classify them on the basis of 
their predominant character. Some issues will provoke 
partisan divisions, others will involve distributive con- 
cerns, some others will involve both, and still others 
neither. For convenience, we will describe the set of 
distributive issue dimensions as the distributive domain 
and the set of partisan issues as the partisan domain. 

Related to these general theoretical expectations is 
the question of multidimensionality. Specifically, can 
legislative policymaking be represented adequately by 
a single dimension, within either the whole legislature 
or individual committees? Our views on this matter 
derive partly from the preceding theoretical discussion 
and partly from our assumptions about member moti- 
vations. Some theories contend that legislators are 
motivated by a single goal, such as the desire for 
reelection (Mayhew 1974). Following Fenno (1973), we 
assume that members of the House have multiple goals 
(reelection, influence within the House, good public policy), that some may 
hold more than one of these goals, and that the goals 
potentially can conflict. 








3 We also do not imply that members who seek a nonmedian 
outcome on distributive or partisan issues will always be successful. 
Rather, we are trying to identify the issues on which they will attempt 
to achieve that end. 


The ideas that distributive, partisan, and informational 
interests can be invoked in a single issue and that 
members can have complex and conflicting motives 
imply multidimensionality. Poole and Rosenthal 
(1997, 19) state: “Just how many dimensions are 
needed to describe the structure of roll call voting is an 
empirical question.” Based on their analysis of two 
centuries of congressional voting, they contend that 
decision making is unidimensional. That is, in most 
Congresses, one dimension accounts for a large portion 
of the variance across all votes, and a second dimension 
usually does not account for very much. If one’s 
purpose is to provide an empirical characterization of 
the entire scope of congressional decision making, as 
they do, we agree entirely. As a theoretical matter, 
however, to explain decisions of committees and members 
on particular pieces of legislation, we believe that 
one dimension is not adequate.4 This does not mean we 
disagree with the Poole and Rosenthal analysis. 
Rather, we are focusing on different things. The con- 
text of our analysis is multiple dimensions within a 
narrow subset of the full policy space. The secondary 
dimensions within each of these subsets would not be 
evident in an aggregate analysis of congressional roll 
calls. That is, we believe secondary dimensions are 
important within each narrow issue area, but their 
alignments are dissimilar across issues.5 

Based on this conception, multidimensionality can 
occur within the jurisdiction of a single committee. For 
instance, more than one distributive dimension may be 
relevant within a committee because committees 
are composed of members with diverse interests 
and goals (Adler 2000). Indeed, one empirical 
feature we wish to demonstrate is that multiple dimen- 
sions at times are manifested within the same bill 
(Aldrich and Rohde 1997-98; Binder, Lawrence, and 
Maltzman 1999; Rohde 1995a, 1995b). 

Our view is similar to that of Maltzman and Smith 
(1995, 269), who contend that the issues considered by 
the Agriculture Committee include a social welfare 
dimension (e.g., food stamps), in which respect the 
committee is somewhat representative of the chamber, 
as well as an agricultural subsidy dimension, which is 
primarily a constituency benefit, and in this regard the 
committee is rather an outlier. Yet, we think their 
dichotomy is not sufficient and disagree with an as- 
sumption of their model, namely, that the Agriculture 
Committee deals in multiple dimensions only when 
very distinct issues are addressed. Instead, we contend 
that multidimensionality can exist on similar issues, 
even within the same bill, and any theory that seeks to 
explain or predict legislative behavior must reflect this 
empirical reality (see also, Maltzman 1997). 


COMMITTEE OUTLIERS AND 
JURISDICTION 


The issue of committees as preference outliers is 
related to the debate on the number and type of policy 
dimensions within the House. Essentially, a committee 
is an outlier if the preferences of its members systematically 
differ from those of the legislature as a whole. 
The members “more or less homogeneously demand 
high levels of benefits from policies that fall into their 
committees’ jurisdictions” (Krehbiel 1990, 149; see also 
Niskanen 1971).6 Because the outlier debate is addressed 
extensively elsewhere, we highlight only a few 
of the points most relevant to our analysis. 


4 Krehbiel (1991), however, contends that a unidimensional model is 
sufficient to explain legislative organization and behavior. 

5 In this connection, see Koford (1989, 1994), who contends that the 
techniques used by Poole and Rosenthal have weaknesses in detecting 
small dimensions, such as those in which we are interested. 

In an effort to test whether outliers exist in Congress, 
Krehbiel (1990) compared mean and median interest 
group ratings for certain committees to those of nonmembers, 
the null hypothesis being no difference between 
chamber and committees.7 He reports very few 
outlier committees for the years in his study, as only 
House Armed Services consistently achieved that sta- 
tus. He concludes that the House Agriculture Commit- 
tee was not an outlier, grouping it with others as 
“microcosms of the House” (p. 155), although he 
claims (p. 158): “At best, then, the Agriculture Com- 
mittee is a somewhat homogeneous medium-high de- 
mander [of benefits].” Furthermore, Krehbiel (p. 158) 
contends that Senate Agriculture exhibited more “an- 
tiagriculture” behavior than the full Senate. His clear 
contention is that committees are, at least roughly, 
mirrors of the floor: “At a minimum, the results of this 
study force us to entertain the possibility that the 
standard preference outlier story is a stylized fiction” 
(p. 159). 

The study by Hall and Grofman (1990) of the 
committee assignment process and its implications for 
preference outliers and related issues provides a somewhat 
different perspective. They assert that committee 
bias depends upon the expanse (i.e., heterogeneity) of 
its jurisdiction. They also question the use of interest 
group ratings to measure outliers, because this intro- 
duces “measurement error that will systematically di- 
minish the likelihood that a statistical test will establish 
a committee-chamber difference when one exists” (p. 
1155). This is especially true when roll calls on non- 
jurisdictional committee issues are used in the analy- 
sis. Applying jurisdiction- and issue-specific data, Hall 
and Grofman reach conclusions different from Krehbiel’s. 
In particular, they found that Senate Agriculture 
is homogeneous and “is clearly not antiagriculture” 
(p. 1158). They also report that the Senate 
Agriculture Appropriations Subcommittee is a “homo- 
geneous high-demander” (p. 1161). 

We do not perceive interest group ratings per se as 
the problem. Because we expect committees to be 
outliers on some (e.g., distributive) issue dimensions 
but not on others (e.g., those salient to parties), the 
relevant measurement issue concerns the appropriate 
level of analysis. Both Krehbiel and Hall and Grofman 


6 The discussion of outliers arises most often among 
distributive theorists because, due to (probably implicit) legislative 
logrolling, an outlier committee can influence policies that primarily 
benefit its own constituency. 

7 Krehbiel (1990) also tested for bipolar committee outliers, not 
simply the central tendency of a committee, but the work on 
homogeneous outliers is most relevant here. 

8 Snyder (1992) similarly asserts that interest group data can produce 
bimodal conclusions when such a condition does not exist, resulting 
in the incorrect rejection of the null. In response to these various 
assertions, Krehbiel (1994) defends his measurement techniques. 


employ an aggregate measure of the committee median 
versus that of the chamber.9 To determine 
whether the committee is representative of the floor, 
we believe it is necessary to examine committee mem- 
ber and nonmember behavior on particular roll calls. 
Accordingly, we compare the floor votes of Agriculture 
Committee members and nonmembers on matters 
specific to the committee’s jurisdiction. This allows us 
to analyze whether (and when) members of a commit- 
tee have preferences that are different from those of 
the parent chamber, whether the committee is an 
outlier, and under what circumstances. Moreover, if 
the committee is an outlier, we can then explore 
whether this translates into committee power, as dem- 
onstrated by legislative outcomes that are clearly af- 
fected by the preferences of the committee members. 


AGRICULTURE POLICY AND COMMITTEES 
IN THE 104TH HOUSE 


With “jurisdiction over programs that have concen- 
trated benefits (for one’s constituents) but widely dis- 
persed costs,” the House Agriculture Committee is 
considered one of the “constituency committees” from 
which classic distributive policies generally flow (Adler 
and Lapinski 1997; Deering and Smith 1997, 75). In 
addition, while the Appropriations Committee is regarded 
as one of the chamber’s “prestige committees” 
(Deering and Smith 1997, 63), its Agriculture and 
Rural Development Subcommittee (hereafter, agriculture 
appropriations subcommittee) exists in a similar 
environment to the Agriculture Committee, as it determines 
spending allocations for agriculture policy 
each fiscal year. We chose to study both in our comparison 
of committee member behavior to that of the 
House. 

One consequence of the 1994 elections was that 
agriculture policy in the 104th House broke with 
tradition because of the partisan dimension injected 
into an otherwise distributive context. At least since the 
Eisenhower years, most of the GOP had been staunch 
defenders of farm price supports, but in 1995 the 
Agriculture Committee held hearings on the so-called 
Freedom to Farm bill (HR 2195), which was strongly 
supported by the new Republican leadership. In addition 
to reauthorizing farm programs, the primary purposes 
of the bill were to cut program budgets in general 
by about $16 billion over seven years and to eliminate 
and/or phase out specific farm subsidies (Congressional 
Quarterly Weekly Report 1995, 2717-18). As originally 
drafted, the bill “threatened to shatter GOP unity” 
(Congressional Quarterly Weekly Report 1995, 1824). In 
fact, after it held hearings, the Agriculture Committee 
rejected Freedom to Farm when four southern Republicans 
joined all the Democrats in opposition. It was 
apparent that partisan and distributive motivations 
were in conflict for some committee members. 

Another version of the bill was introduced by Chair- 
man Pat Roberts (R-KS) in 1996 (HR 2854). Although 
the committee approved this draft with greater GOP 
support, dissension among committee Republicans reappeared 
on the floor when various amendments designed 
to phase out price supports, such as peanut and 
sugar programs, were offered. Our analysis concerns a 
number of roll calls from this version of Freedom to 
Farm (HR 2854). Comparable votes are chosen from 
the agriculture appropriations bills for fiscal 1996 (HR 
1976) and 1997 (HR 3603). 


9 Hall and Grofman (1990, 1152) discuss the appropriate level of 
analysis, but they opt for the subcommittee level. 

The agriculture policy issue offers a particularly 
useful context within which to test the competing 
perspectives. The separate initiatives to eliminate specific 
farm programs provide a distinctive opportunity to 
separate distributive preferences from partisan or 
other motivations. Most studies that use roll call data 
focus on conflict, since that is what most roll calls 
reveal. Yet, because distributive programs ordinarily 
pass by consensus (Weingast 1979), either by voice vote 
or nearly unanimous roll calls, variations in preferences 
usually are not revealed. Nevertheless, when the era of 
expanding government effectively ended with the in- 
dexing of tax brackets in 1981 and ballooning budget 
deficits, competition increased among members for 
dispensing pork to their constituencies. Thus, if the 
floor seeks to cut substantially or eliminate a distributive 
program that is salient to a constituency committee, 
an opportunity arises to examine conflict in distributive 
policy.10 

Let us summarize the theoretical expectations that 
will be judged against the evidence. We expect that 
multiple issue dimensions will be revealed in the voting 
on agriculture policy. More specifically, some issues 
will divide members along party lines. Others will cut 
across party lines based on constituency interests. 
Members from farm districts will be highly supportive 
of distributive agriculture policies, and those from 
nonagricultural districts will be much less supportive. 
These patterns will be reinforced by committee mem- 
bership: Members of the two agriculture committees 
will exhibit greater support for distributive policies 
than other members, ceteris paribus. This means that 
we expect the agriculture committees to be high de- 
manders of benefits and unrepresentative of the House 
on distributive matters, in contrast to the predictions of 
Krehbiel’s informational theory. Note, however, that 
this expectation applies only to distributive issues; we 
expect the committee to be more representative of the 
chamber on partisan issues. Finally, we expect these 
predicted effects to hold in a multivariate analysis that 
controls simultaneously for other variables of theoretical 
interest. 


DATA AND METHODS 


In addition to the 1996 farm bill, the 104th Congress 
passed two agriculture appropriations bills, one for 
fiscal 1996 (hereafter, FY96) and one for fiscal 1997 
(hereafter, FY97). Although these involved less conflict 
than Freedom to Farm, there were partisan and 
distributive aspects that shed light on the multiple 
interests of committee members. In floor action on 
these three bills there were thirty-seven roll calls, 
twelve on the farm bill, nineteen on FY96, and six on 
FY97. In order to draw the clearest comparisons for 
issue types, we selected all roll calls that we could 
identify a priori as either distributive or partisan. In 
particular, these roll calls allow us to compare member 
behavior on distributive and partisan dimensions in the 
same bill. 

Distributive issues reflect Deering and Smith’s 
(1997, 75) description of the jurisdiction of a constituency 
committee, which addresses programs with 
broadly dispersed costs and concentrated benefits. All 
the roll calls we identify a priori as distributive were 
proposals either to eliminate a program altogether or 
to reduce substantially its funding level. On these 
votes, which would likely produce conflict, we expect 
legislators whose constituencies would be detrimentally 
affected to vote against the amendment. Thus, distributive 
behavior is revealed, a pattern not discernible 
when distributive legislation is passed by consensus 
(Weingast 1979). 


10 Although efforts to eliminate farm programs represent such an 
opportunity, this situation is not necessarily unique to agriculture 
policy or the 104th House. Conflict should emerge whenever a 
distributive program is exposed to major cuts on the floor. Examples 
include, inter alia, termination or reduction of funding for the arts, 
environmental programs, the space station, and the superconducting 
supercollider as well as the closure of military bases at the end of the 
Cold War. 

11 The creation of a similar new program or the expansion of an 
existing one also would meet these criteria, but there were no roll 

Ferejohn 1974). 


TABLE 1. Vote on Selected Farm Bill Amendments by Member Statusᵃ 

                                          Total              Republicans        Democrats 
Distributive Amendment to Phase Out Peanut 
Price Support Program, Roll Call 2034 
  All members                             209-212 (49.6%)    125-108 (53.6%)    84-104 (44.7%) 
  Agriculture Committee                   6-42 (12.5%)       2-25 (7.4%)        4-17 (19.0%) 
  Appropriations subcommittee             2-9 (18.2%)        1-6 (14.3%)        1-3 (25.0%) 
  Noncommittee membersᵇ                   201-161 (55.5%)    122-77 (61.3%)     79-83 (48.8%) 
Partisan Democratic Substitute Amendment, 
Roll Call 2040 
  All members                             163-258 (38.7%)    1-233 (0.4%)       162-25 (86.6%) 
  Agriculture Committee                   19-27 (41.3%)      0-27 (0.0%)        19-0 (100.0%) 
  Appropriations subcommittee             4-7 (36.4%)        0-7 (0.0%)         4-0 (100.0%) 
  Noncommittee membersᵇ                   140-224 (38.5%)    1-199 (0.5%)       139-25 (84.8%) 
Summary Statistics 
Distributive Peanut Amendment (Roll Call 2034) 
  Committee vs. noncommittee membersᵇ     χ²(1) = 31.38**    χ²(1) = 27.89**    χ²(1) = 6.50* 
  Subcommittee vs. noncommittee membersᵇ  χ²(1) = 6.00*      χ²(1) = 6.22*      χ²(1) = 0.88 
Partisan Democratic Substitute (Roll Call 2040) 
  Committee vs. noncommittee membersᵇ     χ²(1) = 0.14       χ²(1) = 0.14       χ²(1) = 3.36 
  Subcommittee vs. noncommittee membersᵇ  χ²(1) = 0.02       χ²(1) = 0.04       χ²(1) = 0.72 

ᵃThe pairs are yea-nay votes; the percentage voting yea is in parentheses. 
ᵇRepresentatives who are not members of either the Agriculture Committee or the agriculture appropriations subcommittee. 
*p ≤ .05, **p ≤ .01. 


Issues are considered partisan when they invoke the 
interests of competing partisan electoral coalitions or 
of respective groups of party activists (see Schlesinger 
1975). Examples are social welfare or environmental 
policy and questions regarding the appropriate (or 
inappropriate) scope of federal action. If an issue 
ordinarily divides the parties and leaders in their 
endeavor to move public policy in their preferred 
direction, we deem such a roll call to be partisan. 


RESULTS AND DISCUSSION 


For the selected roll calls we make direct comparisons 
of the committee and subcommittee members’ floor 
votes relative to the rest of the chamber, controlling for 
party. We also address the manner in which district 
characteristics influence committee membership and 
voting behavior. Finally, we offer multivariate logit 
analysis of certain roll calls, which allows us to examine 
together the various influences on floor voting on these 
issues. 
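
The direct committee-floor comparison can be illustrated with a short computation. The following Python sketch is an illustration only, not a reproduction of the authors' procedure: it computes a Pearson chi-square statistic with one degree of freedom (no continuity correction) for a 2 x 2 table of yea and nay votes, using the Table 1 tallies for Agriculture Committee members and noncommittee members. The noncommittee tally on roll call 2034 is taken as 201-161, the split consistent with the 55.5% yea share and the forty-vote margin reported for that group. The function names are hypothetical.

```python
import math


def chi_square_2x2(a_yea, a_nay, b_yea, b_nay):
    """Pearson chi-square (1 df, no continuity correction) for two groups' yea/nay counts."""
    observed = ((a_yea, a_nay), (b_yea, b_nay))
    row_totals = [sum(row) for row in observed]
    col_totals = [a_yea + b_yea, a_nay + b_nay]
    n = sum(row_totals)
    stat = 0.0
    for i in range(2):
        for j in range(2):
            # Expected count under the null of no committee-floor difference.
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed[i][j] - expected) ** 2 / expected
    return stat


def p_value_1df(stat):
    """Upper-tail probability of a chi-square variate with one degree of freedom."""
    return math.erfc(math.sqrt(stat / 2.0))


# Agriculture Committee vs. noncommittee members (all parties), from Table 1.
peanut = chi_square_2x2(6, 42, 201, 161)        # roll call 2034 (distributive)
substitute = chi_square_2x2(19, 27, 140, 224)   # roll call 2040 (partisan)

print(f"Roll call 2034: chi2(1) = {peanut:.2f}, p = {p_value_1df(peanut):.2g}")
print(f"Roll call 2040: chi2(1) = {substitute:.2f}, p = {p_value_1df(substitute):.2g}")
```

Under these assumptions the two statistics come out at about 31.38 and 0.14, the values shown in the summary rows of Table 1: a large committee-floor gap on the distributive vote and essentially none on the partisan one.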


Distributive and Partisan Votes on 
Agriculture Policy 


Table 1 reports on two amendments to the farm bill, 
one classified a priori as distributive, the other as 
partisan. The distributive vote was on roll call 2034, an 
amendment that would phase out over seven years 
federal support to peanut farmers (see Appendix B for 
coding of roll calls). This is clearly a distributive issue. 
It is striking that this amendment failed, even if by only 


13 Roll calls not selected for analysis either were not theoretically 
relevant or presented such complex issues that they were difficult to 
classify. See Appendix A for a listing and a priori identification of the 
thirty-seven roll calls. 




three votes, given its strong support from noncommit- 
tee members. Yet, there was a clear consensus among 
members of both the Agriculture Committee and the 
agriculture appropriations subcommittee against end- 
ing the program, and they overcame a forty-vote deficit 
among the noncommittee members to preserve it. If 
the committee preferences had mirrored the rest of the 
chamber even a little more closely, the amendment 
would have passed. This is a clear example of outlier 
behavior by the committees, behavior that reversed the 
outcome preferred by the rest of the House. 
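
The arithmetic behind this reversal can be laid out explicitly. The Python sketch below is illustrative and uses only the roll call 2034 tallies from Table 1, with the noncommittee vote taken as 201-161, the split that matches the forty-vote margin just described. The helper names are hypothetical.

```python
# Yea-nay tallies on roll call 2034 (peanut amendment), taken from Table 1.
tallies = {
    "noncommittee": (201, 161),
    "agriculture_committee": (6, 42),
    "appropriations_subcommittee": (2, 9),
}


def margin(yea_nay):
    """Yea votes minus nay votes for one group."""
    yea, nay = yea_nay
    return yea - nay


floor_margin = margin(tallies["noncommittee"])
panel_margin = (margin(tallies["agriculture_committee"])
                + margin(tallies["appropriations_subcommittee"]))
overall_margin = floor_margin + panel_margin


def switches_needed(yea, nay):
    """Smallest number of nay-to-yea switches that gives yea a strict majority."""
    k = 0
    while yea + k <= nay - k:
        k += 1
    return k


total_yea = sum(y for y, _ in tallies.values())
total_nay = sum(n for _, n in tallies.values())

print("Noncommittee margin:", floor_margin)
print("Committee plus subcommittee margin:", panel_margin)
print("Overall margin:", overall_margin)
print("Switches needed for passage:", switches_needed(total_yea, total_nay))
```

On these figures the noncommittee members favored the amendment by forty votes, the two panels opposed it by forty-three, and two switched votes among the fifty-one panel opponents would have changed the result.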

Examination of the partisan patterns yields further 
interesting results. Although the roll call to terminate 
the distributive peanut program could be classified as a 
party-unity vote—a small GOP majority favored the 
amendment, while the Democrats opposed it—both 
parties on the floor showed substantial internal division. 
Within the committee and subcommittee, however, 
clear majorities of both parties staunchly opposed 
the amendment. This bipartisan resistance on the 
committees enabled them to overturn the preferences 
of the noncommittee members on this distributive 
issue. 

Table 1 also shows the vote on Rep. Charles Sten- 
holm’s (D-TX) amendment to the farm bill, roll call 
2040. The Democrats’ substitute amendment, which 
proposed an alternative to the GOP’s policy, clearly is 
a partisan issue. It supported greater federal involve- 
ment in agriculture, in contrast to the Republican 
leaders’ phase out. As we would expect, the committee 
voting pattern was nearly identical to that of the 
noncommittee floor in a strong party-unity result. In 
fact, there were no partisan defectors in either com- 
mittee. As predicted, committee member behavior is 
substantially different when faced with a partisan roll 
call than when a distributive vote is at issue. 

The analysis of these two roll calls shows that both 
distributive and partisan dimensions were present in 
the farm bill.14 That is, multidimensionality exists in the 
legislative environment, even within a single bill. These 
roll calls are not unique, however. They are represen- 
tative of the outcome and voting patterns for the other 
roll calls on the three bills. On every distributive roll 
call, an overwhelming majority of Agriculture Commit- 
tee members voted in opposition to a majority of 
nonmembers on the floor. The same was true of the 
subcommittee on nearly all votes; its members voted 


14 For analysis of all relevant roll calls, see Hurwitz, Moiles, and 
Rohde 1997. 


TABLE 2. 

                          Noncommittee 
Roll Call                 Members versus    Total    Republicans    Democrats 
All Distributive Votes    Committee         37.0     40.0           34.4 
                          Subcommittee      35.9     40.1           30.6 
All Partisan Votes        Committee         0.3      3.3            4.8 
                          Subcommittee      12.0     18.7           4.1 


consistently with the Agriculture Committee, not the 
rest of the House. In fact, committee and subcommit- 
tee behavior on the distributive votes had a profound 
effect on the outcome; in five of the roll calls the 
amendment’s failure was a direct result of committee 
member divergence from nonmembers. Yet, on parti- 
san votes the decisions of committee members rather 
closely resembled those of their party colleagues.15 

Further evidence of multidimensionality, as well as 
the representativeness of the roll calls presented above, 
is found in Table 2. Here we present the mean absolute 
percentage differences in the percentage voting “yea” 
on all relevant roll calls between the committee or 
subcommittee compared to the rest of the House. 
These results confirm the divergence of both panels 
from the rest of the House on distributive issues. 
Across distributive roll calls, the mean absolute differ- 
ence between both the committee and subcommittee 
and other representatives was more than 35%. This is 
in stark contrast to the absolute differences on partisan 
votes of 0.3% and 12% for committee and subcommit- 
tee, respectively. (Comparable values are obtained for 
each party separately.) These findings illustrate that 
when committee members were confronted with 
threats to distributive programs salient to their constit- 
uents, their behavior clearly differed from that of the 
chamber. Yet, when partisan issues were at stake, 
committee members acted in concert with the House. 
The relatively high deviation between committee Re- 
publicans and GOP colleagues on the floor on distrib- 
utive (but not partisan) votes implies a conflict in their 
distributive and partisan motivations. 
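
The measure reported in Table 2 can be sketched as follows. This Python fragment is a minimal illustration, assuming the measure is the average over roll calls of the absolute gap between a panel's percentage voting yea and that of noncommittee members. Only the two roll calls shown in Table 1 are used here (with the noncommittee tally on roll call 2034 taken as 201-161), so the output is not the Table 2 figure, which averages over all relevant roll calls. The function names are hypothetical.

```python
def pct_yea(yea, nay):
    """Percentage of a group voting yea."""
    return 100.0 * yea / (yea + nay)


def mean_abs_diff(roll_calls):
    """Mean absolute gap in percentage yea between a panel and noncommittee members.

    roll_calls is a list of ((panel_yea, panel_nay), (floor_yea, floor_nay)) pairs.
    """
    gaps = [abs(pct_yea(*panel) - pct_yea(*floor)) for panel, floor in roll_calls]
    return sum(gaps) / len(gaps)


# Agriculture Committee vs. noncommittee members, tallies from Table 1.
distributive_votes = [((6, 42), (201, 161))]   # roll call 2034
partisan_votes = [((19, 27), (140, 224))]      # roll call 2040

print(f"Distributive: {mean_abs_diff(distributive_votes):.1f} points")
print(f"Partisan: {mean_abs_diff(partisan_votes):.1f} points")
```

For these two votes the gap is roughly 43 points on the distributive amendment and under 3 points on the partisan substitute, the same contrast that Table 2 reports in averaged form.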


The Effect of District Characteristics on 
Voting Behavior 


The discussion has focused primarily on the contrasting 
voting patterns of committee members and other rep- 
resentatives. It also is important to consider the effect, 
if any, of district characteristics on the voting behavior 
of these groups, particularly in light of the claim by 
Krehbiel (1990) that the Agriculture Committee is not 


15 For instance, in all the distributive roll calls, differences between 
the committee and floor were significant at better than the .001 level. 
Subcommittee and floor differences were also significant for most of 
these votes, although the differences were not as large. In contrast, in 
only one of the partisan votes (roll call 1543) was the committee and 
floor difference significant, and in that case the committees voted in a 
more partisan manner than noncommittee members. 


a high demander of benefits. Two indicators are relevant 
for agriculture policy. The first is Farm Employment 
as a percentage of district employment. The 
second is Rural Population as a percentage of district 
population. For each variable, the House membership 
is divided into thirds, representing low, middle, and 
high levels of both district farm employment and rural 
population (see Appendix B). 
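
The division into thirds can be sketched in a few lines. The Python example below is an illustration under stated assumptions: it ranks districts on one variable and cuts the ranking into three approximately equal groups. The authors' actual coding is described in their Appendix B, which is not reproduced here. The function name and the sample values are hypothetical, and ties are broken by input order.

```python
def assign_thirds(values):
    """Label each value 'low', 'middle', or 'high' by its rank, in roughly equal thirds."""
    n = len(values)
    # Indices of the districts ordered from smallest to largest value (stable sort).
    order = sorted(range(n), key=lambda i: values[i])
    labels = [None] * n
    for rank, i in enumerate(order):
        if rank < n / 3:
            labels[i] = "low"
        elif rank < 2 * n / 3:
            labels[i] = "middle"
        else:
            labels[i] = "high"
    return labels


# Hypothetical farm-employment percentages for six districts.
sample = [5.0, 0.1, 2.0, 9.0, 1.0, 7.5]
print(assign_thirds(sample))

# With 435 distinct values, each third holds 145 districts.
house = assign_thirds(list(range(435)))
print({level: house.count(level) for level in ("low", "middle", "high")})
```

A rank-based cut yields groups of 145 when there are no ties; when many districts share a value, groups of slightly unequal size can result, depending on how ties are assigned.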

In Table 3 we classify House members by these 
constituency characteristics. Clearly, the two commit- 
tees are dominated by representatives from farming 
and rural districts, and they differ considerably in these 
respects from noncommittee members. On the Agri- 
culture Committee, three-quarters of the members fall 
into the high farm employment category, and two- 
thirds represent districts with a high rural population. 
All members of this committee but one are above the 
House median for farm employment, and all but six are 
above the median for rural population. On the subcom- 
mittee, all but two members fall above the median for 
both variables. 

To demonstrate a connection between constituency 
characteristics and roll call voting, we apply these 
district classifications to votes on roll call 1544, a failed 
(distributive) amendment to limit various tobacco sup- 
port programs in the FY96 spending bill. The results in 
Table 4 indicate that House members from districts 
with high farm and rural proportions who serve on the 
two committees were most likely to oppose the amend- 
ment. Furthermore, noncommittee members from 
such districts were much more likely to vote against it, 
in contrast to noncommittee members from the low 
categories of each variable, who voted overwhelmingly 
to cut these funds. Similar results were obtained in 


TABLE 3. Distribution of House Districts by Level of Farm Employment and Rural Population 

                                 Farm Employment         Rural Population 
District Levelᵃ                  Number   Percentage     Number   Percentage 
All Members 
  Low                            139      32.0           145      33.3 
  Middle                         153      35.2           145      33.3 
  High                           143      32.9           145      33.3 
Agriculture Committee 
  Low                            —        —              1        2.1 
  Middle                         11       22.9           15       31.3 
  High                           37       77.1           32       66.7 
Appropriations Subcommittee 
  Low                            —        —              —        — 
  Middle                         4        36.4           7        63.6 
  High                           7        63.6           4        36.4 
Noncommittee Membersᵇ 
  Low                            139      37.0           144      38.3 
  Middle                         138      36.7           123      32.7 
  High                           99       26.3           109      29.0 

ᵃDivision into approximately equal thirds is based upon the values obtained for the farm employment and rural population variables. See Appendix B. 
ᵇRepresentatives who are not members of either the Agriculture Committee or the agriculture appropriations subcommittee. 


other distributive votes. The mean percentage of 
House members who voted to reduce distributive tobacco 
programs was more than twice as high for the 
low category as for the high category. On the partisan 
votes a majority of both parties at each level of 
farm and rural characteristics supported the party 
position. The highest mean absolute percentage differ- 
ences among Republicans for these levels of farm and 
rural district characteristics are only 11.3% and 9%, 
respectively, while for Democrats they are 1.4% and 
0.9%. 

This analysis is consistent with the self-selection 
hypothesis, that is, high demanders of benefits attempt 
to secure a seat on committees with a direct connection 
to district concerns (Adler 2000; Shepsle 1978; Wein- 
gast and Marshall 1988). In terms of the homogeneity 
of committee members and the dissimilarity of their 
behavior from colleagues on the floor—the two criteria 
Krehbiel (1990, 153) claims are necessary for outlier 
status—our results on distributive issues disagree with 
his interpretation concerning the Agriculture Commit- 
tee. It is central to our analysis to note, however, that 
this conclusion applies only to distributive issues. On 
partisan matters, differences between committee mem- 
bers and nonmembers are negligible. Thus, the Agri- 
culture Committee is both representative and unrepre- 
sentative of the House membership, depending on the 
type of issue involved. 

Two further points can be made. First, the two
committees are largely different from the rest of the 
House in terms of district characteristics as well as 
interest in agriculture policy. Rather than being a 
heterogeneous mix, they are drawn primarily from 
districts that are substantially more oriented toward 

TABLE 4. Voting Behavior on Amendment to FY96 Bill to Prohibit Funds for Tobacco Program,
Roll Call 1544, by Level of District Characteristics


Farm Employment 
Number Voting 


District Level 
All Members 
Low 
Middle 
High 


Agriculture Committee 
Low 


Middle 
High 


Appropriations Subcommittee 
Low 
Middle 
High 


Noncommittee Members(b)

Low 36 
Middle 78 
High 82 





Rural Population 
Number Voting 
Yea Nay


95 
70 
34 


25.0 
28.6 


72.3 A 
43.1 56 
36.1 28 79 


69.6 
54.1 
26.2 


(a) Division into approximately equal thirds is based upon the values obtained for the farm employment and rural population variables. See Appendix B for data sources.
(b) Representatives who are not members of either the Agriculture Committee or the agriculture appropriations subcommittee.


agricultural concerns than the rest of the House and, 
therefore, are likely to be high demanders of agricul- 
tural benefits for their constituents. Second, members 
of these committees behave most similarly to the subset 
of noncommittee members who are elected from 
heavily rural and farm-oriented districts. 


Multivariate Logit Analysis 


District characteristics tell us something about the 
source of committee members’ interest in agriculture 
policy. Members secure a place on committees that
present a vital link to their constituents, as reflected
by their voting on certain distributive issues.
Yet, a more thorough analysis of multidimensionality 
requires us to consider district characteristics in com- 
bination with other variables. To address these issues, 
we turn to multivariate logit analysis of several farm bill 
roll calls.16 We examine two distributive votes, one 
partisan vote, and (for comparison) the vote on final 
passage of the farm bill. 

The dichotomous dependent variable for each esti- 
mation is a yea (coded as 1) or nay vote on the roll call 
(Aldrich and Nelson 1984). A number of independent 
variables, based on our theoretical argument, may 
influence floor voting on these roll calls. We already 
have discussed several: Agriculture Committee Member- 
ship, Appropriations Subcommittee Membership, farm 
employment, rural population, and Party. Another is 
South, since many agricultural commodities are grown 
primarily in that region. It also seems appropriate to 

16 Similar logit analyses were performed on roll calls for the FY96
and FY97 bills, and the results comport with those presented here.
See Hurwitz, Moiles, and Rohde 1997.

consider Republican Freshmen, because the incoming
GOP group was “long on size and cohesion” regarding 
its collective stance on a conservative revolution 
(Fenno 1997, 25). As structured, each logit estimation 
is significant beyond the .0001 level. 
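
The overall fit statistics reported for each estimation (log likelihood, likelihood-ratio chi-square, and pseudo R²) are linked by simple identities, which the following Python sketch illustrates. It assumes that the pseudo R² is McFadden's measure, which the text does not state, and it uses the rounded figures reported in Table 5 for the peanut amendment and for final passage. The function names are hypothetical.

```python
def null_log_likelihood(log_lik, lr_chi2):
    """Recover the intercept-only log likelihood from LR = 2 * (LL - LL0)."""
    return log_lik - lr_chi2 / 2.0


def mcfadden_pseudo_r2(log_lik, lr_chi2):
    """McFadden's pseudo R-squared, 1 - LL / LL0 (an assumed definition)."""
    return 1.0 - log_lik / null_log_likelihood(log_lik, lr_chi2)


# Peanut amendment (roll call 2034), as reported in Table 5.
peanut_ll = -210.870
peanut_lr = 161.87
peanut_ll0 = null_log_likelihood(peanut_ll, peanut_lr)
peanut_r2 = mcfadden_pseudo_r2(peanut_ll, peanut_lr)

# Passage of the farm bill (roll call 2042), as reported in Table 5.
passage_ll = -154.582
passage_lr = 248.50
passage_r2 = mcfadden_pseudo_r2(passage_ll, passage_lr)
```

Under this assumption the two values come out at roughly .277 and .446. Because the inputs are rounded, small discrepancies in the third decimal place are possible for other roll calls.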

The logit results are presented in Table 5. Party has 
a substantial effect on voting behavior. The estimate 
for the party coefficient is strongly significant for all the 
roll calls, with the sign in the expected direction for the 
partisan vote.17 For substantive interpretation of the 
logit coefficients, we also calculated predicted proba- 
bilities, the results of which are reported in Table 6. 
The greatest effect of party came, as expected, on the 
(partisan) vote for the Democratic substitute. Holding 
all variables at their respective modal or mean value, 
the probability of a Democrat voting for the substitute 
was .87, compared to a minuscule .001 for a Republi- 
can. A particularly strong influence also was found for 
final passage of the bill; for Democrats the probability 
of support was .19, and for Republicans, .90. The 
influence of party is less strong on the distributive 
amendments, although the difference in probabilities 
remains substantial. Status as a freshman Republican 
had no material influence on any of the roll calls. 
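
To see how a logit coefficient translates into the predicted probabilities discussed above, one can move between the probability scale and the log-odds scale. The following Python sketch is illustrative only: it starts from a rounded probability reported in the text, adds the rounded party coefficient from Table 5, and therefore reproduces the reported figures only approximately. The function names are hypothetical, and the calculations behind Table 6 may have been carried out differently.

```python
import math


def logit(p):
    """Log-odds of a probability strictly between 0 and 1."""
    return math.log(p / (1.0 - p))


def inv_logit(x):
    """Probability corresponding to a value of the linear predictor."""
    return 1.0 / (1.0 + math.exp(-x))


def shift_probability(p, coefficient, change=1.0):
    """Probability after the linear predictor moves by coefficient * change."""
    return inv_logit(logit(p) + coefficient * change)


# Democratic substitute (roll call 2040). Party is coded 1 for Republican,
# so moving from a Democrat to a Republican adds the party coefficient.
p_democrat_substitute = 0.87
party_coef_substitute = -8.653
p_republican_substitute = shift_probability(
    p_democrat_substitute, party_coef_substitute
)

# Passage of the farm bill (roll call 2042).
p_democrat_passage = 0.19
party_coef_passage = 3.638
p_republican_passage = shift_probability(p_democrat_passage, party_coef_passage)
```

Under these assumptions the sketch gives about .001 for a Republican on the substitute and about .90 on final passage, in line with the probabilities reported in the text.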

Membership on the Agriculture Committee is a 
powerful influence (in the anticipated direction) on 
voting on the distributive roll calls (Table 5), and the 
simulated probabilities provide additional evidence of 
the effect (Table 6). For example, committee member- 
ship reduced the probability of voting to phase out the 
peanut and sugar programs by .35 (.74 - .39) and .54


17 There is no a priori expectation for significance or direction of 
party on the distributive votes. 




TABLE 5. Logit Analysis of Selected Farm Bill Roll Calls

                               Peanut             Sugar              Democratic         Passage of
                               Amendment,         Amendment,         Substitute,        Farm Bill,
                               Roll Call 2034     Roll Call 2035     Roll Call 2040     Roll Call 2042
Variable                       (distributive)     (distributive)     (partisan)
Party(a)                       .805** (.277)      1.146** (.265)     -8.653** (1.262)   3.638** (.385)
Republican Freshmen            .598 (.357)        .410 (.343)        (b)                -.059 (.528)
Agriculture Committee          -1.502* (.523)     -2.53* (.768)      1.258 (1.484)      -.133 (.547)
Appropriations Subcommittee    -1.791* (.845)     .179 (.682)        1.523 (2.341)      1.199 (1.020)
South                          -2.208** (.283)    -1.060** (.254)    1.127 (.854)       1.586** (.338)
Farm Employment                -.169* (.078)      -.503** (.100)     .050 (.198)        -.00002 (.078)
Rural Population               -.021** (.007)     .004 (.007)        .027 (.017)        .026** (.008)
Constant                       1.196** (.225)     .775* (.213)       1.083** (.304)     -2.127* (.295)
N                              421                425                421                425
Log likelihood                 -210.870           -228.199           -71.804            -154.582
LR χ²                          161.87             132.59             418.40             248.50
Pseudo R²                      .277               .225               .745               .446

Note: The dependent variable is coded 1 for a yea vote on a particular roll call. Cell entries are logit coefficients; standard errors are in parentheses. *p ≤ .05, **p ≤ .01.
(a) Coded 1 for Republican, 0 otherwise; see Appendix B for coding details.
(b) Dropped due to lack of variance, since Republicans were nearly unanimous in their opposition to the Democratic substitute (1 for, 233 against).


(.68 - .14), respectively, from that for noncommittee
members. Significant effects also emerge for subcom- 
mittee members on the peanut vote, where the proba- 
bility of voting to protect the program was reduced to 
.24 from the baseline of .74; analogous results were not 
found on the sugar roll call. Yet, membership on these 


















Party(b)


Republican Freshmen 
Agriculture Committee 
Appropriations Subcommittee 


South 





Rural Population 
Note: Cell entries are


Variable
Baseline Probability(a)


Farm Employment 


Amendment, 
Roll Call 2034 


(distributive)


.74




82 


fa 


.64 


Sugar 


Amendment, 
Roll Call 2035 
(distributive) 


.68 


Democratic 
Substitute, 
Roll Call 2040 
(partisan) 
.001 


.001 .002 


predicted probabilities of voting yea when each variable of interest is set as indicated in the left-hand column, that is, 0 or 1 for the
dichotomous variables and +/- one standard deviation of their mean values for the continuous variables, with all other dichotomous variables set at their
modal values, and continuous variables set at their mean values.
(a) The baseline probability is that for a member who is a Republican but not a freshman, not a member of either committee, does not represent a southern


(b) Coded 1 for Republican, 0 otherwise; see Appendix B for coding details.
(c) Dropped due to lack of variance, since Republicans were nearly unanimous in their opposition to the Democratic substitute (1 for, 233 against).







committees had no discernable influence on the non- 
distributive roll calls. 

Among district variables, both farm employment 
and rural population proved (as expected) to be 
important for House members on the peanut roll call, 
although only farm employment had a substantial 


TABLE 6. Predicted Probabilities of Voting Yea on Selected Farm Bill Roll Calls
Peanut


Passage of Farm
Bill, Roll Call
2042


.80 


83 94 
influence on the vote to limit the sugar program. The
predicted probabilities of voting to eliminate the pro- 
grams generally decreased when the high category of 
farm and rural district characteristics were involved. 
The South variable had a rather impressive influence 
when the distributive programs in that region were 
threatened. South is strongly significant in the logit 
estimations, and the predicted probabilities demon- 
strate that a member from a southern district was much 
more likely to protect these distributive programs than 
a nonsoutherner. Also, representing a southern district 
proved to be a material influence in passage of the final 
bill. 

Our logit analysis supports several conclusions about 
influences on voting behavior. First, party unequivo- 
cally is important, whether the issue is distributive or 
partisan, but it tends to become less important when 
other (in this analysis, regional) dimensions arise. 
Second, membership on the Agriculture Committee 
considerably increases the probability of voting to 
protect distributive programs within the committee’s 
jurisdiction. Moreover, particular district characteris- 
tics, such as southern region, farm employment, and 
rural population density, also increase the probability
of protecting distributive agriculture programs, but the 
latter two have no effect when partisan issues are 
considered. Finally, we would suggest that region mat- 
ters primarily for distributive issues with a well-defined 
regional benefit; in particular, our analysis demon- 
strates that the South rose to defend the peanut and 
sugar programs.


CONCLUSION 


The results of the empirical analysis provide support 
for all the main points of our theoretical argument. We 
expected that multiple issue dimensions would be 
revealed in the roll calls, and that some of those 
dimensions would be characterized by partisan voting, 
whereas others would reflect distributive interests. The 
two roll calls on the farm bill that we examined in detail 
demonstrate exactly this pattern, and the results on all 
the other partisan and distributive amendments are 
consistent with those results. We also expected to 
observe a specific pattern in the voting related to 
committee membership: As high demanders, members 
of the agriculture committees should be much more 
supportive of distributive policies than nonmembers. 
On these issues, the committee would be unrepresen- 
tative of the House. This expectation was also sup- 
ported. On all the individual distributive amendments, 
and in the aggregate, members of the two agriculture 
committees were substantially more supportive of the 
programs than were nonmembers. Also as expected, on 
partisan issues there was no similar difference in the 
vote choices of committee members and nonmembers. 

Our theory contends that both committee member- 
ship and voting on distributive issues will be strongly 
related to the degree of agricultural interests in a 
member’s district. The findings on all distributive votes
were consistent with these predictions. We also ex- 
pected that the predicted relationships would all still 
hold simultaneously in a multivariate analysis that
included other relevant variables. This proved to be the 
case. In particular, it is worth noting that the signifi- 
cantly greater support of committee members for 
distributive policies held even when we controlled for 
the agricultural nature of their district.

This study indicates that all three of the theoretical 
perspectives we discuss can explain some aspects of 
congressional behavior and organization, but none 
offers a complete account by itself. Partisan theory is 
not always applicable because not all issues invoke 
partisan interests. Similarly, distributive theory does 
not speak to all aspects of legislative politics because 
only some issues are primarily distributive, and the 
analysis regarding gains from exchange across jurisdic- 
tions does not seem to apply to all policy questions. 
Finally, some of the results we present are inconsistent 
with the majoritarian expectations of informational 
theory. On distributive issues, members of the agricul- 
ture committees are unrepresentative of the House and 
are high demanders of benefits. But they are represen- 
tative of the House on other issues, as Krehbiel’s 
theory predicts. Moreover, on some distributive roll 
calls the committee members voted in the direction 
opposite to a majority of the chamber, which produced 
an outcome different from that preferred by the rest of 
the House. In addition, there were multiple issue 
dimensions with different preference patterns for each 
of three bills analyzed, contrary to a unidimensional 
perspective. 

We do not offer here a new general theory of 
legislative behavior, and that was not our purpose. 
Instead, we suggest the direction that future theorizing 
should take and provide some evidence in support of 
that suggestion. The results of this study seem consis- 
tent with the view that a multifaceted combination of 
the various theories of legislative organization, which 
specifies for each the locus and range of its applicabil- 
ity, promises a richer explanation of congressional 
behavior than any one of those theories alone. 


APPENDIX A: CLASSIFICATION OF AND 
PROPORTION OF VOTES ON 
AGRICULTURE ROLL CALLS 


Below, the name in parentheses is the amendment’s sponsor.
The numbers in parentheses are as follows: the percentage of 
all members voting “yea”; the percentage of Agriculture 
Committee members voting “yea”; and the percentage of 
agriculture appropriations subcommittee members voting 
“yea.” 


Farm Bill (HR 2854) Roll Calls 


2033 (Chabot): distributive (39.8; 6.3; 0.0) 
2034 (Shays): distributive (49.6; 12.5; 18.2)
2035 (Miller): distributive (48.9; 4.2; 45.5)
2036 (Solomon): too complex (61.1; 38.3; 50.0)


2037 (Boehlert): too complex (91.0; 98.4; 90.9)
2038 ley): too complex (38.5; 40.4; 27.3) 
2039 (Foley): partisan (70.6; 59.6; 63.6) 


2040 (Stenholm): partisan (38.7; 41.3; 36.4) 
2041 (recommit): not theoretically relevant (36.9; 40.4; 36.4) 
2042 (passage): (63.5; 70.2; 81.8)
(instruct conferees): not theoretically relevant (100; 100; 100)

2107 (passage of conference report): (78.1; 83.0; 90.9) 


FY96 Appropriations Bill (HR 1976) Roll 
Calls 


1535 (Skeen): partisan (58.1; 68.1; 72.7) 
1538 (Walsh): not theoretically relevant (100; 100; 100) 
1539 (Allard): partisan (45.8; 45.8; 18.2) 
1540 (Castle): too complex (22.4; 18.8; 9.1) 
1541 (Sanders): too complex (16.4; 0.0; 9.1) 
1542 (perfecting): not theoretically relevant (54.4; 57.4; 63.6) 
1543 ): partisan (65.7; 48.9; 36.4) 
1544 in): distributive (47.2; 17.0; 27.3) 
1545 ): distributive (38.8; 7.0; 22.2) 
1547 (Hoke): distributive (19.7; 2.2; 0.0) 
1548 (Sanford): distributive (47.7; 32.6; 0.0) 
1549 ae too complex (40.5; 8.7; 9.1) 
1550 VAERE distributive (37.1; 2.1; 0.0) 
1551 (Obey): aa (43.5; 6.7; 9.1)
1552 (Kennedy): not theoretically relevant (32.7; 13.3; 0.0)
oo utsch): distributive (59.2; 20.0; 18.2) 
): (80.1; 93.3; 100) 
Ta (recommit): not theoretically relevant (37.4; 28.3; 27.3) 
1708 (passage of conference report): (68.6; 68.9; 90.9) 


FY97 Appropriations Bill (HR 3603) Roll
Calls 


2230 (DeFazio): distributive (33.3; 4.3; 0.0) 
2231 (Kolbe): distributive (44.7; 12.8; 0.0) 
2232 (Kennedy): not theoretically relevant (31.6; 10.6; 0.0)
2233 (Durbin): distributive (49.8; 12.8; 18.2) 
2234 (passage): (82.6; 78.7; 100) 
2387 (passage of conference report): (90.0; 97.9; 100)


APPENDIX B: CODING OF VARIABLES 


Party: 1 if Republican; 0 otherwise.18

Agriculture Committee: 1 if member of House Agriculture 
Committee; 0 otherwise.

Appropriations Subcommittee: 1 if member of House agri- 
culture appropriations subcommittee; 0 otherwise. 

Republican Freshmen: 1 if a Republican first elected in 1994; 
0 otherwise. 

South: 1 if from any of the following states: AL, AR, FL, GA, 
KY, LA, MS, NC, OK, SC, TN, TX, VA; 0 otherwise. 
Farm Employment: Percentage of district workers age 16 and 
older employed in farming, forestry, and fishing occupa- 
tions (Source: CQ’s Congressional Districts in the 1990s, 

from U.S. Census data). 

Rural Population: Percentage of district residents living 
outside urban areas (defined as places with at least 50,000 
people) or in places of 2,500 or more outside urbanized 
areas (Source: CQ’s Congressional Districts in the 1990s, 
from U.S. Census data). 

Roll Calls: In the roll call numbers, the first digit marks the 
session in which the vote was held, and the last three are 
the official roll call number as reported in the Congressional 


18 Rep. Bernard Sanders (I-VT) is coded as a Democrat. Because his 
committee assignments are made as if he were, and he caucuses with 
that party, scholars ordinarily code him as such (see, e.g., Cameron, 
Epstein, and O’Halloran 1996). 
Record. 1 if “yea”; 0 if “nay”; votes of “present” were
dropped. 
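
The coding rules above can be sketched as follows. This is an illustrative Python fragment, not the code used for the analysis: the function names and the party label "R" are hypothetical, and treating every vote other than "yea" or "nay" as dropped is an assumption that goes slightly beyond the text, which mentions only votes of "present".

```python
SOUTHERN_STATES = {
    "AL", "AR", "FL", "GA", "KY", "LA", "MS",
    "NC", "OK", "SC", "TN", "TX", "VA",
}


def code_south(state):
    """1 if the member's state is on the Appendix B list; 0 otherwise."""
    return 1 if state.upper() in SOUTHERN_STATES else 0


def code_party(party):
    """1 if Republican; 0 otherwise (the label "R" is hypothetical)."""
    return 1 if party == "R" else 0


def code_vote(vote):
    """1 for yea, 0 for nay; None marks a vote dropped from the analysis."""
    cleaned = vote.strip().lower()
    if cleaned == "yea":
        return 1
    if cleaned == "nay":
        return 0
    return None


def split_roll_call(number):
    """Split a four-digit roll call number into (session, official number):
    the first digit is the session, the last three the official number."""
    return number // 1000, number % 1000


peanut_roll_call = split_roll_call(2034)
tobacco_roll_call = split_roll_call(1544)
```

For example, roll call 2034 decomposes into session 2 and official roll call number 34.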
As a system of thought, pessimism is often assumed to be too deterministic or self-contradictory to
withstand serious scrutiny. I examine Nietzsche’s use of the term “Dionysian pessimism” to describe 
his own philosophy in order to challenge these presumptions. Nietzsche was quite critical of the 
pessimistic philosophers popular in his day, but he nonetheless considered his own work to be a kind of 
pessimism, which he meant not as a psychological characterization but as a philosophical one. Nietzsche's
Dionysian pessimism is a perspective on life that can draw sustenance, rather than recoil, from the

disordered, disenchanted world left to us after the demise of metaphysics. Whereas Schopenhauer advocated 
resignation, Nietzsche maintained that a new ground for activity could be found apart from the narratives 
of reason and progress. Dionysian pessimism is an answer to those who characterize Nietzsche's philosophy, 
and pessimism more generally, as passive or suicidal modes of thought. 


That there still could be an altogether different kind of 
pessimism, ... this premonition and vision belongs to me 
as inseparable from me, as my proprium and ipsissimum....
I call this pessimism of the future—for it comes!
I see it coming!—Dionysian pessimism. 
—The Gay Science 


Is it possible to draw a positive conclusion from
pessimism, or is such an attitude simply a confusion
or a contradiction in terms? It is often maintained
that pessimism cannot be taken seriously as a philosophy
because it leads nowhere, that is, it leads to
hopelessness or resignation.1 One could reply that the
consequences of a philosophy cannot affect its truth or
falsehood. We cannot reject pessimism because we do
not like where it takes us. On the contrary, we should
be on guard against any tendency to suppress a line of
thought because its conclusion repels us. Yet, the
eminently logical quality of this reply may deflect our
attention from another and more satisfying sort of
response, one that defends pessimism in particular as
opposed to disagreeable thoughts generally. We must
not simply grant that pessimism leads to a posture of
resignation. The epigraph from Nietzsche suggests that
another result from pessimism is possible or, indeed,
that there is another sort of pessimism altogether, the
conclusions of which do not lead inevitably to despair.

1 See, for example, Bertrand Russell’s (1945, 753-9) brief dismissal
of Schopenhauer.

Like the idea of progress and the various philosophies
to which it gave rise, pessimism is a modern
phenomenon. As many historians have noted, the
linear sense of historical time that emerged in the
early-modern period made entirely new sorts of political
thought possible (Koselleck 1985; Pocock 1975).
Although this new sense of time did not, of course, 
produce a philosophy by itself, it did provide an 
underlying intellectual structure that allowed new ideas 
to be built atop it, and it made those ideas feel more 
plausible when once proposed. It has been said many 
times that the idea of progress is something modern. 
One could offer many qualifications to this platitude,2
but I will accept its general validity and simply point 
out that the change in European time-consciousness 
did not authorize only the idea of progress. Pessimism, 
too, is one of its progeny, the hidden twin of progress 
in modern political thought. 

What is surprising in standard intellectual histories is 
how rapidly the idea of linearity is assimilated to the 
idea of progress, as if progress and stasis are the only 
two choices available to political philosophy. The word 
pessimism came into widespread use only in the nine- 
teenth century, but it clearly names a persistent 
thought or set of thoughts that has recurred often in 
social and political theory since the Enlightenment in 
tandem with its opposite. Both terms appear only after 
the thoughts they reflect are already in play. Leibniz 
first used “optimum” as a correlate to “maximum” and 
“minimum” in his Théodicée of 1710 (1985). French 
writers then began to refer to his doctrine as one of 
optimisme. The term apparently crosses into English
with the popularity of Voltaire’s Candide ou Optimisme
of 1759 (1992).

The first known printed appearance of “pessimism”
in English followed a few decades later, although the
context seems to indicate that the term was already in 
use.3 Philosophically, however, the emergence of pes- 
simism may be dated to 1750 (1964) and the appear- 
ance of Rousseau’s Discourse on the Arts and Sciences, 
with its characterization of modern man as a moral 
degenerate. Rousseau’s ideas were seconded, in the 
early nineteenth century, in such works as Leopardi’s 


2 The most common qualification is the claim that modern theories 
of progress are merely a secularized version of earlier Christian 
theologies of hope. The classic text is Löwith 1949. This argument
has met with strong criticisms from, among others, Blumenberg
(1983) and Pocock (1975). Although I cannot address this debate in
detail, I should note that I find these criticisms largely persuasive.
3 See pessimism in the Oxford English Dictionary.
Moral Essays ([1827] 1983), but pessimism achieved its
brief period of genuine popularity through the work of 
Schopenhauer, whose Parerga and Paralipomena went 
through many editions after its initial publication in 
1851 (1942). Thereafter, although never a dominant 
school in political theory, pessimism was a well-recognized
position for at least several generations.4

What should be clear, even from this brief history, is 
that pessimism was originally viewed as a theory rather 
than (as we often think of it today) a psychological 
disposition. Rousseau’s complicated story of material 
development coinciding with ethical devolution, for 
example, contained a great deal of human psychology 
but neither implied nor was meant to engender a 
depressive outlook. With Schopenhauer, the story is 
more complicated. As the author of such lines as “If 
the immediate and direct purpose of our life is not 
suffering then our existence is the most ill-adapted to 
its purpose in the world” (Schopenhauer 1970, 41), he 
indeed recommended withdrawal from material pur- 
suits. But his reasons for doing so were largely meta- 
physical, and he expected that such a withdrawal would 
sharply limit human unhappiness. In any case, it is our 
very modern tendency to project a philosophical posi- 
tion onto people who are simply depressed that leads 
us to confuse philosophical with psychological pessimism.
The question of the implications of pessimism
cannot be settled by studying the effects of unhappiness;
they need to be addressed at the theoretical level
at which they arose.5 Happiness and unhappiness are 
universal phenomena, whereas pessimism, like the 
theories of progress to which it is opposed, is a modern 
idea. 

Nietzsche’s relationship to the pessimists who pre- 
ceded him was hardly one of uniform celebration. He 
called Rousseau a “moral tarantula,” and although 
initially inspired by Schopenhauer’s philosophy, he 
eventually dissociated himself from its systematic con- 
clusions (but retained a respect for its critical spirit). 
Nietzsche was also unkind toward the pessimists pop- 
ular in the Germany of his day, especially Eduard von 
Hartmann, who held the chair of philosophy in Berlin; 
Nietzsche called him “completely abysmal” (BGE 
204).6 As discussed below, Nietzsche believed that the 


4 In the nineteenth century, examples are Eduard von Hartmann and
then Hippolyte Taine; in the twentieth, Weber, Adorno, Camus,
Cioran, and so on. Relaxing one’s definitions a bit permits a much
longer list (including such figures as Freud, Heidegger, Unamuno,
and Sartre) to be generated. I cannot take up here the question of the 


the idea of progress. See the “Greek Pessimism” section below. 

5 This has not stopped interpreters from attributing Schopenhauer’s
pessimism, for example, to an unhappy childhood. See the introduction
by Hollingdale to Schopenhauer 1970.

6 Nietzsche references will use the following system: AC = The 
Anti-Christ (1968); AOM = Assorted Opinions and Maxims (1986);
BGE = Beyond Good and Evil (1966a); BT = The Birth of Tragedy 
(1967b); D = Daybreak (1982); EH = Ecce Homo (1967a); HH = 


(1984); and WP = The Will to Power (1967c). Numbers refer to 
pessimism of both Hartmann and Schopenhauer led
directly to nihilism. Indeed, the very popularity of this 
form of pessimism in the late nineteenth century was 
one reason Nietzsche believed nihilism would soon 
enjoy a temporary dominance of European society. 

Intermixed with his critique, however, is an account 
of another kind of pessimism. Nietzsche viewed it as 
distinct from the popular one and called it “that 
courageous pessimism that is . . . the way to ‘myself,’ to 
my task” (AOM, Preface, 4). This alternative grew both 
from the “pessimism” of the pre-Socratic Greeks (as he 
called it in The Birth of Tragedy) and from Schopen- 
hauer’s philosophy, but Nietzsche distinguished sharply 
between what he frequently called “my pessimism” and 
those that preceded it. Ultimately, he gave his alternative
the name “Dionysian pessimism” (GS 370).7 The aim of
this essay is to discover what exactly Nietzsche meant by
this term and what appeal it may still have. In
part, this is a project of disentanglement, since Nietz- 
sche made many references to “pessimism” without 
always indicating which variety he was talking about. 
When these references are viewed as a whole, however, 
clear patterns begin to emerge. Indeed, Nietzsche 
speaks of many types of pessimism, “the unclear word,” 
only one of which he can embrace (KGW 8.1.129; see 
WP 38). 

“Pessimism,” by itself, is not a very specific term to 
Nietzsche, and this is not surprising. The late nine- 
teenth century was the one period in which pessimism 
enjoyed wide respectability, if not allegiance, in popular
and intellectual discussions. The term was used by


and applied to a wide spectrum of authors in an 
indiscriminate way. Nietzsche’s notes in the 1880s 
contain several lists of the various types of pessimism. 
Whether he composed these lists simply to distinguish 
among the possible varieties or because he planned to 
write about them in sequence is unclear. One list reads, 
in part: “Russian pessimism. Tolstoi, Dostoevsky / 
aesthetic pessimism l’art pour l’art ‘description’ / romantic
and antiromantic pessimism / epistemological
pessimism. / Schopenhauer. “Phenomenalism.” / anar- 
chistic pessimism,” and so on, down to an entry for 
“moralistic pessimism,” which Nietzsche identifies with 
himself (KGW 8.2.73-4).9

Nietzsche did not address all these varieties of 


Nietzsche’s numbered sections or, if there are none, to page numbers
in the editions listed in the references. For KGW = Werke: Kritische
Gesamtausgabe (1967d), numbers refer to volume, book, and page.
ie ee oe 
often draw on those in WP and other published versions when these 
exist. All emphases are original unless otherwise noted.

7 Nietzsche also occasionally spoke of “the pessimism of strength” 
(KGW 8.2.133), which, as we shall see, has a parallel but not quite 
identical meaning.

8 One interesting account of this, with many useful citations to the
contemporary literature, appears in Dale 1989, chaps. 9-10. 

9 A version of this jotting appears in The Will to Power as “aphorism”
82. The editors inserted a variety of punctuation marks in order to
clarify and associate various lines with one another; all this punctuation
is suppositional, and some of it appears mistaken (apart from
the obvious awkwardness involved in pretending that a list is an
aphorism). In Nietzsche’s notebook, but not in WP, the note goes on 
to list other topics that apparently are to be considered related, such 
as “nationalism / industrial competition / science.” To the best of my 
pessimism in depth, but when he did consider them, he
was careful to distinguish them from their
relatives. From his perspective, what they had in common
was their practice of rejection and denigration or,
in his vocabulary, “no-saying.” But the object and 
means of no-saying were far more important to Nietz- 
sche than the negativity itself. After another such list of 
pessimisms (“of sensibility,...of ‘unfree will’,... of 
doubt”), he sets out a clarification: “What must not be 
confused with all this: pleasure in saying No and doing 
No out of a tremendous strength and tension derived 
from saying Yes... the Dionysian in will, spirit, taste” 
(WP 1020). Dionysian pessimism, then, although it too 
is a no-saying and related to the others, is explicitly set 
off from them. For Nietzsche it is a philosophy of 
personal conduct, a suggestion of how to manage the 
human condition and cope with the basic problems of 
existence. Far from being a psychological disposition, it 
is a set of practices intended to guide an individual 
through the chaotic and disenchanted world in which 
we find ourselves. 

A great deal has been written on the “art of living” 
that Nietzsche prescribes as a kind of substitute for 
ethics.10 This literature points us in the right direction
but misses something crucial by ignoring Nietzsche’s 
self-characterization as a kind of pessimist. The impli- 
cations of that label will be explored below, but they 
can be prefaced as follows: Pessimism has a particular 
understanding of the burdens of the human condition 
that these interpreters have not fully acknowledged. 
For Nietzsche, the time-bound character of our exis- 
tence forms the basic problematic (and sets limits to 
the possibilities) of any life-practice that he can recom- 
mend. 

Nietzsche’s Dionysian pessimism is a crystallization 
of ideas that takes place relatively late in his philosoph- 
ical growth, but the term had many precursors in his 
earlier periods. It is best understood by tracing the 
development of Nietzsche’s thoughts on pessimism 
and, relatedly, the process by which he disentangled his 
own thinking from that of Schopenhauer.11 Nietzsche 
began by quoting Schopenhauer uncritically, but he 
ended by proclaiming his views to be the opposite of 


knowledge, Nietzsche does not refer to his own pessimism as 
“moralistic” elsewhere. 

10 The phrase “art of living” is from Hadot (1995, 272), who used it 
as a description of the intended goal of ancient philosophy, “an 
exercise practiced at each instant.” See also Nehamas (1985, 1998), 
who applies the phrase to Nietzsche, as well as Thiele 1992, Strong 
1988, Rorty 1989, Orlie 1997, and Connolly 1991. Foucault (1986) 
credits Hadot for inspiring his Nietzschean search for “techniques of 
the self” in ancient texts, but Hadot (1995, 206-13) politely declines 
to equate his interpretation with that of Foucault. This whole strand 
of interpretation is strongly criticized by those who believe Nietzsche 
meant to offer philosophical truths (although very novel ones) in the 
traditional meaning of that term. See, e.g., Appel 1999, Berkowitz, 
Clark 1990, and Leiter 1994. I cannot address this controversy in any 
detail, but my interpretation demonstrates an affinity with (and 
perhaps presents further evidence for) the view of the first group. 
11 I cannot accept the view that Nietzsche’s writings, from first to last, 
are all of a piece. In exploring the early writings, I attempt to identify 
themes that, although they have rivals at the time, later become 
dominant. I accept, in broad outline at least, the division of Ni- 
etzsche’s work into an early, middle, and late period as proposed by 
Warren 1988. 
Schopenhauer’s, although he still called Dionysian 
pessimism his “quintessence.” While Nietzsche is more 
often labeled a nihilist than a pessimist, the crime of 
inciting resignation or apathy is one he is often charged 
with along with such acknowledged pessimists as 
Schopenhauer. If this charge turns out to be false, then 
it must change our opinion of both Nietzsche’s political 
theory and pessimism more generally. 

Properly understood, pessimism is not simply an 
important element of Nietzsche’s philosophy but a 
tradition whose strength and relevance has been over- 
looked. In the right hands, pessimism can be—and has 
been—an energizing and even a liberating philosophy. 
It does indeed ask us to limit and eliminate some of our 
hopes and expectations, but it can also provide the 
means to navigate the bounded universe it describes. 
An entire literature, both scholarly and popular, is 
devoted to blaming pessimism for whatever spiritual 
crisis is thought to occupy us at the moment.12 Indeed, 
it is such a flexible term of abuse that it has readily 
been applied to almost every critical social theory of 
the twentieth century. Existentialism, critical theory, 
and postmodernism are regularly labeled pessimistic, 
as if doing so were enough to discredit them. The term 
is more appropriate in some of these cases than others, 
but some thought should be given to why the label 
functions so well as a gesture of dismissal. 

Critics often mistake a depiction of the world for a 
choice about our future, as if philosophers rejoice at 
the decline or decay they describe. This is akin to 
deriding scientists who warn of global warming because 
their models give apocalyptic predictions. Is it sensible 
to assume that the scientists want their predictions to 
come true? If the pessimists are right, it is the world 
that threatens us, not the writers who describe it. Yet, 
rather than address the threats to happiness that the 
world daily provides, critics of pessimism focus instead 
on the bearers of ill tidings and hope that, in dismissing 
them, they will eliminate the message as well. Yet, 
despite the instant unpopularity they accrue, pessimists 
keep appearing, I suspect because the world keeps 
delivering the bad news. Rather than blame pessimism, 
perhaps we should study it. Rather than hide from the 
ugliness of the world, perhaps we should learn to 
withstand it. Nietzsche took it as his task to find a way 
to live with the conclusions at which he had arrived, 
and to live well, sometimes even joyfully. One can 
debate the degree to which he succeeded, but under- 
standing his pessimism must reorient our approach to 
all those pessimisms that followed in his wake. 


GREEK PESSIMISM 


Nietzsche first wrote of pessimism and its connection to 
the Dionysian in The Birth of Tragedy. Although it was 
certainly anachronistic to apply the term to the ancient 
Greeks (the word did not exist before the early nine- 
teenth century), his use of it here explains a great deal 


12 Some recent representative titles are Enemies of Hope: A Critique 
of Contemporary Pessimism (Tallis 1999) and The Future and Its 
Enemies (Postrel 1998). 
about its meaning in his later work. Its appearance in 
this early work may be thought to be due principally to 
the influence of Schopenhauer, but the Greek version 
Nietzsche claims to identify is in many ways distinct 
from Schopenhauer’s.13 In retrospect, Nietzsche recog- 
nized this. In the “Attempt at a Self-Criticism,” which 
he added to the book upon its republication, he 
lamented that he had “obscured and spoiled Dionysian 
premonitions with Schopenhauerian formulations” 
(BT, “ASC,” 6).14 But this retrospective judgment did 
not, as might be expected, lead him to alter the 
characterization of the pre-Socratic Greeks as pessi- 
mists. That was not the “Schopenhauerian formula- 
tion” he had in mind. Indeed, in the 1886 edition he 
added the subtitle “Hellenism and Pessimism” to the 
work and emphasized in his new introduction that what 
he still approved of in the book was its examination of 
“the good, severe will of the older Greeks to pessi- 
mism” and its contrast with the “optimism” initiated 
around the time of Socrates (BT, “ASC,” 4). 

The mistake that the new introduction identifies is 
the confusion of Greek pessimism with the Schopen- 
hauerian variety. The Dionysian pessimism of which 
Nietzsche had a premonition through his exploration 
of the Greeks was obscured at first by his equation of it 
with Schopenhauer’s philosophy. Later, the ancient 
Greeks were still viewed as pessimists but were simply 
another kind, as was Nietzsche himself. In The Birth of 
Tragedy, in fact, we have an early version of Nietzsche’s 
own pessimism. Greek pessimism is not the same as 
that of Schopenhauer or Nietzsche, but it is an instruc- 
tive model, both to Nietzsche, who called it “the only 
parable and parallel in history for my own innermost 
experience” (EH, “BT,” 2), and to us in our attempt to 
understand his later attachment to this term. 

The task that The Birth of Tragedy set itself was to 
explain not only the appearance of Greek tragedy but 
also its disappearance, at least in its traditional form, 
after Euripides. As is well known, Nietzsche hypothe- 
sizes that Socrates’ introduction (and Plato’s further- 
ance) of a rationalistic philosophy destroyed the pre- 
existing cultural grounds for Greek tragedy (BT 12-5). 
What did Socrates destroy, and how was this possible? 
Why, in any case, should a philosopher have the power 
to affect the theater? The answer lies in the pessimism 
that Nietzsche associates with the pre-Socratic philos- 
ophers and his belief that their ideas reflected the 
original character of early Greek culture. “Tragedy is 
the outlet of mystic-pessimistic knowledge” (KGW 
3.3.73). Pessimism was the philosophical basis for the 
plays of Aeschylus and Sophocles. This was the wisdom 


13 That Schopenhauer strongly influenced Nietzsche is generally 
uncontested by scholars, but the degree and timing of that influence 
are matters of considerable debate. For example, Kaufmann (in 
Nietzsche 1967b, 60n.) believes Nietzsche had already “broken loose 
from Schopenhauer” in The Birth of Tragedy, whereas Nehamas 
(1985, 42) believes it is precisely on the issue of tragedy that “the 
influence of Schopenhauer became dominant.” Janaway (1998, 22), 
in a judicious formulation, maintains that “the Schopenhauerian 
system hovers eerily in the background, unasserted but indispens- 
able.” My article cannot settle this debate but may make a useful 
contribution to it. 

14 The Birth of Tragedy was published in 1872 and reissued in 1886. 
that the pre-Socratics possessed and that later genera- 
tions first denied, then forgot. Socrates is the agent of 
this change because his philosophy is essentially opti- 
mistic (BT 14).15 

In the period in which he wrote The Birth, Nietzsche 
did not think of optimism and pessimism as two equal, 
if opposite, ways of looking at the world, as we might 
today; rather “pessimism . . . is older and more original 
than optimism” (KGW 4.1.208). Pessimism is the do- 
main of the Ionian philosophers who preceded 
Socrates and whose teachings we possess only in frag- 
ments. Instead of trying to construct a systematic, 
ordering philosophy, as Socrates and Plato were to do, 
the pre-Socratics grasped the chaotic and disordered 
nature of the world and only attempted to cope with it 
insofar as that was possible: “Pessimism is the conse- 
quence of knowledge of the absolute illogic of the 
world-order” (KGW 3.3.74). 

In some notes from this period, Nietzsche first 
attributes to Democritus the doctrine that “the world 
[is] without moral and aesthetic meaning” and calls this 
idea “the pessimism of accidents” (KGW 3.4.151). In 
Philosophy in the Tragic Age of the Greeks, a posthu- 
mously published essay written at about the same time 
as The Birth, he likens Anaximander to Schopenhauer 
and calls him “the first philosophical author of the 
ancients.” He goes on to describe Anaximander as a 
“true pessimist” and quotes his only extant fragment to 
justify the label: “Where the source of things is, to that 
place they must also pass away, according to necessity, 
for they must pay penance and be judged for their 
injustices, in accordance with the ordinance of Time” 
(PTG 4; see KGW 3.2.312).16 The comparison with 
Schopenhauer follows directly in Nietzsche’s text and 
emphasizes the moral quality in both of their philoso- 
phies; both view “all coming-to-be as though it were an 
illegitimate emancipation from eternal being, a wrong 
for which destruction is the only penance” (PTG 4; see 
Cartwright 1998, 122). As discussed below, Nietzsche 
later distinguished between Schopenhauer’s moralizing 
pessimism and that of the Greeks. 

In other words, the pre-Socratics, as Nietzsche inter- 
preted them, grasped the animating principle of pessi- 
mism as I have described it elsewhere (Dienstag 1999, 
85-6): Time is an unshakable burden for human beings 
because it leads to the ultimate destruction of all 
things—and this fate belies any principle of order that 
may, on the surface, appear to guide the course of 
events.17 Of course, whether any of the pre-Socratics 
would have put things this way is debatable (although 
Heraclitus, in particular, is certainly often understood 





15 A parallel analysis, but without the emphasis on pessimism, is 
offered in Strong 1988, 152f. 

16 This is a translation of Nietzsche’s German translation of the 
Greek original, which he slightly adapted to suit his own understand- 
ing. A standard English translation of the pre-Socratics renders 
Anaximander’s fragment thus: “And the source of coming-to-be for 
existing things is that into which destruction, too, happens ‘according 
to necessity; for they pay penalty and retribution to each other for 
their injustice according to the assessment of Time’” (Kirk, Raven, 
and Schofield 1983, 118). 

17 For an alternate account of pessimism see Pauen 1997. For some 
conceptual analysis, see Bailey 1988. 
in this fashion). What is important is that Nietzsche 
understood them to be doing so, that he understood 
the root of pessimism to be, as he later wrote, “time- 
sickness [Zeit-Krankheit]” (KGW 7.2.51). 

Nietzsche considered tragedy to be the outgrowth of 
this view of the world as something constantly in flux, 
constantly in the process of becoming and, thus, con- 
stantly in the process of destroying. The ravages of time 
could not be cured or compensated through tragedy, 
only understood: “Tragedy ...is in its essence pessi- 
mistic. Existence is in itself something very terrible, 
man something very foolish” (KGW 3.2.38). Nietzsche 
resists the conclusion, popular since Aristotle, that 
tragedy offers some kind of purification of the emo- 
tions generated by the terrible truths of the human 
condition (TI, “What I Owe,” 5; WP 851). He also 
rejects the idea that tragedies contain some sort of 
moral lesson meant to instruct us in ethical behavior. 
Instead, he argues, tragedy simply serves to lay bare for 
us the horrible situation of human existence that the 
pre-Socratic philosophers describe, a situation from 
which our minds would otherwise flee: 


The hero of tragedy does not prove himself ... in a 
struggle against fate, just as little does he suffer what he 
deserves. Rather, blind and with covered head, he falls to 
his ruin: and his desolate but noble burden with which he 
remains standing in the presence of this well-known world 
of terrors presses itself like a thorn in our soul (KGW 
3.2.38). 


The tragic outlook is thus generated from a base of 
pessimistic knowledge. It recommends no cure for the 
pains of existence, only a public recognition of their 
depth and power. 

From the beginning, too, this view is associated with 
the Dionysian, “the mother of the mysteries, tragedy, 
pessimism” (KGW 3.3.309). The Athenian public the- 
atrical festivals were known as the Dionysia, and 
Nietzsche goes so far as to claim the existence of a 
tradition “that Greek tragedy in its earliest form had 
for its sole theme the sufferings of Dionysus” (BT 10).18 
Dionysus, in Nietzsche’s account (which here certainly 
parallels Schopenhauer’s account of the human condi- 
tion), suffers the prototypical agonies of existence 
inflicted by time. He is severed from the eternal flux 
and individuated, then torn to pieces and reunited with 
the whole: 

This view of things already provides us with all the 
elements of a profound and pessimistic view of the world, 
together with the mystery doctrine of tragedy: the funda- 
mental knowledge of the oneness of everything existent, 
the conception of individuation as the primal cause of evil, 
and of art as the joyous hope that the spell of individuation 
may be broken in augury of a restored oneness (BT 10). 


Dionysian suffering is essentially human suffering. In 
tragedy, this is indicated by a connection between the 
various elements involved in the public performance of 
the drama. The tragic hero simply personifies the 


18 Nietzsche calls this tradition “undisputed,” which seems doubtful. 
Again, however, the accuracy of his construal of the philological 
literature and traditions is less important than how these were 
related to his own views. 


“Dionysian state” of the chorus as a whole (BT 10). 
The chorus is “the mirror-image in which the Diony- 
sian man contemplates himself” and also “a vision of 
the Dionysian mass of spectators” (BT 8). Actor, 
chorus, and public are all connected in tragedy through 
their Dionysian character (see Strong 1988, 165). Each 
is a fragment torn from the whole. Nietzsche is here 
critiquing but also reconstituting the traditional philo- 
logical stance that the chorus represents the Greek 
public. Although he sharply attacks the original propo- 
nents of this view, he in fact proposes not to reject it 
but to modify it. What he truly dislikes about the 
association in its original form is the implication that 
the connection between Athenian performers and 
spectators is somehow reflected in contemporary (i.e., 
nineteenth-century) relationships between artists and 
their public. He will only accept the connection of 
citizens and chorus on the condition that the Greek 
public is understood as a unique phenomenon, a 
“Dionysian throng,” that is, as a public already infected 
with the pessimistic wisdom of the pre-Socratics.19 
Because modern audiences no longer share this out- 
look, comparisons of the moderns to the Greeks are, to 
Nietzsche, specious. 

Against this account of pessimism and tragedy as a 
kind of Dionysian wisdom, Nietzsche counterposes 
Socratic philosophy, whose characteristic feature now 
appears to be its optimism.20 Even while proclaiming 
its ignorance, Socratic inquiry rejects the pessimistic 
idea that inquiry, like every human activity, is ulti- 
mately doomed: “For who could mistake the optimistic 
element in the nature of dialectic, which celebrates a 
triumph with every conclusion... the optimistic ele- 
ment which, having once penetrated tragedy must 
gradually overgrow its Dionysian regions and impel it 
necessarily to self-destruction” (BT 14). Socrates does 
not promise eternal happiness, but he does affirm both 
that virtue results in happiness and that virtue can be 
taught; happiness is theoretically within the grasp of all 
(BT 15).21 He denies that there is anything ultimately 
mysterious about life or inevitable about suffering: “By 
contrast with this practical pessimism, Socrates is the 
prototype of the theoretical optimist who, with his faith 
that the nature of things can be fathomed, ascribes to 








19 Nietzsche identifies A.W. Schlegel as the originator of the other 
view; although he proclaims that he gives Schlegel’s formulation “a 
deeper sense,” he certainly also exaggerates his own distance from 


element to which Socrates is particularly 
the new opposition: the Dionysian and the Socratic” (BT 12). 

21 Throughout The Birth of Tragedy, Nietzsche’s characterizations of 
Socrates are given without reference to their source; here it seems 
clear that he has in mind the conclusions of Socrates in Plato’s 
Gorgias, and Republic that true happiness can only come 
from virtue and that virtue is equivalent to knowledge. This picture 
is common enough, but it is far from the only one possible; an 
opposite view could perhaps be constructed from the Socrates of the 
Meno, who concludes that virtue cannot be taught. 
knowledge and insight the power of a panacea” (BT 
15). 


Notwithstanding Socrates’ fate at the hands of his 
fellow citizens, Nietzsche has no doubt that this ap- 
proach, developed by Plato, was ultimately victorious 
in its struggle with tragedy: “Optimistic dialectic drives 
music out of tragedy with the scourge of its syllogisms” 
(BT 14). Just as the pessimism of an older generation 
of Greeks explains the origin of tragedy, so the Socratic 
turn in Greek philosophy explains its demise. When the 
population adopted the optimistic perspective, the 
cultural context for tragedy evaporated (see Strong 
1988, 161). From Nietzsche’s perspective, this was 
anything but a theoretical advance. Greek pessimism 
may have been somewhat soporific in its consequences 
(see below), but it had a fundamental honesty that 
Socratic-Platonic philosophy lacks. This point, in par- 
ticular, is reemphasized in the later introduction to The 
Birth. Pessimism is today, as in Nietzsche’s time, com- 
monly associated with ideas of cultural decay, but he 
takes the Greek experience to indicate precisely the 
opposite: 


Is pessimism necessarily a sign of decline . . . as it once was 
in India and now is, to all appearances, among us, “mod- 
ern” men and Europeans? Is there a pessimism of strength? 
... And again: that of which tragedy died, the Socratism of 
morality, the dialectics, frugality, and cheerfulness of the 
theoretical man—how now? might not this very Socratism 
be a sign of decline. ...Is the resolve to be so scientific 
about everything perhaps a kind of fear of, an escape from, 
pessimism? A subtle last resort against—truth (BT, “ASC,” 
1)? 


The Greeks of Socrates’ generation could no longer 
bear to live with the brutal truths of the human 
condition and sought refuge in an optimistic philoso- 
phy. To Nietzsche this was “morally speaking, a sort of 
cowardice ... amorally speaking, a ruse” (BT, “ASC,” 
1). Either way, it was an active self-deception that 
made life more tolerable but less genuine. It was a 
retreat from a real look at the abyss to a pleasing 
fantasy of progress and happiness. Thus, Nietzsche 
concludes, the optimists are the true harbingers of 
cultural decline. What else can we call their weakening 
of resolve in comparison with the stance of the earlier 
Greeks? Nietzsche’s attack on Socrates and Plato is 
often taken to be a defense of irrationalism, but from 
his perspective it is they who retreat from an honest 
assessment of the world. The pessimistic vision of the 
world as fundamentally disordered, untamable, unfair, 
and destructive is the “truth” against which they close 
their eyes and retreat to a cave. 

If this was Greek pessimism, and if it was in some 
sense Dionysian, then what separates it from Ni- 
etzsche’s own later pessimism? We are dealing here 
only with matters of degree, but the differences are real 
enough (a fuller answer will be given below). Ulti- 
mately, the “Dionysian man” of The Birth is likened by 
Nietzsche to Hamlet—both are paralyzed by the 
knowledge of “the eternal nature of things.” Both, that 
is, have gained an understanding of the primordial 
chaos of the world, next to which their own efforts will 
always amount to nothing. Both, therefore, draw the 
conclusion that acting is pointless: 


The Dionysian man resembles Hamlet: both once looked 
truly into the essence of things, they have gained know- 
ledge, and nausea inhibits action; for their action could not 
change anything in the eternal nature of things; they feel it 
to be ridiculous or humiliating that they should be asked to 
set right a world that is out of joint. Knowledge kills 
action; ... true knowledge, an insight into the horrible 
truth, outweighs any motive for action, both in Hamlet and 
in the Dionysian man (BT 7). 


The pessimism of the Greeks resulted in a quies- 
cence that tragedy, rather than purging, encouraged 
and strengthened. This Schopenhauerian conclusion 
that pessimism must issue in resignation is reversed in 
Nietzsche’s later thought. Ultimately, for Nietzsche, 
the combination of the Dionysian and the pessimistic 
served to stimulate activity rather than passivity. In 
some notes for Ecce Homo, he wrote of The Birth that 
it contained, in embryonic form, “the conception of 
pessimism, a pessimism of strength, a classical pessi- 
mism ... The antithesis of classical pessimism is ro- 
mantic pessimism ... e.g., the pessimism of Schopen- 
hauer” (KGW 8.3.21). Although it is true that The Birth 
held many of the elements of Nietzsche’s later account 
of pessimism, this statement probably exaggerates its 
distance from Schopenhauer (see Janaway 1998, 24). 
After all, in the first edition Nietzsche maintains that 
the Greeks derived some sort of “metaphysical com- 
fort” from tragedy; later, in rejecting that conclusion, 
he suggests “you ought to learn the art of this-worldly 
comfort first; you ought to learn to laugh, my young 
friends, if you are hell-bent on remaining pessimists” 
(BT, “ASC,” 7). Ultimately, then, Nietzsche’s Diony- 
sian pessimism stimulates action. But how can the 
elements of Greek pessimism be recombined to draw 
the conclusion directly opposite that of The Birth? 


PESSIMISM AND NIHILISM 


It will be helpful at this point to locate Nietzsche’s 
objections to the pessimism of his day, especially that 
of Schopenhauer and Hartmann. Although Nietzsche 
considered the latter to be a comically simplistic ver- 
sion of the former, his mockery of it is instructive, for 
it reveals a great deal about what he takes popular 
German pessimism to be. 

Nietzsche judged Hartmann’s pessimism to be a kind 
of reverse utilitarianism. That is, Hartmann posed the 
question of life as if it were a simple cost-benefit 
analysis: “Whether it be hedonism or pessimism or 
utilitarianism or eudaemonism: all these modes of 








22 Again it can be objected that this account ignores the Apollonian 
element of tragedy, which allowed the Greeks to put a mask of 
“cheerfulness” over these conclusions (BT 8). But this point should 
not be overstated. The Apollonian elements do not, in Nietzsche's 
account, cause the Greeks to forget Dionysian insights; they are 
simply a means for avoiding suicide. They do not fundamentally alter 
the posture of resignation but, rather, redirect its effects. 

23 Maudemarie Clark (1998, 45) characterizes The Birth as “an 
attempt to save Schopenhauer’s metaphysics by showing how to 
avoid Schopenhauer’s own inconsistency.” 
thought which assess the value of things according to 
pleasure and pain .. . [are] naiveties” (BGE 225). They 
are naive because they take reports of pleasure and 
pain at face value and because they never imagine that 
something other than simple pleasures could be a 
justification of life. Finding the pains of life to outweigh 
the pleasures, Hartmann draws the “logical” conclu- 
sion that life itself is best rejected. To Nietzsche, this 
entire way of thinking is absurd and hardly merits the 
title of “philosophy”; Hartmann’s pessimism is 
no more than a mathematical summation of people’s 
feelings. 

“The sum of displeasure outweighs the sum of pleasure; 
consequently it would be better if the world did not 
exist”—“The world is something that rationally should not 
exist because it causes the feeling subject more displeasure 
than pleasure”—chatter of this sort calls itself pessimism 
today! . . . I despise this pessimism of sensibility: it is itself a 
sign of deeply impoverished life. I shall never permit such 
a meager [one] as Hartmann to speak of his “philosophical 
pessimism” (WP 701; see WP 789). 

Nietzsche’s objection was not that Hartmann per- 
formed his calculation incorrectly. Nietzsche certainly 
did not maintain that life was or would be justified 
when pleasures outweighed pains (see KGW 4.2.414). 
Aside from its sheer simplemindedness, what con- 
demns this approach is the impossibility of making such 
a calculation. We lack the necessary measuring-stick. 
Hartmann assumes that it is possible to stand outside 
life as a whole and, as it were, tote up its pluses and 
minuses. This is inconceivable, in Nietzsche’s view: 
There is no perspective sub specie aeternitatis from 
which to make such an assessment, and the world itself, 
as the pre-Socratics recognized, is in a constant state of 
transformation, or becoming, which renders any such 
calculations transient and useless. 

Becoming is of equivalent value every moment; the sum of 
its values always remains the same; in other words, it has 
no value at all, for anything against which to measure it, 
and in relation to which the word “value” would have 
meaning, is lacking. The total value of the world cannot be 
evaluated; consequently philosophical pessimism belongs 
among comical things (WP 708). 

The world as a whole cannot be said to have any 
particular value, and it cannot be said to have a higher 
(or lower) value at one time rather than at another. We 
can never say that things are getting better or worse 
overall, or that the world as a whole is of high or low 
value. At best, we might say that, taken as a whole, 
things get neither better nor worse, since whatever 
exists is always in a process of transformation. 

Nietzsche found Hartmann’s pessimism comical— 
rather like an infant who rejects the world the moment 
its milk goes missing—and did not worry that many 
would be convinced by it. Or, if they came to espouse 
such a view, it would not be because the arguments 
were persuasive but because, as decadents, they were 
already inclined to this position, like the later Greeks 
(see TI, “Problem of Socrates,” 2). Nietzsche’s reaction 
to Schopenhauer was quite different. Although 
Schopenhauer had made the famous analogy between 
life and a business whose receipts did not match its 
expenses, this was only one element of his argument 
and, to Nietzsche, a minor one, even if it was empha- 
sized by successors such as Hartmann. 
Schopenhauer had, in Nietzsche’s view, made a 
moral judgment against life and not merely an eco- 
nomic calculation in its disfavor. Schopenhauer, unlike 
Hartmann, recognized the fundamental disorder of the 
world first identified, in a different way, by the Greeks. 
This, to Nietzsche, was Schopenhauer’s great advance 
on all philosophy since Plato. It should have led him 
back to something like the tragic view of the pre- 
Socratics, but instead he drew a judgment against such 
a world, based on a moral standard.25 That is, he 
attempted to account such a chaos generically “evil,” 
but could only do so based on an imagined “good” 
stability, a timeless world of Being, against which our 
transient, everyday world could be measured. 


Given these two insights, that becoming has no goal and 
that underneath all becoming there is no grand unity in 
which the individual could immerse himself completely as 
in an element of supreme value, an escape remains: to pass 
sentence on this whole world of becoming as a deception 
and to invent a world beyond it, a true world (WP 12; see 
WP 6, 9, 11). 


This is the pessimism that leads to nihilism. It is 
pessimistic in the sense that it rejects the optimism 
inherent in the idea of an ordered universe. On this 
point, Schopenhauer remains, to Nietzsche, a great 
critic of the nineteenth-century social philosophies of 
progress, whether of the liberal-English or Hegelian- 
German varieties—and certainly is a stimulant to 
Nietzsche’s own rapprochement to the pre-Socratics. 
But rather than embrace the natural chaos, as Greek 
tragedy did, Schopenhauer devised one final strategy to 
keep it at bay, namely, to sit in judgment and deem it 
bad. This, to Nietzsche, is something worthy of being 
called nihilism, and it is much more serious than 
Hartmann’s calculation that for most individuals their 
quanta of pain exceeds their quanta of pleasure. It is 
instead a judgment against the world as a whole, a wish 
that it would not exist. Schopenhauer’s pessimism is 
more severe because it cannot be cured, even in theory, 
by increasing the amount of pleasure in the world. 
Instead, life in its very nature is something worthy of 
rejection. Schopenhauer’s pessimism endorses the wis- 
dom of Silenus: “Not to be born is best,” and for man 
“the next best thing by far is to go back / back where he 
came from, quickly as he can.” 

Ultimately, this strategy can be met with the objec- 
tion Nietzsche raises to Hartmann: No observation 


24 This point has been overlooked by several commentators on the 
relationship between Schopenhauer and Nietzsche. Soll (1988, 113; 
1998, 83-6), for instance, seems to assume that it is the surplus-of- 
pain argument that defines Schopenhauer and links him to Ni- 
etzsche. Cartwright (1998, 136) calls Schopenhauer a “quasi-hedo- 
nist” but admits that Nietzsche is not. 

25 There is strong evidence for this interpretation. For example, 
Schopenhauer (1970, 49) wrote: “Nothing is more certain than that, 
generally speaking, it is the grievous sin of the world which gives rise 
to the manifold and great suffering of the world; whereby is meant not 
any physical-empirical connexion but a metaphysical one.” 

26 Sophocles, Oedipus at Colonus, lines 1388-91 (1982, 358). Nietzsche 
discusses “the terrible wisdom of Silenus” in BT 3 and 4. 
point exists from which to make such a judgment. 
Furthermore, there is no evidence for such a “true” 
world, only our “psychological need” for it to exist. 
Nietzsche makes these objections but then goes far- 
ther: Schopenhauer did not, in the end, have the 
courage of his own convictions: “a pessimist, a world- 
denier and God-denier, who comes to a halt before 
morality—who affirms morality and plays the flute [a 
pastime of Schopenhauer’s], affirms laede neminem 
[harm no one] morality: what? is that actually—a 
pessimist” (BGE 186)? In other words, Schopenhauer 
betrayed the logical outcome of his own pessimism at 
the last possible moment by rejecting the implications 
of his ontology solely on a moral basis, and an unver- 
ifiable moral basis at that (see Higgins 1998, 167f.). 

At least for Socrates, optimism was based on the 
idea that the world had an order; when this idea was 
abandoned, the Socratic morality should have disap- 
peared as well. But Schopenhauer denied the results of 
his own ontology. That he sought to preserve morality 
with a transcendental projection was transparently a 
failure of nerve—and one with severe consequences. 
When morality no longer claims to have a basis of any 
kind in the real world of events, it is free to condemn 
that world in toto, to develop into genuine philosoph- 
ical suicidalism. Although Nietzsche saw the roots of 
this sort of thinking stretching back to Socrates’ final 
request that his debt to Asclepius be paid, it was not 
until Schopenhauer completely uncoupled morality 
from ontology that this tendency could fully develop 
(TI, “The Problem of Socrates”). Nihilism, at least of 
this variety, is pessimism mixed with morality in a kind 
of devil’s cocktail.27 

This is the basis for Nietzsche’s repeated claim that, 
rather than rejecting Schopenhauer’s pessimism, he 
“deepened” it and “first really experienced it” (WP 
463). Whereas Schopenhauer mixed up and adulter- 
ated his pessimism with morality, Nietzsche takes 
himself to be purifying pessimism of the imperfections 
that Schopenhauer, its modern inventor, introduced to 
it (see BGE 56). In the introduction to the second 
volume of Human, All-too-Human, written in 1886, 
Nietzsche looks back on his Schopenhauer as Educator 
and explicitly traces this development: “I then went on 
to give expression to my reverence for my first and only 
educator, the great Arthur Schopenhauer... I 
was... already deep in the midst of moral skepticism 
and destructive analysis, that is to say in the critique and 
likewise the intensifying of pessimism as understood 
hitherto” (AOM, Preface, 1; see also WP 463). That 
critique is rooted in a moral skepticism: Schopenhauer 
relies on moral categories in passing a final judgment 
on the world, and Nietzsche rejects these categories. 
But the critique is likewise an intensification because it 
liberates Schopenhauer’s pessimism (which is, after all, 
his original contribution) from the commonplace mo- 


27 Perhaps this is one example Nietzsche had in mind when he wrote 
of “the hidden history of philosophy, the psychology of its great 
names,” that “Error is cowardice—every achievement of knowledge 
is a consequence of courage, of severity toward oneself” (WP 1041). 
See also WP 382. 
rality within which it has been encased.28 The attack on 
Schopenhauer is thus, in spirit, an act of loyalty. And it 
leaves Nietzsche free to “experience” pessimism in a 
way unavailable to Schopenhauer— unavailable in fact 
to any philosopher in the West ever since Socrates 
poisoned Greek pessimism by his introduction of opti- 
mistic morality. 

If this means confronting the terror that Schopenhauer, 
and the pre-Socratic Greeks, found in the 
prospect of a world of flux and becoming, Nietzsche’s 
perspective at least offers the advantage of not suc- 
cumbing to a nihilism that rejects life as a whole (see 
Soll 1998, 101ff.). Pessimism recognizes that “becoming 
aims at nothing and achieves nothing,” and pessimism 
does not sit in judgment of this condition (WP 12). What 
is the result? “The innocence of becoming restored” 
(TI, “Four Great Errors,” 8).29 This idea Nietzsche 
considers “a tremendous restorative” (WP 765) just 
because we are released from the burden that morality 
imposes on us. Morality causes us to judge the world as 
a whole (an impossibility) and to judge it negatively (a 
mistake predicated on an impossibility): “Insofar as we 
believe in morality we pass sentence on existence” 
(WP 6), we “find existence a misfortune” (KGW 7.1.192). 
Along with the terror, there is also “a great liberation” 
involved in pessimism (TI, “FGE,” 8). We no longer 
give credence to the world-hatred and self-hatred 
bound up with morality. The burden of its judgment is 
removed. 

But are we not then returned to the Hamlet-like 
impotence of the pre-Socratics? Perhaps not. To the 
innocence of becoming, Nietzsche now believes there 
will be two broad categories of response, which he 
characterizes as arising out of “strength” and “weak- 
ness”; these indicate a capacity (or lack of capacity) to 
tolerate the meaninglessness of life. Those who cannot bear 
this sort of existence, who require an ultimate meaning 
to life, end up once again as nihilists, although in a 
psychological rather than a moralistic sense: “One 
grants the reality of becoming as the only reality, 
forbids oneself every kind of clandestine access to 
afterworlds and false divinities—but cannot endure this 
world although one does not want to deny it” (WP 12). 
This sort of pessimism does result in despair and 
resignation, and “the weak perish of it” (WP 37). This 
is what Schopenhauer’s position, stripped of its illu- 
sions of a thing-in-itself, would amount to. 

Yet, it is a mistake to believe that human beings 
need such premanufactured meanings, for there is a 
pessimism of strength as well. “It is a measure of the 
degree of strength of will to what extent one can do 


28 On Schopenhauer’s self-consciously unoriginal morality, see BGE 
186. 
29 This translation makes it appear as if Nietzsche is replacing one 
moral judgment about the world (i.e., that it is “guilty”) with another 
(that it is “innocent”). But the word Nietzsche uses is “unschuld,” the 
opposite in everyday speech of “schuld” or “guilty.” Unschuld can be 
translated as “innocent,” but the more literal translation 
is “not guilty” or “lacking guilt.” Nietzsche’s use of it here does not 
reverse the moral judgment but, insofar as possible, removes it 
entirely. To say that Becoming is unschuld is to adopt an agnostic 
position as to its moral worth and, moreover, to suggest that such a 
valuation is, in itself, inappropriate. 
without meaning in things, to what extent one can 
endure to live in a meaningless world because one 
organizes a small portion of it oneself” (WP 585). It is 
one thing to know that there is no natural or God-given 
meaning to the world as a whole or to life as a whole. 
But, from that point, to make “the inference that there 
is no meaning at all” is a “tremendous generalization,” 
one that Nietzsche considers “pathological” (WP 13). 
Such a generalization represents an absolute disfaith in 
humanity, a presumption that humans can create no 
meaning other than that which they are given, which in 
itself is as without foundation as the earlier belief in a 
natural moral order to the world.30 The alternative to 
this is based on the human capacity to create meanings 
of a temporary nature in our own corner of the cosmos. 
The lack of an overall natural meaning in the universe 
is no argument that adequate meaning cannot be 
generated by individuals. Nietzsche may have made 
this faulty generalization at the time he wrote The Birth 
of Tragedy, but he abandons it in his later work. 
Aphorism 370 of The Gay Science encapsulates this 
transformation. Nietzsche describes his initial attrac- 
tion to “philosophical pessimism” (Schopenhauer) and 
“German music” (Wagner) as based on a misunder- 
standing. What appeared to him at first as a cultural 
“earthquake” emerging from a Dionysian “over-full- 
ness of life” was in fact the product of “the impover- 
ishment of life,” which Nietzsche now labels “roman- 
ticism.” Romanticism only simulates something 
revolutionary; its radicalism is feigned. It seeks “above 
all mildness, peacefulness, and goodness in thought 
and deed ... also logic, the conceptual understandabil- 
ity of existence ... in short, a certain warm narrowness 
that keeps away fear and encloses one in optimistic 
horizons.” Just as Wagner began his career as a 
partisan of the 1848 revolutions but ended as an unctuous 
courtier to German princes, so Schopenhauer 
began with a seeming rejection of Socratic optimism 
but, in the final analysis, retreated to it. This “romantic 
pessimism” is thus “an altogether different kind” from 
Nietzsche’s own, which he names here for the first time 
as “Dionysian pessimism.” 
That there still could be an altogether different kind of 
pessimism, ... this premonition and vision belongs to me 
as inseparable from me, as my proprium and ipsissimum 
... I call this pessimism of the future—for it comes! 
I see it coming!—Dionysian pessimism. 
Romanticism, although it was Nietzsche’s own starting 
point, turns out to be a kind of sham pessimism, and 
Nietzsche here declares his independence from it. 
Unadulterated pessimism is only now coming into 
existence; only when it does can we fully appreciate its 
promise and dangers. Nietzsche rejects Schopenhauer 
(although grateful for the education he provided), not 


30 Soll (1988, 116) and Higgins (1998, 174), among many others, tend 
to view the difference between the two alternatives as a matter of 
temperament, so that the choice for strength or weakness is either 
something inborn and unalterable or a radical choice with no 
philosophical basis. I think the passages quoted above make clear 
that this is not the case. Every philosophical choice, for Nietzsche, is 
in some respect a matter of character, but it is clear that these two 
positions are separated by ideas, not just moods. 


because the latter is too pessimistic, but because he is 
not pessimistic enough. 


THE PESSIMISM OF STRENGTH 


By a process of elimination, we have come some 
distance closer to understanding the pessimism of 
which Nietzsche could approve, but it remains to give a 
more detailed account of it. Certainly, his pessimism is 
a kind of no-saying, a rejection of traditional morality. 
But he emphasizes the activity involved in such a 
no-saying and considers it, by itself, to be something 
valuable. The alternative title Nietzsche gave one of his 
final books is perhaps a good starting point. How to 
Philosophize with a Hammer is the second name given 
to Twilight of the Idols. Throughout the book, however, 
there is little reference to this “hammer,” and readers 
are often left wondering just what it is.31 In his notes 
Nietzsche repeatedly refers to pessimism as a kind of 
“hammer,” one used to break down and break apart 
traditional ways of thinking (e.g., WP 132, 1055). This 
destruction is healthy and recuperative on its own, even 
apart from some rebuilding that may come: “The 
hammer: a teaching which through setting loose the 
death-seeking pessimism brings about an extraction of 
the most vital” (KGW 8.1.108). 

What does it mean to wield the pessimism of 
strength as a tool? In the first place, of course, it means 
to attack existing moralities, “to teach destructive ways 
of thinking” (KGW 7.3.210). In this task, pessimism is 
an all-purpose instrument because it attacks the basis 
of all moralities, not just some of them. By denying the 
existence of any natural order to the universe and 
emphasizing the continuous flow of becoming and 
time, pessimism is as critical of utilitarian morality as it 
is of the Christian or Kantian variety. But its effect is 
not simply a critical one. Even if destruction is a 
necessary prelude, that is not the end in itself. A 
hammer also can be used to put something together— 
indeed, it is one of the few tools to possess this dual 
property. Likewise, pessimism “in the hand of the 
strongest becomes simply a hammer and instrument 
with which one can make oneself a new pair of wings” 
(KGW 8.1.109). 

Wings of what sort? Here lies the difference between 
Nietzsche’s pessimism and previous ones. Even the 
past pessimisms Nietzsche admired, such as that of the 
Greeks, came to an end with the destruction of illu- 
sions. In his account, the pre-Socratics evoked an ethos 
of virtual paralysis. As does Buddhism, they taught one 
to be at peace with the world’s chaos but not to seek to 
alter it. In a long note entitled “Critique of previous 
pessimism,” Nietzsche outlines his alternative: 


31 In the foreword, Nietzsche likens the hammer to “a tuning-fork” 
with which he sounds out the hollowness of idols. But this does not 
square well with his characterization of the work as “a declaration of 
war” and does not, in any case, help us understand the nature of the 
hammer. The final section is entitled “The Hammer Speaks” but is 
simply a short quotation from Thus Spoke Zarathustra. Perhaps the 
hammer is Zarathustra himself? But this only further begs the 
question. 
Our pessimism: the world does not have the value we 
thought it had... . Initial result: it seems worth less [sic]; 
... simply in this sense are we pessimists; namely, with the 
will to admit this revaluation to ourselves unreservedly and 
not to tell ourselves the same old story, not to lie to 
ourselves. 

That is precisely how we find the pathos that impels us 
to seek new values. In sum: the world might be far more 
valuable than we used to believe; ... while we thought we 
accorded it the highest interpretation, we may not even 
have given our human existence a moderately fair value 
(KGW 8.1.248). 


Nietzsche is treading a delicate line, since, as we have 
seen, he also says it is a mistake to impute to the world 
any overarching value. But this does not mean we 
should cease to value anything at all. Schopenhauer’s 
philosophy tends toward that nihilistic conclusion, 
which Nietzsche wants to resist. Pessimism as such 
need not lead there. Rather, the withdrawal of an 
overarching account of the world’s value impels one to 
seek “new values” (note the plural). No single one of 
these can replace the old value system, but separately 
they may give us more reasons to continue living than 
can any overarching Meaning of Life. Christian moral- 
ity and its offshoots seek to overcome thoughts of 
suicide with one ultimate duty, or ultimate happiness. 
Nietzsche’s pessimism advises each of us individually to 
cobble together a meaning for life out of lesser goals 
with the ultimate result that, when these are gathered 
together, “the world might be far more valuable than 
we used to believe.” 

Nietzsche’s inspiration here, as in so many other 
matters, is the example of a certain kind of art. His 
praise of art is not the romantic idea that it puts us in 
touch with great truths.32 Rather, art represents the 
organization of a small portion of an otherwise mean- 
ingless world that gives purpose to an individual exis- 
tence (WP 585). It is the attempt to impose a tempo- 
rary form on the inevitable transformation of the 
world; since the world must acquire some sort of 
particular form in its metamorphoses, art is “repeating 
in miniature, as it were, the tendency of the whole” 
(WP 617)—but by an effort of will. Art is not an 
attempt to fight the pattern of existence but an effort to 
shape that pattern into something recognizable, “to 
realize in oneself the eternal joy of becoming—that joy 
which also encompasses joy in destruction” (TI, “What 
I Owe,” 5). The creativity of artists is, in essence, 
“gratitude for their existence” (WP 852). 

When art assumes this shape, it becomes “the great 
seduction to life, the great stimulant to life” (WP 853). 
This is not to say, however, that such art must be 
“uplifting” in the conventional sense. Since joy in 
destruction may be a stimulant to life, even depictions 
of the most miserable things may be included: “The 


32 His praise of art is not indiscriminate. He goes to great lengths to 
distinguish the sort of art he has in mind from that produced by the 
“artists of decadence” and “romanticism in art” (Wagner is always 
his chief example), which proceeds from “an impoverishment of life” 
and ends in “hatred of the ill-constituted, disinherited, and under- 
privileged ... one who, as it were, revenges himself on all things by 
forcing his own image, the image of his torture, on them, branding 
them with it” (WP 852, GS 370). 
things they display are ugly: but that they display them 
comes from their pleasure in the ugly .. . . How liberat- 
ing is Dostoevsky” (WP 821)! Nietzsche does not 
mean, of course, that we should all be artists, but we 
should approach our lives as artists do their work. If we 
can understand why an artist who knows that art is 
devoid of metaphysical value still wants to paint pic- 
tures, then we can understand why Nietzsche thinks 
pessimism can result in a creative pathos. A better 
example might be the situation of an architect: Any 
sane architect must know that no building lasts forever. 
Built in opposition to nature (as to some extent every 
human structure must be), it will be attacked by nature 
(by wind, water, and so on) the moment it is completed. 
Whatever the purpose for which the structure is de- 
signed, that purpose will someday be superseded. 
However beautiful it may seem when erected, it will 
someday, to another set of eyes, appear ugly. Yet, 
knowing all this, architects pursue their craft. Knowing 
that the universe will ultimately not tolerate their work, 
they continue to organize a small portion of that same 
universe for local purposes. 

The lack of order in the universe can also fuel 
nihilism, as Nietzsche is well aware. Unlike the nihilist, 
the pessimist does not just reveal the tragic character of 
existence but achieves a degree of equanimity about it. 
This aspect of pessimism often comes across as indif- 
ference to the suffering of others. Indeed, at times 
Nietzsche verges on expressing such indifference, but in 
the end he does something rather different. He advises, 
instead, that we not look to nature or God to express a 
horror of suffering on our behalf and that we not 
imagine that such suffering is any less “natural” than 
happiness: 

The benefit consists in the contemplation of nature’s 
magnificent indifference to good and evil. No justice in 
history, no goodness in nature: that is why the pessimist, if 
he is an artist, goes in historicis to those places where the 
absence of justice is revealed with splendid naiveté ... and 
also in nature, to those places where her evil and indiffer- 
ent character is not disguised (WP 850). 


The view that pessimism leads to resignation usually 
includes the notion that it promotes a disinterest in the 
workings of the world. Nietzsche suggests just the 
opposite. Pessimism is an invitation to a new critical 
investigation of nature and history, even those ele- 
ments of life that we consider ugly and evil. 

One effect of this situation is that when we look at 
the world once again—without the grey-colored glasses 
of morality—we may see things differently. We may 
now find ourselves curious about that which, for mil- 
lennia, we were taught to shun. Indeed, Nietzsche sees 
curiosity about what has been considered evil to be one 
of pessimism’s greatest benefits. This does not mean we 
will simply celebrate what we once abhorred. Rather, 
we will seek it out on its own terms and come to our 
own fresh evaluation of it, and this goes for what was 
once called “good” as well: 


Let us dwell a moment on this symptom of highest 
culture—I call it the pessimism of strength. Man no longer 
needs a “justification of ills”; “justification” is precisely 
what he abhors: he enjoys ills pur, cru; he finds senseless 
ills the most interesting. If he formerly had need of a god, 
he now takes delight in a world disorder without God, a 
world of chance, to whose essence belong the terrible, the 
ambiguous, the seductive. In such a state it is precisely the 
good that needs “justifying” (WP 1019). 


Nietzsche is quite clear that what we previously called 
good may well find a justification, but not “justifica- 
tion” in its previous sense: “If he in praxi advocates 
preservation of virtue, he does it for reasons that 
recognize in virtue a subtlety, a cunning, a form of lust 
for gain and power” (ibid.). 

The pessimism of strength involves the use of pessi- 
mism as a hammer—as a philosophical technology—to 
destroy and to build. Pessimism is both a critique of 
existing moralities and an instrument in the construc- 
tion of an alternative apart from morality. Far from 
ending in despair and resignation, Nietzsche considers 
the moment when “my type of pessimism” appears, 
“the great noon,...[the] great point of departure” 
(WP 134). Pessimism may not be the end of the 
journey, but all roads to the future lead through it, and 
it may be necessary to remain pessimistic for “a few 
millennia” (KGW 7.3.210). Can we say more about this 
alternative? In particular, what is Dionysian pessi- 
mism? 


DIONYSIAN PESSIMISM 


It is probably true that Nietzsche was less interested in 
assigning content to these hypothetical new values than 
in demonstrating that they should exist. One note on 
“the pessimism of the energetic” emphasizes that “the 
‘to what end?’ after a terrible struggle [is] . . . itself a 
victory” (KGW 8.2.62). The simple desire to formulate 
new goals after overcoming earlier moralities is some- 
thing to be celebrated. Although Nietzsche speaks 
often of a revaluation of values, he never provides a 
new set to replace the old. Indeed, given his well- 
known sentiment that “a will to a system is a lack of 
integrity” (TI, “Maxims,” 26) and his radically individ- 
ualistic belief that the formulation of new values is 
something each of us should undertake on our own (Z 
1:22), it would be unfair to expect this from him. Still, 
we are not left simply with the imperiously vague 
injunction to “create new values.” Dionysian pessimism 
is not itself a value system, but it is an ethos that sheds 
some light on what it might be like to live a good life in 
the era following the death of God. This pessimism is a 
sort of art of living. It is a life-practice that Nietzsche 
recommends, although not for everyone. 

Some sense of what Nietzsche meant by Dionysian is 
given in The Birth of Tragedy, but the use of this word 
continued to evolve (although he often wrote as if all 
the later meanings were implicit in the earlier ones). If 
Dionysian pessimism is the one “no” that evolves out of 
“yes,” then it is important to know what one is approv- 
ing with a Dionysian “yes.” From the various texts and 
notes that bear on this question, the answer seems to 
be something on the order of “life as a whole” or “the 
world as it is and will always be.” But, as Nietzsche was 
fond of pointing out with regard to Hartmann, there is 
really no perspective from which to view life as a whole 
(whether to deny or affirm it), so such an assent can 
only be a kind of gamble or risk-taking. It is an 
affirmation in the dark, an approval given in ignorance. 
Above all, it is a decision to welcome the unknown 
future and accept the unseen past rather than cling to 
a familiar present (Z 2:20). All pessimisms conclude 
that the universe has no order and human history no 
progress; the Dionysian variety is the only one that can 
find something to like about this situation: 


My new way to “yes.” My new version of pessimism as a 
voluntary quest for fearful and questionable aspects of 
beings... A pessimist such as that could in that way lead 
to a Dionysian yes-saying to the world as it is: as a wish for 
its absolute return and eternity: with which a new ideal of 
philosophy and sensibility would be given (KGW 8.2.121). 


The phrase “fearful and questionable,” which recurs 
frequently in Nietzsche’s texts, is carefully chosen to 
indicate what is at issue.33 The aspects of existence that 
we have the greatest difficulty grasping and affirming 
are not the cruel and disgusting; they are those whose 
existence is so threatening to our sense of order that we 
have heretofore denied their very being, so that initially 
we find them “questionable” or “dubious.” Which are 
these? In Twilight of the Idols, Nietzsche ridicules “the 
almost laughable poverty of instinct displayed by Ger- 
man philologists whenever they approach the Diony- 
sian.” Why laughable? Because these philologists can- 
not recognize what is, so to speak, right under their 
noses. The “Dionysian mysteries” are simply “the 
mysteries of sexuality... the sexual symbol was to the 
Greeks the symbol venerable as such, the intrinsic 
profound meaning of all antique piety” (TI, “What I 
Owe,” 4; see Higgins 1998, 170ff.). The absurdity of 
post-Socratic philosophy is ultimately demonstrated in 
its attitudes toward sex and the body. What ought to be 
the most obvious and immediate source of knowledge 
and pleasure is not merely obscured but almost entirely 
obliterated. Cruelty may be condemned by morality, 
but at least it is acknowledged; sexuality is eliminated 
from view through a process of “moral castrationism” 
(WP 204, 383). 

Sexuality, not cruelty, represents that part of life with 
which it is most difficult to come to terms. It is the most 
difficult not because it is inherently shameful (“It was 
only Christianity...which made of sexuality some- 
thing impure”: TI, “What I Owe,” 4). The difficulty lies 
in affirming the necessity for pain and suffering that 
accompanies any growth. That is, it involves admitting 
that we ourselves (and not just the world) are essen- 
tially flux and change. With its constant dissolution of 
boundaries, sexuality is more threatening to the opti- 
mist than is the human tendency to cruelty. This 
violation of self—simultaneously painful and pleasur- 
able—is the simplest and best evidence that our own 
nature is as unstable and tumultuous as that of the rest 
of the universe and, therefore, that no calculation of 





33 Furchtbaren und fragwürdigen also can be translated as “terrible 
and doubtful.” For other uses of this term, see, for example, WP 852, 
GS 370. The phrase always refers to those things which the pessimist 
can bear the sight of but others cannot. 
our best interest can ever be permanent. The Dionysian 
is “the triumphant Yes to life beyond death and 
change; true life as collective continuation of life 
through procreation” (ibid.). But this can only come at 
the cost of suffering, as the price to be paid for 
continuous rebirth: “In the teaching of the mysteries, 
pain is sanctified: the ‘pains of childbirth’ sanctify pain 
in general—all becoming and growing, all that guarantees 
the future, postulates pain... All this is contained 
in the word Dionysus” (ibid.). In Christian morality, 
the pains of childbirth are the Curse of Eve, and 
sexuality the sin which enables and stands for sin in 
general; it is this symbolism (and this generalization) 
which Nietzsche urges us to reverse. The Dionysian is 
not simply sexuality (Nietzsche is not Freud); rather, 
the repression of sexuality represents the repression of 
the “fearful and questionable” as such. Accepting the 
necessity of these things, setting aside the goal of 
happiness as the ultimate aim of a human life, is what the 
Dionysian “yes” requires. 

This does not mean that happiness must disappear 
from human life. Setting it aside as the final goal does 
not mean banishing it altogether. But if happiness is to 
be found, it can only be on these new terms. We can 
only take our pleasures in an acceptance of this chaotic 
and, we now know, painful condition. Pleasure and 
pain cannot be separated, as the utilitarian or simplistic 
pessimists contend with their efforts simply to seek one 
and avoid the other. Destruction must be known and 
acknowledged as part of anything creative or good. The 
true embrace of becoming at the expense of being 
means to take pleasure in the suffering that accompanies 
the demise of whatever is. 


The joy of Being is only possible as the joy of appearance. 
The joy of becoming is only possible in the destruction of 
the actuality of “Beings,” the beautiful visions, in the 
pessimistic annihilation of illusions. In the destruction also 
of beautiful illusions, Dionysian joy appears as its climax 
(KGW 8.1.114; see also EH, “Destiny,” 4). 


This is something we have great difficulty doing. 
Nietzsche knew such an idea would sound dreadful to 
most. It is not enough to withdraw our condemnation 
of suffering. It is not enough to retreat to an agnostic 
shrug and agree to coexist with “necessary” suffering. 
That would equate to being agnostic about life itself. 
Instead, we must approve it. That is why Nietzsche 
depicts the idea of eternal recurrence as something 
proposed by a “demon” and the “greatest weight” upon 
one’s conscience (GS 341). To will the eternal recur- 
rence is to will endless suffering. Why should we 
sanction suffering, even our own, much less that of 
others? 

If Nietzsche’s reply is simply “because it is an 
unalterable part of life,” then we are tempted to return 
to the position of Schopenhauer. Indeed, perhaps we 
now can see the attractions of that position most 
clearly. Why not reject this life we are offered, as 
Schopenhauer suggests, if to endorse it means to 
endorse endless and unalterable suffering? Nothing 
requires us to participate in the suffering of others. Our 
every moral instinct rebels at the thought. If we are 
truly powerless against suffering, as Schopenhauer sug- 
gests, why not just withdraw? To this question, Ni- 
etzsche cannot give the sort of answer that provides any 
comfort. He cannot offer any unrebuttable reason for 
preferring affirmation over denial. In a world of flux, no 
such “reason” can permanently exist. This is why 
Nietzsche refers to strength—not because the strong 
survive and the weak die, but because those who affirm 
have the strength to control their disgust long enough 
to give themselves a local reason to live. 

The Dionysian “yes” is not a matter of taking a 
sadistic pleasure in the suffering of others. It is a 
decision to value the future over the present. To be 
glad that the world is one of becoming rather than 
being means to be glad that things are always changing, 
that the future is always coming, and the present is 
always passing away. It means detachment from what- 
ever exists at present, which inevitably appears as 
callousness toward others: “Dionysian wisdom. Joy in
the destruction of the most noble and at the sight of its 
progressive ruin: in reality joy in what is coming and lies 
in the future, which triumphs over existing things, however 
good” (WP 417, second emphasis added). This is what 
Nietzsche had in mind by such phrases as “amor fati” or 
eternal recurrence. He is not saying that we must relive 
the past again and again; rather, this pattern of destruc- 
tion and creation is unalterable and must be borne 
(WP 1041). And it cannot be withstood through faith in 
progress. We must learn to hope in the absence of an 
expectation of progress. If this sounds almost nonsen- 
sical to the modern ear, perhaps it is because we have 
been told for so long that progress is the rational thing 
to hope for. 

The difference between the false pessimism of 
Schopenhauer and Nietzsche’s version is explicitly out- 
lined in Zarathustra as a difference in their respective 
attitudes toward our fate of temporality. In the twen- 
tieth section of the second book, titled “On Redemp- 
tion,” Nietzsche traces two approaches to our time- 
bound condition. First, he describes the preaching of 
“madness.” But madness speaks as Nietzsche once did 
himself, by citing the aphorism of Anaximander—in a 
slightly altered form to bring out what Nietzsche now 
considers its vengefulness: “‘Everything passes away;
therefore everything deserves to pass away. And this, 
too, is justice, this law of time that it must devour its 
children.’ Thus preached madness” (Z 2:20). Madness 
continues to speak more directly in the voice of 
Schopenhauer, whose solution to the problem of time 
is to withdraw from the life of the will insofar as is 
humanly possible. “Can there be redemption if there is 
eternal justice? Alas, the stone It was cannot be moved:
all punishments must be eternal, too. . . . No deed can
be annihilated: . . . This, this is what is eternal in the 
punishment called existence, that existence must eter- 
nally become deed and guilt again. Unless the will 
should at last redeem itself, and willing should become 
not willing.” To Nietzsche, this attitude of resignation 
toward our place in time can only be called madness, 
the product of “the spirit of revenge” or “the will’s ill 
will against time.” It moves too quickly from the 
inescapability of time to the idea that it enslaves us. 
Second, Nietzsche contrasts this false redemption
with a better one: Our temporality conditions us but 
does not imprison us: “I led you away from all these 
fables when I taught you, ‘The will is a creator.’ All ‘it 
was’ is a fragment, a riddle, a dreadful accident—until
the creative will says to it, ‘But thus I willed it.’” Rather
than hate backward, as it were, Nietzsche suggests that 
we look forward. If the present is the result of an 
unalterable past, it is also the source of a very alterable 
future: “I walk among men as among the fragments of 
the future—that future which I envisage.” Instead of a 
false redemption that is essentially an abandonment of 
society, the true pessimist (pessimistic still because he 
accepts our time-bound condition and all it entails) 
sees an opportunity, whereas the false sees only a 
conclusion: “To redeem those who lived in the past and 
to re-create all ‘it was’ into a ‘thus I willed it’—that
alone should I call redemption.” Schopenhauer’s ro- 
mantic pessimism acknowledges the power of the past 
but not the open horizon of the future. It is madness 
because it seems to be based on a hostility to existence 
that Nietzsche ultimately finds inexplicable except as 
self-hatred. Temporality is not just a limitation but a 
source of potential. The redemption of the past to 
which Nietzsche looks forward may be unlikely, but at 
least it is not an impossibility. His pessimism allows for 
possibilities. 

Dionysian pessimism may be “fearful and question- 
able,” but the alternative is worse. In a famous note 
Nietzsche embodies the two choices as “Dionysus and 
the Crucified”: “It is not a difference in regard to their 
martyrdom,” that is, in whether the two personifica- 
tions of different life-practices suffer and die, “it is a 
difference in the meaning of it” (WP 1052). In other 
words, it is not a question of how death and suffering 
can be minimized; in a pessimistic view, the greater 
portion cannot be avoided. “The problem is that of the 
meaning of suffering: whether a Christian meaning or a 
tragic meaning.” We are only given the choice of 
accepting this life as a whole or rejecting it as a whole.34
There are more than two possible meanings for suffer- 
ing, and we can surely struggle to alter those elements 
of life within our purview, but we will still be faced with 
the larger question when we cannot pick and choose. 
One alternative is to reject life as a whole: “The god on 
the cross is a curse on life, a signpost to seek redemp- 
tion from life.” The other is to embrace life, with all the 
suffering entailed, both for ourselves and for others: 
“Dionysus cut to pieces is a promise of life: it will be 
eternally reborn and return again from destruction” 
(ibid.). If one accepts the pessimistic assessment of the 
world as a place of chaos and dissonance, one faces the 
choice of retreating from it or embracing it and trying 
to “let a harmony sound forth from every conflict” (WP 
852). 


34 This is perhaps what Camus ([1955] 1983, 3) had in mind when he
wrote: “There is but one truly serious philosophical problem, and
that is suicide.” See Cartwright 1998, 149.


THE FUTURE OF DIONYSIAN PESSIMISM 


The commonplace understanding that pessimism must 
lead to hopelessness and resignation is unjustified.35 In
Dionysian pessimism, Nietzsche creates an alternative 
that is as ruthlessly skeptical toward all ideas of 
progress as is Schopenhauer’s pessimism but does not 
issue in despair (see Janaway 1998, 25). It looks toward 
the future, not with the expectation that better things 
are foreordained, but with a hope founded only on 
taking joy in the constant processes of transformation 
and destruction that mark out the human condition. 

The belief that pessimism must lead to resignation 
makes one of two errors: Either it mistakes Schopen- 
hauer’s variant (or Wagner’s or Buddha’s) for the 
whole of pessimism, or it sees no other possible re- 
sponse to the realization that we live in a tragic, 
disordered, immoral world. Why is it commonly 
thought that human beings must be disappointed at the 
prospect of a world in constant flux and chaos, where 
no moral order can be sustained? The answer is the 
assumption that human beings are creatures of order, 
that we are discomposed by chaos. To Nietzsche, we 
are no different from the world to which we are 
condemned; we are not islands of being in a sea of 
becoming but are constantly transforming and devel- 
oping. He envisions a “Dionysian world of the eternally 
self-creating, the eternally self-destroying . . . without 
goal... Do you want a name for this world? ... This 
world is the will to power—and nothing besides! And you 
yourselves are also this will to power—and nothing 
besides!” (WP 1067). To restore “the innocence of 
becoming” to the world means likewise to restore it to 
ourselves and to face this chaotic world not as a 
creature alien to it or fallen from it but as part of that 
which we find most threatening. Those who believe 
pessimism leads to resignation see humans as “weak” 
creatures who must have transcendental meanings in 
order to survive. But pessimists need not believe that 
the demise of traditional beliefs must lead to aimless- 
ness and suicide. It is rather those who fear pessimism, 
or fear the repeal of traditional moralities, that main- 
tain this. 

What does it mean to go through life with no 
expectations or, more precisely, with an expectation of 


concept of resignation has recently been elaborated by Dumm 
(1998), who finds affirmative possibilities that deserve consideration. 
But I use the term in the widely accepted sense of despair or 
purposeful withdrawal from activity. 
36 To be sure, there is something paradoxical about this formulation.
To imagine ourselves as will normally would imply our will has some
object. Yet, if there are no permanent objects, only an eternal flux,
including ourselves, then how is this possible? The paradox is not
eliminated but mitigated in light of Nietzsche’s critique of our
subject-object grammar and his related critique of our ideas of
causality. In GS 370, Nietzsche explicitly links these to the emergence
of Dionysian pessimism. The seeming strangeness of what he proposes
emerges as much from our ordinary grammar of “will” as it
does from the propositions themselves. In this passage it seems clear
that Nietzsche gives us an inaccurate shorthand “name” for what he
describes only because we, his readers, demand it. Nietzsche’s
critique of causality is especially vivid in his discussion of dreams (see
GS 22, 112; TI, “Four Great Errors,” 4; WP 479, and Dienstag 1997,
96-100).
nothing? To be sure, one is deflected from a certain
kind of global ambition. The desire wholly to remake 
the world in one’s image, in whatever manner, must be 
set aside once it is realized that the world will hold no 
image at all for very long.37 Yet, Nietzsche says nothing
that would deter one from seeking to organize “a small 
portion of it oneself” (WP 585), which does not mean 
cultivating one’s own garden so much as knowing the 
limits to one’s actions, however ambitious. Further- 
more, there is a kind of freedom to be gained when 
one’s existence is detached from the narrative of 
progress: freedom from the past. If human history is a 
narrative of progress, then one’s fate is already 
scripted, in a sense, by what has come before; one is 
nothing more than “an angry spectator of all that is 
past” (Z 2:20). Pessimism, by freeing us from this 
script, simultaneously frees us from enslavement to the past.


The destruction of all things by time is not a judg- 
ment of their worth, as Anaximander maintained, but 
simply a condition of life and an opportunity to chart a 
personal course free of “the stone It was.” Nietzsche 
concludes, as a kind of modern version of Greek 
“cheerfulness,” that “the belief in time is good for one’s 
health (pessimists after all)” (KGW 7.1390). The 
constant transformation reminds us that our fate is not 
set. We have at least a role in determining it. The 
burden of the past is thus lessened, and the prospect of 
the future brightens. “The trust in life is gone: life itself 
has become a problem. Yet one should not jump to the 
conclusion that this necessarily makes one gloomy. 
Even love of life is still possible, only one loves 
differently” (GS, “Preface,” 3). Instead of being a 
creature of the past, one can be “a bridge to the future” 
(Z 2:20). Instead of valuing oneself for being part of a 
long chain of progress, one can value the fresh start 
that one makes of oneself. Instead of searching for 
transcendental meanings, one can “give the earth a 
meaning, a human meaning” (Z 1:22.2). 

Such a technique will not be to everyone’s taste nor 
within the ability of all. It is best suited to those 
Nietzsche calls “the most moderate”: “those who do 
not require any extreme articles of faith; those who not 
only concede but love a fair amount of accidents and 
nonsense; those who can think of man with a consid- 
erable reduction of his value without becoming small 
and weak on that account” (WP 55). These are the 
humans he considers “the strongest”—not those who 
can destroy the most, but those who can withstand the 
most destruction without giving way to pity and resig- 
nation. 


CONCLUSION 


A curious fact about Nietzsche’s writings on pessimism 
emerges from the quotations in the preceding sections. 
Nietzsche wrote on the topic throughout his career, but 
there is a remarkable concentration on the topic in the 


37 Such a desire is permitted so long as one does not regret the
fleeting character of the remaking (which obviously would alter the 
nature of the project). 
series of introductions prepared in 1886 for all his
pre-Zarathustra writings, which were then in the pro- 
cess of being republished. It is almost as if, looking 
back over the development of his thought, he identifies 
a guiding principle unseen hitherto in its entirety, even 
by himself, and desires to reemphasize it. He draws 
together his various poses into a unified attitude. Read 
together, these introductions (to The Birth of Tragedy, 
Untimely Meditations, Human, All-too-Human, Assorted 
Opinions and Maxims, and Daybreak) describe a pessi- 
mism that “has no fear of the fearful and questionable 
that characterizes all existence”: 


This has been my pessimistic perspective from the begin- 
ning—a novel perspective, is it not? a perspective that even 
today is still novel and strange? To this very moment I 
continue to adhere to it and, if you will believe me, just as 
much for myself as, occasionally at least, against myself.
. . . Do you want me to prove this to you? But what else
does this long preface—prove (AOM, “Preface,” 7)?


It would be a mistake to try to understand Dionysian 
pessimism by setting it against, say, utilitarianism or 
Kantian ethics and viewing it as a moral theory that 
prescribes a certain ideal set of behaviors and attempts 
to justify them to an audience of rational disputants. It 
is instead both a description of the irrational world in 
which we find ourselves and a prescription for coping 
with that situation. Nietzsche neither appeals to nor 
promises rationality and happiness in his attempts to 
defend his stance. It is better to consider Dionysian 
pessimism as an attitude and a practice that can guide 
us through the world, “a remedy and an aid in the 
service of growing and struggling life” (GS 370). This 
does not mean that the self is the sole object of 
philosophy or action, but it underscores the idea that 
we must recognize those limits of the human condition 
that optimists have been loath to acknowledge. Rather 
than suggest resignation, this pessimism encourages us 
to act while seeking to avoid the hubris so common to 
more systemic philosophies. 








38 Nietzsche is playing on the meaning of beweisen (prove). The root
weisen means “show,” so that beweisen can mean “show” in an
intransitive sense, that is, to “show oneself.”
upon national levels of life satisfaction. Specifically,


positively affected by the ideological complexion of governments and by qualitative features of the welfare 
state. The implications for social policy, electoral politics, and our theoretical understanding of life 


satisfaction are discussed. 


Proponents of democracy from Pericles to Robert
Dahl have stressed its potential for building a
society in which people enjoy better lives. The
promise of a better life is a powerful mechanism for
winning political support, and electoral competition in
modern democracies has come largely to embody
ideological disagreement over the best way to improve
quality of life. All shades of political philosophy, from
the social democratic Left to the libertarian Right,
claim that their approach will make life more agreeable.
As Esping-Andersen (1990, 1) comments, the
democratic state is of necessity “preoccupied with the
production and distribution of social well-being.” The
dominant mode in the liberal democracies may thus
profitably be conceived of as the politics of human
happiness.

It is hardly surprising that a principal concern of
empirical democratic theorists is the search for evidence
that politics matters in the lives of ordinary
people. Much has been learned about how politics
affects public policies, but less attention has been
devoted to identifying the sociopolitical factors that
determine the final variable of interest, viz. the general
level of subjective quality of life. The dominant theoretical
approaches to the subject deny that cross-national
differences in satisfaction can be attributed to
politics (e.g., Easterlin 1974; Inglehart 1990; for a
review, see Veenhoven 1996), but I will argue that
certain political outcomes—those that insulate citizens
against the worst forms of market dependence—dramatically
affect a nation’s general level of satisfaction
independent of economic or cultural conditions.


Shapiro, Mitch Sanders, Duane Swank, Jay Verkullen, Christina 
Wolbrecht, as well as the anonymous reviewers and the APSR editor. 


THE STUDY OF LIFE SATISFACTION 


Life satisfaction refers to the degree to which individ- 
uals evaluate positively the quality of their life in total. 
Most surveys ask respondents “how satisfied” they feel 
with their lives “in general.” It is generally agreed that 
this simple item performs as well as or better than more
complex formulations (e.g., Veenhoven 1993). It has 
become commonplace to compare mean levels of 
satisfaction across nations using this indicator (for 
reviews, see Veenhoven 1996, 1997a). 

Three broad approaches are applied to studying 
cross-national differences in satisfaction. The first is 
comparison theory, which is associated most strongly 
with Easterlin’s (1974, 1995) analyses of the relation- 
ship between economic development and national sat- 
isfaction. He argues that richer countries display no 
greater mean happiness than poor ones. This is be- 
cause individual assessments of one’s life are not 
absolute but relative to the “consumption norm.” 
Those above that standard are more happy, those 
below it less so. The overall level of satisfaction thus 
should tend to be neutral, in that the number of happy 
and unhappy people should be about the same. Any 
observed national differences would reflect only measurement
error.

The second approach, cultural theory, maintains that 
different countries manifest different levels of happi- 
ness because of powerful and enduring “national 
creeds” or “national character traits” (Inkeles 1997). 
Political interventions that affect objective living con- 
ditions are, as in comparison theory, deemphasized. 
Inglehart (1990, 30) suggests that international differ- 
ences in life satisfaction “reflect cognitive cultural 
norms, rather than individual grief and joy.” Unlike 
comparison theory, this approach admits the possibility 
of meaningful differences in satisfaction, but both deny 
that objective conditions play a significant part in 
determining those differences. Both give no role to 
politics, in the one case because positive political 
outcomes would move the consumption norm, leaving 
people no happier than before, and in the other case
because political interventions would not change 
deeply held cultural norms. 

The third approach is “livability” theory, which is 
predicated upon Maslow’s (1970) work on human 
needs, the most basic of which are physical and eco- 
nomic security. As Schyns (1998, 8) puts it, the “more 
needs are satisfied, the happier people will be.” Con- 
trary to the other two theories, subjective appreciation 
of life depends upon the “objective” quality of life. This 
approach is most fully developed by Veenhoven (e.g.,
1995), who notes it is merely the application of the 
“common sense” view that people will lead better lives 
when they live in better or more livable societies. The 
specification of what constitutes a livable society is not 
entirely clear, but the basic notion is plain enough: 
People will be happier in countries that do a better job 
of meeting the needs of the population. 

Livability theory predicts that satisfaction should 
vary in response to successful political strategies for 
meeting needs. The most obvious strategies are re- 
flected in the basic ideological approaches to the “good 
society”: The traditional Left supports a state guaran- 
tee of human needs, and the traditional Right favors 
reliance on the market. If these approaches contribute 
differently to the attainment of needs, then their insti- 
tutionalization should affect human happiness, on av- 
erage. Indeed, perhaps the most basic and compelling 
test for the relevance of democratic politics is whether 
subjective quality of life varies in response to which 
party controls government and the kinds of policies 
pursued. 

Most empirical research on national satisfaction 
focuses on Easterlin’s (1974) contention that happiness 
is not related to material affluence. Easterlin (1995) 
remains convinced of this, but some question how
strongly his data support his conclusions (e.g., Oswald
1997). Others show that a rise in standard of living
increases happiness (e.g., Blanchflower, Oswald, and 
Warr 1993; Diener et al. 1993; Veenhoven 1995). Quite 
clearly, mean levels of happiness vary positively with 
the level of national economic development, although 
perhaps with diminishing returns. At the individual 
level, life satisfaction varies directly with income in the 
United States and in every member state of the Euro- 
pean Union (Di Tella, MacCulloch, and Oswald 1997). 
The real question seems not to be whether affluence 
contributes to satisfied lives but how much. 

The cultural approach also has proven problematic. 
Any simplistic interpretation of national happiness as 
the manifestation of immutable cultural traits does not 
seem sustainable. Veenhoven’s (1994) devastating critique
of “trait theory” demonstrates as much. The
utility of national character is also indicted implicitly by 
cross-sectional studies that reveal the power of objec- 
tive living conditions as predictors of average satisfac- 
tion levels (e.g., Schyns 1998; Veenhoven 1995). On its 
face, the view that aggregate happiness is determined 
by unconsciously held collective norms or traditions 
would deny any such correspondence. Still, the more 
sophisticated version of trait theory associated with 
Inglehart (1990) is not so easily dispatched. Inglehart
argues that national differences in satisfaction reflect, 
in the main, accumulated national experiences that are 
absorbed in preadult socialization, but he clearly rec- 
ognizes that objective conditions affect aggregate levels 
of satisfaction, at least marginally. This opens the door 
to political interventions that affect material condi- 
tions. 

Livability theory is open to the possibility of cultural 
influences, which can be interpreted as additional 
factors that affect the tangible quality of life. Indeed, 
two of the strongest works in support of livability— 
Veenhoven (1997b) and Schyns (1998)—concede that
at least one aspect of culture affects subjective well- 
being. Both find that countries whose cultural norms 
are more “individualist” than “collectivist” show 
greater mean levels of happiness. The logic is that the 
former provide the independence to choose and 
achieve one’s goals. This result is confirmed by other 
work (e.g., Diener, Diener, and Diener 1995). Both 
cultural theorists such as Inglehart and needs theorists 
such as Veenhoven thus agree that culture and objec- 
tive conditions should affect the mean level of happi- 
ness, although they may disagree about the relative 
importance of each. 

There is little consensus about the determinants of 
national satisfaction. Advocates of livability theory 
offer plausible suggestions (see especially Schyns 1998; 
Veenhoven 1995), but this work is plagued by several 
shortcomings. The first is overly simple statistical tech- 
niques, as virtually all the relevant studies rely upon 
correlations. The second is the tendency to pool devel- 
oping and industrialized countries. It is not necessary 
to repeat familiar strictures on the selection of cases in 
comparative research, but consider the heterogeneity 
among countries so different in history, culture, poli- 
tics, and economics as Brazil, China, Mexico, and 
Nigeria. Without an extraordinarily well-developed 
theoretical structure, empirical findings based on such 
mixing will always be suspect. The third shortcoming is 
that much (although admittedly not all) of the evidence 
in support of livability vanishes when rich and poor 
countries are considered separately (e.g., Schyns 1998), 
which suggests that the more refined indicators of 
livability simply tap a developed versus developing 
distinction. Similarly, the evidence for livability tends 
to be weakened when controlling for economic development
(e.g., Veenhoven 1997a).

In sum, little is known about the political determi- 
nants of subjective well-being across nations. Strictly 
construed, comparison and trait theory suggest that 
cross-national differences either do not exist or are 
independent of political conditions. Livability theory 
and, to some extent at least, a more sophisticated 
version of trait theory suggest that differences do exist 
and are (at least partially) attributable to how well 
societies satisfy basic human needs, but there is little 
evidence that politics per se matters. I will argue that 
salient features of democratic politics profoundly affect 
the subjective quality of life in industrial democracies. 
It is widely agreed that the two social mechanisms
available for the generation and distribution of well- 
being are markets and politics (e.g., Lindblom 1977). In 
liberal democracies, the conflict between these systems 
of allocation typically is manifested in the struggle to 
adjust the perceived inequalities of market distribution 
with the presumed equality of citizenship rights, i.e., to
make “citizen entitlements... rather than the market 
contract” the basis of allocation (Esping-Andersen 
1985, 159). Those who favor entitlements may do so for 
a variety of reasons, but these ultimately reduce to the 
view that markets are “indifferent to the fate of indi- 
viduals” (Lane 1978). As I will argue, there is ample 
reason to believe that citizenship-rights will tend to 
produce greater average satisfaction than market dis- 
tribution. 

The ways in which the market contributes to unhap- 
piness can be tied to the generalized effects of capital- 
ism on the human personality (e.g., Lane 1978), but 
they are more commonly argued to result from the 
economic insecurity and personal loss of autonomy 
that accompany market economies. Echoing Marx, 
Polanyi (1944), and Lindblom (1977), among others,
Esping-Andersen (1990, 36) observes that, whatever 
the many positive and commendable aspects of a 
capitalist economy, “the market becomes to the worker 
a prison within which it is imperative to behave as a 
commodity in order to survive.” It is certainly the case 
that the great mass of people in the industrial world 
depend for their livelihood on the sale of their labor 
power as a commodity, and that the market for that 
commodity is characterized by uncertainty. 

As Lindblom (1977, 82) notes, “a pertinent objection 
to markets is that they foist insecurities on the popu- 
lation,” which become “all the more a problem when 
[one’s] livelihood is at stake.” So long as individuals 
depend upon the sale of their labor power in conditions 
of uncertainty, “they are captive to powers beyond 
their control,” such as business cycles, globalization, 
technological innovation, or other market vagaries 
(Esping-Andersen 1990, 37). Accordingly, I argue that 
the principal political determinant of subjective well- 
being is the extent to which a program of “emancipa- 
tion” from the market is institutionalized within a state. 
Put differently, life satisfaction should increase as we 
move from less to more social democratic welfare 
states. More generally, life satisfaction should vary 
positively with the dominance in government of polit- 
ical parties committed to the social democratic pro- 
gram of limiting human dependence on the market. 

Even the most ardent defenders of capitalism con- 
cede that insecurity is endemic in market economies. 
This translates into chronic psychological stress (e.g., 
Brenner 1977), which is clearly inimical to life satisfac- 
tion. To the extent that a society insulates people from 
market dependence, it insulates them from the stress 
and anxiety of the market. Furthermore, as economists 
since Adam Smith have recognized, markets contribute 
to inequality. The social democratic project again 
facilitates well-being by mitigating the worst effects of 
inequality through the idea of social citizenship: The
state guarantees a basic and ideally middle-class level 
of need satisfaction. 

The political program of the Left attempts to “mar- 
ginalize the market as the principal agent of distribu- 
tion and the chief determinant of peoples’ life chances” 
(Esping-Andersen 1985, 245). As classically articulated 
by Marshall (1950), individuals have claims on the
product of society because they are citizens. That 
status, rather than one’s class position or income, 
should be the principle that guides the distribution of 
well-being. The state intervenes to limit the importance 
of markets in providing the things most people find 
necessary for a satisfactory quality of life. To the extent 
such social rights evolve, individuals should be happier, 
on average. Put another way, if market independence 
contributes to subjective well-being, then the expansive 
“emancipating” welfare state associated with the social 
democratic model should produce a higher general 
level of life satisfaction. 

Because leftist parties are one, if not the, principal 
advocate for the social democratic welfare state, con- 
trol of government by such parties should show the 
same relationship with life satisfaction. Three strands 
of research support this contention. One documents 
the importance of leftist party influence in government 
for affecting the general level of welfare spending (e.g., 
Hicks and Misra 1993). Another considers how leftist 
parties affect social citizenship per se (e.g., Korpi 
1989). A third argues that representation of the Left in
government is the principal factor in support of a social
democratic welfare state regime (e.g., Esping-Andersen
1990). More generally, leftist governments
can replace the market with political outcomes, given 
that their tenure in office “opens up the possibility for 
major interventions in economy and society” (Huber, 
Ragin, and Stephens 1993, 720). In sum, leftist incum- 
bency is associated with a variety of “socialist regime 
attributes” (p. 740). 


ANALYSIS 


My basic contentions about the primacy of politics are 
tested with data from the 1990 wave of the World 
Values Survey.2 Life Satisfaction is assessed with the
standard question: “All things considered, how satis- 
fied are you with your life now?” I analyze these data in 
three separate ways. First, following the convention of 
the literature, satisfaction is aggregated to the level of 
national means; my purpose is to identify country-level 
factors that affect average happiness. Second, I exam- 
ine individual survey data in order to model satisfaction 
as a function of both individual- and national-level 


2 Countries included are Austria, Belgium, Canada, Denmark, Finland,
France, Germany, Great Britain, Ireland, Italy, Japan, the
Netherlands, Norway, Sweden, and the United States. National
samples average about 1,600 respondents; with the exception of
Finland (n = 588), all nations have a sample larger than 1,000. I
limit attention to the industrial democracies to avoid any
problems due to pooling developed and developing countries. Missing
data on either the dependent or (especially) independent variables
would exclude nearly all the latter in any case.
variables. Third, in a hybrid of these approaches, I
estimate the effect of the political factors by determin- 
ing their influence on the share of national satisfaction 
that cannot be attributed to individual-level variables. 
I will introduce the variables as I proceed. Further
details regarding their coding and definition, as well as 
the data themselves, are provided in the Appendix. 


Aggregate Analysis of National Means 


For the aggregate data, I use the three control variables 
common to the literature. Two are economic: the rate 
of Unemployment (average 1955-90, data from Hall 
and Franzese 1998) and real per-capita GDP in 1990 
purchasing power parity (from the Penn World Table, 
version 5.6). The third is a measure of the Individual- 
ism of national cultures devised by Triandis (1989, 
1995); data are reported in Diener, Diener, and Diener 
(1995). It ranges from 0-10, with higher scores indi- 
cating greater individualism. 

To account for the extent to which social rights 
replace markets, I use three different operation- 
alizations. Two relate to general welfare state charac- 
teristics, and the third measures the relative strength of 
political parties of Left, Right, and center in national 
governments.

Perhaps the best and certainly best known indicators 
of the extent to which states protect against market 
dependence are suggested by Esping-Andersen (1990). 
First, he provides a set of three variables that tap how 
much a national system embodies elements of his three 
ideal types of welfare regime: Liberal, Conservative, and 
Socialist. The liberal type “effectively contains the 
realm of social rights” with modest and often means-
tested programs. Conservative regimes may provide
more generous benefits but in the end have a “negligi-
ble” redistributive effect, which reflects an emphasis on
“the preservation of status differentials.” The socialist 
type is predicated explicitly upon the substitution of 
entitlements for markets. This vision of the welfare 
state promotes “an equality of the highest stan- 
dards . . . by guaranteeing . . . [everyone] full participa- 
tion in the quality of rights enjoyed by the better-off” 
(Esping-Andersen 1990, 27). The market is crowded 
out in favor of state-provided social rights. 

These are ideal types, and no country fits neatly into 
any one category. In practice, states combine elements 
from each. For that reason, Esping-Andersen (1990, 
Table 3.3) provides three indices, which I use to assess
the amount of socialist, conservative, and liberal ele-
ments in a given policy regime. Each country is scored
on each variable; the three together represent the 
particular mix of these attributes in its policies. Higher 
values indicate more of the qualities of the particular 
ideal type (0-8 for socialist and conservative, 0-12 for
liberal). Because socialist attributes are most protective
against market dependence, I expect subjective quality 
of life to vary directly with the socialist variable. The 
liberal welfare state depends most strongly on the 
market, so it should be negatively associated with 
subjective well-being. The direction of the conservative 
regime effect is difficult to predict, but it should be
small whatever the direction. 

Esping-Andersen (1990, Table 2.2) provides an al-
ternative operationalization of what he calls “market
independence” in the form of an index of the Decom- 
modification of labor. Labor is decommodified to “the 
degree to which individuals, or families, can uphold a 
socially acceptable standard of living independent of 
market participation” (p. 37). Briefly, it is a summary 
index that assesses the extent of “emancipation from 
market dependency” in three specific domains: pen- 
sions, income maintenance for the ill or disabled, and 
unemployment benefits. Higher scores represent 
greater decommodification (the cumulative index 
ranges from 13.8 to 39.1). For reasons stated previ- 
ously, decommodification should contribute to quality 
of life. 

These two indicators focus explicitly on welfare state 
regimes per se, but it also may be instructive to 
consider the relative strength of parties. I assume that 
the ideology of a government is a function of the 
ideology of the parties that comprise it. By extension, 
the policy regime that citizens face at any given time is 
approximated by the cumulative governmental tenure 
of parties in office to that point. Because leftist parties 
generally advocate and rightist parties typically oppose 
social rights, the relative balance between Right and 
Left accumulated over time should reflect the extent to 
which a policy regime approximates the social demo- 
cratic ideal. The predominance of the leftist approach 
should produce greater security and thus greater life 
satisfaction. I measure Left Dominance as the cumula- 
tive portion of leftist cabinet seats less the cumulative 
portion of rightist cabinet seats (from 1950 to 1990; 
data from Huber, Ragin, and Stephens 1997).3 Al- 
though party ideologies are always in flux, general party 
identities have remained stable enough since World 
War I to make this measure useful.4
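
As an illustration only, the Left Dominance score described above can be sketched in a few lines of Python. The function name, the per-year input format, and the sample values are assumptions made for this sketch; they are not drawn from the Huber, Ragin, and Stephens data.

```python
def left_dominance(cabinet_years):
    """Cumulative leftist cabinet share minus cumulative rightist share.

    cabinet_years: iterable of (left_share, right_share) pairs, one per
    year, each share being the fraction of cabinet seats (0.0 to 1.0).
    The input layout is hypothetical; the article's coding is in its Appendix.
    """
    cumulative_left = sum(left for left, _ in cabinet_years)
    cumulative_right = sum(right for _, right in cabinet_years)
    return cumulative_left - cumulative_right


# Invented three-year record for an imaginary country (not real data).
example_years = [(1.0, 0.0), (0.5, 0.25), (0.0, 1.0)]
example_score = left_dominance(example_years)
```

A positive score indicates that, over the period, the Left held more cumulative cabinet weight than the Right; centrist seats enter neither sum, which is why the article treats them as a separate variable.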

The effect of centrist parties is less clear. These often 
are considered less hostile to the welfare state than is 
the Right, although they favor policies that tend to be 
far more market conforming than those advocated by 
the Left. Their effect on quality of life is likely to be 
minimal. To account for any effect, I include in the 
party control model the cumulative cabinet seats for 
the Center as a separate variable. 

I rely upon the following estimation strategy. The 
national level of satisfaction is modeled provisionally as 
a function of the two economic variables, the culture 
term, and one of the market independence variables. 
Because the number of independent variables is rather 
large relative to the number of cases, and because some 
models have coefficients smaller than their standard





3 Details regarding the coding of data are provided in the Appendix.
4 Despite significant moderation among leftists (e.g., the French
Socialists), the Left is still identified with a program of protection
from the market. Empirical comparison of ideology in the 1980s with
the 1990s shows remarkable stability (Huber and Inglehart 1995, 79).
Recent developments (such as “New Labour” in Britain or the
moderation of the German SDP) are more problematic, but these
cases are fortunately beyond the temporal limits of the present
analysis.
errors, I reestimate reduced models that delete the
terms, for any given equation, with t-scores of less than
unity in absolute value. These results are reported in
Table 1.5

Various tests for robustness were performed. Chief
among these was biweight robust regression, a tech-
nique especially appropriate for demonstrating stability
in small samples. I also bootstrapped (with 1,000
repetitions), jackknifed, and compared the results after
removing the small number of potentially influential
data points that were identified through an inspection
of the dfbetas. Except as noted, none suggested any
appreciable differences in substantive conclusions.6

For regime attributes (column 1), the socialist vari-
able is, as predicted, significant and positive. Also as
expected, the liberal variable is negative and signifi-
cant, and the conservative is insignificant. The more
socialist and the less liberal a welfare state regime, the
more satisfied with their lives citizens tend to be. The
procedures for assessing stability uniformly support a
significant socialist variable, although the liberalism
result is somewhat less robust.7


TABLE 1. Political Determinants of Mean Life Satisfaction

                    Regime Attributes   Decommodification   Party Control
Socialist           .109** (.029)       n/a                 n/a
Liberal             -.046* (.024)       n/a                 n/a
Conservative        -.034 (.024)        n/a                 n/a
Decommodification   n/a                 .056** (.018)       n/a
Left Dominance      n/a                 n/a                 .017* (.005)
Individualism       n/a                 .086 (.074)         n/a
GDP                 .087* (.043)        .084 (.088)         .098* (.047)
Unemployment        -.211*** (.037)     -.242*** (.067)     -.191** (.045)
Constant            5.33** (.683)       3.01* (1.27)        5.20** (.750)
Adj. R²             .829                .517                .578
N                   15                  15                  15

Note: Entries are unstandardized regression coefficients. Standard errors are in parentheses. *p ≤ .05, **p ≤ .01, ***p ≤ .001.


5 I attempted to validate these results by substituting for life satis-
faction either of two alternative measures of subjective well-being:
the variable tapping “happiness” or an index of variables measuring
“positive affect” (e.g., “feeling on top of the world”). The results are
similar, if slightly less robust. Life satisfaction is the preferred
formulation, as it avoids the ambiguity of happiness as well as the
transitory character of affective states (e.g., Veenhoven 1997b).

6 The Cook-Weisberg test indicated no heteroskedasticity in any of
the models. I experimented with two other aggregate characteristics
that might affect quality of life: average union density 1955-90 (Hall


either of these terms was included in the models in Table 1, results
were much as before, although decommodification and Left domi-
nance became insignificant when union density was added. This is
likely due to multicollinearity, given the high correlations between
these three variables. Variables for the number of years of continu-
ous democracy from 1900 to 1990 (Inglehart 1990) or for population
size or density had no effect.
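
The reduction rule behind Table 1 (estimate a full model, drop every term whose t-score is below one in absolute value, then reestimate) can be sketched as follows. This is a minimal illustration under stated assumptions: the function name and the input format are hypothetical, the estimates shown are invented rather than taken from the article, and keeping the constant regardless of its t-score is an assumption of this sketch.

```python
def terms_to_keep(estimates, threshold=1.0, always_keep=("Constant",)):
    """Return the names of terms that survive the |t| >= threshold rule.

    estimates: dict mapping term name -> (coefficient, standard_error)
    from a provisional full model. Terms in always_keep are retained
    whatever their t-score (an assumption of this sketch).
    """
    kept = []
    for name, (coef, std_err) in estimates.items():
        t_score = coef / std_err
        if name in always_keep or abs(t_score) >= threshold:
            kept.append(name)
    return kept


# Invented full-model estimates for illustration (not the article's values).
full_model = {
    "Socialist": (0.11, 0.03),      # t is about 3.7, retained
    "Individualism": (0.02, 0.05),  # t is 0.4, dropped
    "GDP": (0.09, 0.04),            # t is 2.25, retained
    "Constant": (5.3, 0.7),
}
reduced_terms = terms_to_keep(full_model)
```

The reduced model would then be reestimated using only the retained terms, which is why some cells in Table 1 are reported as n/a.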

Decommodification also is significant and of the 
expected sign (column 2). The greater the decommodi- 
fication accomplished by state policy, the greater is the 
social level of satisfaction. 

The difference between Left and Right control of 
government (Left dominance) is also positive and 
significant (column 3). The more time in power for the 
Left relative to the Right, the happier are citizens. As 
predicted, centrist control appears to have little effect. 

Perhaps the best way to interpret the magnitude of 
the results is to compute the change in the dependent 
variable, expressed in the number of standard devia- 
tions, that would be predicted by moving from the 
lowest to the highest observed value in the independent 
variables (see Table A-1). For example, decommodifi- 
cation ranges from 39.1 (Sweden) to 13.8 (United 
States), a difference of 25.3. Given that the coefficient 
of decommodification is .056, this suggests a difference 
of about 1.42 in satisfaction. As the life satisfaction
variable has (for the countries in the analysis) a stan-
dard deviation of .51, the predicted difference when
moving from the lowest to highest levels of decom- 
modification is a remarkable 2.8 standard deviations in 
life satisfaction. Applying the same methodology sug- 
gests a predicted change in satisfaction of 1.7 standard 
deviations for the socialist variable, 1.1 for the liberal 
variable, and 2.7 for Left dominance. 
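
The decommodification calculation in the preceding paragraph can be reproduced directly. The sketch below uses only the figures quoted in the text (coefficient .056, range 13.8 to 39.1, standard deviation .51); the function name is an assumption introduced for illustration.

```python
def range_effect(coefficient, x_max, x_min, sd_outcome):
    """Predicted change in the outcome when the predictor moves from its
    lowest to its highest observed value, in raw units and in standard
    deviations of the outcome."""
    raw_change = coefficient * (x_max - x_min)
    return raw_change, raw_change / sd_outcome


# Values quoted in the text for decommodification (Sweden 39.1, United States 13.8).
raw_change, change_in_sd = range_effect(0.056, 39.1, 13.8, 0.51)
# raw_change rounds to 1.42 and change_in_sd rounds to 2.8, as reported.
```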


Analysis of Individual-Level Data 
The general focus in the comparative literature is on 
national aggregates, but it is also possible to study the 


7 In particular, robust regression reduces the liberalism coefficient to
insignificance (although it retains the correct sign).
phenomenon at the individual level. The scant research
of this type consistently shows that the same basic 
characteristics tend to affect individuals similarly across 
industrialized countries (see especially, Di Tella, Mac- 
Culloch, and Oswald 1997). The basic determinants 
relate to simple demographics—such as gender, age, 
income, and education—and social connectedness 
(e.g., Lane 2000). Using appropriate variables from the 
World Values Survey, I provisionally treat life satisfac- 
tion (measured on a ten-point scale in ascending order) 
as a function of Gender, Age, household Income, Edu- 
cation, whether the chief wage earner is Unemployed, 
whether the respondent is Married (or living as mar- 
ried), number of Children, respondent’s appraisal of 
the quality of their Home Life, and frequency of Church 
Attendance. The results reported presently are remark- 
ably stable in the face of reasonable changes in the 
manner in which these variables are operationalized8
or the addition of other individual-level variables.9 I
also add to the right-hand side of the equation the 
same national-level variables as above: per-capita 
GDP, aggregate unemployment, individualism, and the 
various political factors. 

Estimation is with Huber-White robust standard 
errors, correcting for the pooled structure of the data 
(i.e., country clusters). This procedure yields estimates
that are robust to both between-country heteroskedas-
ticity and within-country correlation (i.e., robust to
error terms being neither identically distributed nor
independent).10 The data are weighted to take into
account different sample sizes across countries.11
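
For readers who want to see the mechanics, a cluster-robust (Huber-White) variance estimate can be sketched for the simplest case of one regressor plus an intercept. This is a hedged illustration, not the article's estimation code: the function name and the toy data are invented, sampling weights are ignored, and no finite-sample correction is applied (statistical packages typically add one).

```python
import math


def ols_with_cluster_se(x, y, clusters):
    """OLS of y on [1, x] with cluster-robust standard errors.

    Sandwich form: (X'X)^-1 * sum_g (X_g' u_g)(X_g' u_g)' * (X'X)^-1,
    where g indexes clusters and u are OLS residuals.
    """
    n = len(y)
    sx, sy = sum(x), sum(y)
    sxx = sum(v * v for v in x)
    sxy = sum(a * b for a, b in zip(x, y))
    det = n * sxx - sx * sx
    # Inverse of X'X = [[n, sx], [sx, sxx]].
    inv = [[sxx / det, -sx / det], [-sx / det, n / det]]
    b0 = inv[0][0] * sy + inv[0][1] * sxy
    b1 = inv[1][0] * sy + inv[1][1] * sxy
    # Sum the score vector x_i * u_i within each cluster.
    scores = {}
    for xi, yi, g in zip(x, y, clusters):
        resid = yi - b0 - b1 * xi
        s = scores.setdefault(g, [0.0, 0.0])
        s[0] += resid
        s[1] += xi * resid
    meat = [[sum(s[i] * s[j] for s in scores.values()) for j in range(2)]
            for i in range(2)]
    half = [[sum(inv[i][k] * meat[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]
    cov = [[sum(half[i][k] * inv[k][j] for k in range(2)) for j in range(2)]
           for i in range(2)]
    return (b0, b1), (math.sqrt(cov[0][0]), math.sqrt(cov[1][1]))


# Invented data: six respondents in three hypothetical country clusters.
x_demo = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
y_demo = [1.1, 2.9, 5.2, 6.8, 9.1, 10.9]
g_demo = ["a", "a", "b", "b", "c", "c"]
coefs, std_errs = ols_with_cluster_se(x_demo, y_demo, g_demo)
```

Because residuals are summed within each cluster before the outer products are formed, correlated errors inside a country widen the standard errors instead of being treated as independent information.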





8 Results are consistent if any or all of these changes are made:
married from a dummy to three dummies (1 = yes) for married,
widowed, and divorced; a dummy for any children instead of number
of children; age as a pair of dummies (18-24 and 25-54, leaving 55
and older as the reference group), to account for some evidence that
both the young and the old are happier than the middle category;
church attendance from a dummy to a seven-category scale, in
descending order from “more than once a week” to “practically
never.”

9 Adding a dummy for members of a nonmajority racial/ethnic group
(from v369) has no effect. Adding dummies for Catholic, mainline
Protestant, Fundamentalist, and Buddhist religious denominations
(from v144) also changes little, as does the inclusion of a variable for
self-reported health (v83). I also experimented with dummies for
belonging to arts/cultural groups (v21) or sporting groups (v29) as
further evidence of social connectedness, but their inclusion had no
effect. Including variables for importance of various domains of life
(v49) had little effect; only family (v5) and friends (v6) were
significant and of the expected sign, which implies that less impor-
tance of family and friends is associated with less satisfaction.

10 Because the response set for life satisfaction has a wide range
(1-10), and the wording of the question asks for a numeric score
rather than a verbal ranking (of the “somewhat satisfied,” “very
satisfied” variety), I assume a constant distance between response
categories and treat the dependent variable as interval rather than
ordinal (e.g., Headey 1993). Note that the estimator is asymptotic in
the number of clusters.

11 Results are unchanged when not weighted. These data also
provide an opportunity to confirm that the results for satisfaction do
not reflect a wider attitudinal structure (personality) that should be
independent of political factors. The results for satisfaction can only
be strengthened if we find no relationship when considering other
dependent variables for which we have no theoretical expectation of
political effects. Obvious candidates are job satisfaction (v116),
loneliness (v187), boredom (v89), or any of a battery of questions
(v326-v333) assessing self-confidence. Substituting any of these for




December 2001 


Results are reported in Table 2. Considering each of
the different operationalizations of market indepen- 
dence, the coefficients for decommodification, socialist 
and liberal regime, and Left dominance are all signif-
icant and of the expected sign in their respective
equations. The one notable difference between these 
and the earlier results involves individualism: It was not 
significant before but now proves to be (and is of the 
predicted sign) in each equation. The implication is 
that citizens are more satisfied in more individualistic 
cultures. Since socialist countries are usually consid- 
ered collectivist, this finding seems to contradict the 
view that a welfare state should produce greater satis- 
faction. An inspection of the values of the individual- 
ism variable (see Table A-1), however, suggests that 
while Scandinavia is slightly less individualistic than, 
say, the United States or Canada, it is more individu- 
alistic than France, Ireland, Italy, or Japan. In any case, 
the purpose of multivariate analysis is to separate these 
issues. Satisfaction may be influenced by a country’s 
level of both individualism and social democracy. 

These data lend themselves to a test of the hypoth- 
esis that the variables for market independence have 
the greatest effect on people most in need of support
against market vagaries. This does not necessarily 
mean that only individuals with lower income find their 
lives improved by citizenship rights, merely that they 
find life relatively more improved than do those with 
higher income. The simplest and most intuitive test is 
to divide the sample into high- and low-income groups. 
The results, provided in Table 3, are consistent with 
expectations. 

Decommodification is significant for both income 
categories, although its magnitude is about 50% 
greater for those with lower income. Left dominance is 
twice the magnitude for those with lower compared to 
higher income (insignificant for the latter). Socialist 
regime attributes are significant for the lower group 
but somewhat smaller and insignificant for the upper 
group. The liberal variable, as expected, is negative and 
significant for those with a smaller income, and it is 
halved in magnitude and reduced to insignificance for 
the wealthier group. This suggests that the deleterious 
effects of liberal welfare states are concentrated on 
those with less income. The evidence is clear that those 
with modest income are most affected by social democ- 
racy, although there is some evidence that wealthier 
citizens also benefit, if somewhat less. 


Analysis of National Factors Net of 
Individual-Level Determinants


Another approach for estimating the effect of political 
variables is to focus on that portion of the national 
variance in satisfaction unexplained by purely individ- 





life satisfaction in the results reported below produced political
variables with coefficients that vary wildly in sign and are always
insignificant. Similarly, appraisal of one’s “home life” (v180) is totally
unaffected by any of the political variables.

Conservative              .009 (.011)
GDP                       .036 (.020)
Unemployment              -.034 (.019)
Individualism             .048* (.020)
Married                   .044 (.080)
Gender                    .038 (.031)
Education                 .008 (.007)
Age                       -.022 (.015)
Children                  -.006 (.012)
Home Life                 .504** (.042)
Income                    .046*** (.009)
Unemployed Chief Earner   -.258** (.040)
Church Attendance         .134*** (.040)
Constant                  2.18** (.447)
Adj. R²                   .305
N                         16,103





ual-level characteristics. The procedure is to regress 
satisfaction on all the individual-level variables, com- 
pute the mean residuals for each country, and use 
those values as the dependent variable in further 
analysis. Di Tella, MacCulloch, and Oswald (1997) 
refer to this kind of residual life satisfaction as a 
country’s “pure” satisfaction, meaning the amount of 
satisfaction that can be attributed to national charac- 
teristics per se. This overstates the case somewhat, 
since politics can influence at least some variables (e.g., 
income) that are potentially affected by government 
policy. Still, the procedure is useful because it is highly 
conservative. By assigning as much variance as possible 
to individual-level factors that are assumed to be 
apolitical, we raise the bar for showing that politics 
indeed affects average levels of satisfaction. 
Therefore, I estimate satisfaction using the individ- 
ual-level variables reported in Table 2 and then com- 
pute average residuals for each country. These are used 
to replicate the basic national-level analyses reported 
previously. The modeling strategy is as before: a full 
model with all variables (details not shown) followed by 
a reduced model that eliminates terms with |t| < 1.
Robust regression and bootstrapping are used to verify 
robustness. Results are provided in Table 4. Decom- 
modification and Left dominance are significant and 


TABLE 2. Effects on Life Satisfaction (Individual-Level Data)


Regime Attributes
Socialist                 .026* (.010)
Liberal                   -.028* (.011)


Note: Entries are unstandardized regression coefficients. Country-clustered robust standard errors are in parentheses. *p ≤ .05, **p ≤ .01, ***p ≤ .001.









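                          Decommodification   Party Control
Socialist                 n/a                 n/a
Liberal                   n/a                 n/a


.001 (.003)


GDP                       .016 (.020)         .015 (.017)
Unemployment              -.032 (.026)        -.005 (.040)
Individualism             .061** (.027)       .044* (.022)
Married                   .040 (.078)         .039 (.079)
Gender                    .034 (.031)         .037 (.030)
Education                 .007 (.007)         .010 (.008)
Age                       -.023 (.015)        -.018 (.015)
Children                  -.007 (.012)        -.005 (.012)
Home Life                 .506** (.052)       .507** (.050)
Income                    .047** (.008)       .049** (.008)
Unemployed Chief Earner   -.257** (.039)      -.242** (.038)
Church Attendance         .139** (.038)       .116*** (.034)
Constant                  1.82 (.382)         2.53** (.202)
Adj. R²                   .303                .303
N                         16,103              16,103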


have the expected sign (the magnitude of the coeffi- 
cients is smaller, because the dependent variable, now 
a residual, varies over a smaller nominal range).12 The
socialist and liberal variables are not significant, al- 
though the probability of the coefficients being due to 
chance remains less than .10. Finally, individualism is 
again significant and has the expected sign in each of 
the models. 
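
The residual procedure used in this subsection (regress satisfaction on individual-level variables, then average the residuals within each country) can be sketched as below. This is a simplified illustration under stated assumptions: it uses a single individual-level predictor instead of the full set in Table 2, the function name is hypothetical, and the data are invented.

```python
from collections import defaultdict


def mean_residual_by_country(rows):
    """rows: list of (country, x, y) with one individual-level predictor x.

    Fits pooled OLS of y on x, then returns the mean residual per country,
    i.e., the share of satisfaction not attributed to the individual-level
    predictor.
    """
    n = len(rows)
    mean_x = sum(r[1] for r in rows) / n
    mean_y = sum(r[2] for r in rows) / n
    sxx = sum((r[1] - mean_x) ** 2 for r in rows)
    sxy = sum((r[1] - mean_x) * (r[2] - mean_y) for r in rows)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    totals = defaultdict(lambda: [0.0, 0])
    for country, x, y in rows:
        totals[country][0] += y - (intercept + slope * x)
        totals[country][1] += 1
    return {c: total / count for c, (total, count) in totals.items()}


# Invented respondents in two imaginary countries (not survey data).
toy_rows = [("A", 1.0, 5.0), ("A", 2.0, 6.0), ("B", 1.0, 7.0), ("B", 2.0, 8.0)]
residual_means = mean_residual_by_country(toy_rows)
```

The country means returned here play the role of the dependent variable in the national-level models; the actual analysis would condition on all of the individual-level variables at once.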


Summary 


It may be instructive to end this section with a visual 
analysis of a simple bivariate relationship. This kind of 
analysis is seldom enlightening when dealing with 
complex phenomena, but in this case it provides a 
useful summary of the arguments. If we consider the 
“pure” satisfaction data, “cleansed” of the individual- 
level determinants of quality of life, we find relatively 
straightforward bivariate relationships with the politi- 
cal variables. Figure 1 plots residual satisfaction with 
Left dominance. The countries fall reasonably cleanly 


12 Results are unchanged if the median rather than the mean residual 
is considered (as expected, grven that the two correlate at about .95). 


along the bivariate regression line (Left dominance is 
significant at the .01 level, t = 3.7). France and Japan
are outliers, falling below their predicted values. The 
principal reason is the one other variable that was 


TABLE 3. Effect of Political Variables on Life Satisfaction by Income Groups

                          Regime Attributes                 Decommodification                 Party Control
                          Low Income       High Income      Low Income       High Income      Low Income       High Income
Socialist                 .027* (.013)     .025 (.017)      n/a              n/a              n/a              n/a
Liberal                   -.039 (.014)     -.016 (.013)     n/a              n/a              n/a              n/a
Conservative              -.005 (.012)     .022 (.014)      n/a              n/a              n/a              n/a
Decommodification         n/a              n/a              .022* (.006)     .015* (.007)     n/a              n/a
Left Dominance            n/a              n/a              n/a              n/a              .008*** (.002)   .004 (.003)
Center                    n/a              n/a              n/a              n/a              -.000 (.004)     .005 (.005)
GDP                       .000 (.000)      .000 (.000)      .000 (.000)      .000 (.000)      .000 (.000)      .000 (.000)
Unemployment              .031 (.022)      .034 (.022)      .034 (.031)      .033 (.026)      .008 (.048)      -.010 (.043)
Individualism             .044* (.020)     .052* (.024)     .068 (.021)      .052 (.038)      .046 (.031)      .042 (.026)
Married                   .022 (.073)      .071 (.089)      .025 (.077)      .074 (.088)      .020 (.076)      .082 (.088)
Gender                    .057 (.047)      .015 (.043)      .050 (.047)      .013 (.043)      .053 (.047)      .015 (.043)
Education                 .002 (.007)      .009 (.008)      .003 (.008)      .008 (.008)      .006 (.008)      .008 (.007)
Age                       -.029* (.019)    -.013* (.022)    -.028 (.020)     -.014 (.021)     -.026 (.020)     -.013 (.022)
Children                  -.014 (.019)     .004 (.010)      -.013 (.020)     .002 (.008)      -.012 (.019)     .002 (.009)
Home Life                 .512** (.057)    .492*** (.046)   .516*** (.056)   .491* (.046)     .516*** (.054)   .494*** (.045)
Income                    .059* (.017)     .035* (.010)     .053** (.018)    .036* (.011)     .053* (.018)     .039*** (.010)
Unemployed Chief Earner   -.289** (.028)   -.146 (.091)     -.289*** (.029)  -.147 (.083)     -.270*** (.027)  -.153 (.096)
Church Attendance         .159* (.084)     .089* (.048)     .163* (.081)     .095* (.048)     .135* (.074)     .080 (.047)
Constant                  2.245** (.462)   2.35** (.426)    1.505 (.373)     2.273 (.538)     2.318** (.226)   3.001** (.286)
Adj. R²                   .297             .293             .298             .293             .296             .293
N                         8,948            7,155            8,948            7,155            8,948            7,155

Note: Entries are unstandardized regression coefficients. Country-clustered robust standard errors are in parentheses. *p ≤ .05, **p ≤ .01, ***p ≤ .001.
consistently significant in Table 4: individualism. Ja-
pan has the single lowest score on that measure, and


France scores only slightly higher. Similarly, the United
States, with the highest value on individualism, is well 


TABLE 4. Political Determinants of Residual Mean Life Satisfaction

                    Regime Attributes   Decommodification   Party Control
Socialist           .032 (.019)         n/a
Liberal             -.022 (.013)        n/a
Decommodification   n/a                 .022** (.007)
Left Dominance      n/a                 n/a
Individualism       .058* (.026)        .070** (.025)
Unemployment        .035 (.021)         .049* (.024)
Constant            -.594* (.204)       -1.37*** (.253)
Adj. R²             .533                .481
N                   15                  15

Note: Entries are unstandardized regression coefficients. Standard errors are in parentheses. *p ≤ .05, **p ≤ .01, ***p ≤ .001.
above the regression line.13 The other political vari-
ables (not shown) reveal a pattern similar to the one
displayed in Figure 1. The plots provide an intuitive 
view of the analysis and indicate not only a clear 
relationship between market independence and life 
satisfaction but also nontrivial deviations due to the 
influence of culture. 


DISCUSSION 


The results of ideological competition have dramatic 
effects on national levels of life satisfaction. Subjective 
evaluation of life quality is positively affected by gov- 
ernments of the Left. It is also enhanced by the extent 
to which states reduce market dependency through 
decommodification of labor and, in general, adopt a 
social democratic welfare regime. Several implications 
follow.

The most obvious is the practical implication for 
electoral politics and social policy. Social democracy, 
whatever its failings, does appear to increase mean 
levels of subjective well-being. Similarly, the present 
analysis also speaks to the perennial debate over the 
desirability of the expansive welfare state. Social dem-
ocrats view the welfare state as a mechanism for


13 Left dominance remains significant after removing either the three
Scandinavian countries (t = 2.31) or both France and Japan (t =
2.34). The results thus cannot be attributed simply to the differences
between extreme scores on residual satisfaction. The basic results

(Table 1) are also similar after


FIGURE 1. Residual Life Satisfaction and Leftist Party Dominance (horizontal axis: Left Dominance)


empowering the working class. Before workers can 
engage in collective action, they must have a basic level 
of security outside the market. Nonsocialist progres-
sives and various liberal reformers also look to the
welfare state as a way to correct market failures. 
Whether driven by a sense of social justice or the
simple desire to make capitalism more self-sustaining, 
they too see the welfare state as a mechanism for 
improving people’s lives. 

Conversely, many conservatives and classical liberals 
decry the deleterious effects of “socialism” on the 
positive and empowering institution of the market. 
Cradle-to-grave protection may be well intended, it is 
argued, but the net effect is to breed dependency and 
complacency, with predictable consequences for peo- 
ple’s self-respect and autonomy. Individuals find them- 
selves alienated and powerless in the face of an intru- 
sive governmental bureaucracy. In this view, the 
welfare state worsens rather than betters lives. 

The evidence presented here does not resolve these 
disagreements. While the fact that the welfare state 
and leftist governments apparently make people (on 
average) more satisfied with their life would seem of 
some moment, it by no means settles the debate. Any 
assessment of the value of the welfare state is ulti- 
mately both normative and ideological; it is unlikely to 
be resolved by any empirical evidence.14


14 A possible counterargument is that the countries my analysis




Politics, Markets, and Life Satisfaction: The Political Economy of Human Happiness 


Three notes of caution are in order. The analysis 
covers only 15 countries, and such a modest sample 
imposes obvious econometric limitations. I have been
at pains to demonstrate that the results are, on the 
whole, robust, but this issue cannot be summarily 
dismissed. Equally important is the range of variation 
in satisfaction with life represented in the sample. I 
focused purposely on advanced industrial countries 
with established democratic regimes in a period of 
peace and relative prosperity. A broader sample of 
economic and political conditions might well point 
toward independent variables other than those that 
proved important here. 

Finally, the positive, direct effects of the welfare 
state and the Left on quality of life could be discounted 
by any indirect negative effects they may have on 
economic outcomes. I address this issue below. 


ON THE ENDOGENEITY OF POLITICS AND
ECONOMICS 


I have attempted to demonstrate that an expansive 
welfare state and leftist control of government contrib- 
ute to life satisfaction, but common sense points to 
another obvious determinant—the state of the econ- 
omy. The reported models do not, however, consider 
the possibility that economic conditions may them- 
selves be partially determined by the same political 
factors that affect satisfaction. In other words, the 
welfare state may have indirect negative effects on the 
economy, so that the net benefits could be zero or even 
negative. Similarly, modest welfare states (or domi- 
nance by conservative parties) may have strongly pos- 
itive but indirect consequences for happiness due to 
the greater economic prosperity they conceivably may 
foster. The ideal solution would be to model how 
politics affects both life satisfaction and economic 
performance. 

Although I acknowledge the possibility of indirect 
effects via economic performance, there is reason to 
believe those effects are unlikely to be pronounced. 
The preponderance of evidence suggests that politics 
affects the economy only marginally or even that social 
democracy fosters a more vibrant economy. According 
to Boix (1997), 


an accumulating body of research has shown... that the 
ability of parties to pursue their most preferred economic 
strategies is extremely limited. Governmental policies on 
the demand side have [only] temporary effects... are 
highly contingent on underlying institutional conditions in 


the labor market... and in the state [especially central 
banks]... and are dependent upon the world economy (p. 
817). 

Thus, in the end, 


Researchers that theorize about the interaction between 
the economy and politics emphasize ... the powerful ef-
fects that specific sets of structures, institutions, and norms


statistical difference (t = -1.1); a clinical study by Murphy (1982)
suggests much the same. Climate is sometimes said to affect
Scandinavians adversely, but Argyle (1987, 102) concludes that it has
“little or no effect on happiness.”
have in the formulation of politics . . . . Parties are trapped,

they point out, in specific equilibria induced by particular 

configurations of institutions (p. 841). 

Boix (1997, 841) argues that partisan preferences can 
be manifested in the supply side of the economy, such 
that the Left entrusts to “the state an important role in 
the provision of physical and human capital.” His 
empirical analysis confirms that governments domi- 
nated by social democratic parties spend more on 
public investment (i.e., physical capital) and on
education (i.e., human capital) than other
governments. These are among the key factors in 
economic growth (e.g., Levine and Renelt 1992), so 
governments of the Left plausibly may contribute to 
greater prosperity. In sum, the literature implies either 
(1) minimal long-term partisan effects on economic 
performance, in which case the conclusion that Left 
dominance increases quality of life independent of 
economic performance is unchanged, or (2) a positive 
influence on growth by leftist governments, in which 
case the net contribution of the Left to satisfaction may 
be larger than the direct effects reported here. 

Turning from the ideological complexion of govern- 
ments to the welfare state, the literature becomes 
rather more inconclusive and contentious. Perhaps the 
single most respected contemporary piece on the de- 
terminants of cross-national growth is by Levine and 
Renelt (1992), who attempt to isolate factors that have 
stable, robust effects on the economy. Their work was 
motivated by the widely different conclusions in the 
literature. They do not consider welfare spending per 
se, but they examine several variants of governmental 
consumption of GDP (e.g., total consumption, total 
less defense spending) that are certain to correlate 
highly with welfare state generosity. None of these 
variables proves to have robust effects on growth. 

Atkinson (1999) carefully reviews ten major studies 
of the effect of social transfer spending on growth. He 
reports (p. 34) that two find an insignificant effect, four 
a negative effect, and four a positive effect. His review 
of the literature on social security and unemployment 
comes to similar agnostic conclusions (pp. 43-9). Other 
studies could be cited, but none resolves the question.
The lack of conclusive evidence may mean that
the “real” effect of the welfare state on the economy is 
small, whatever its direction (Atkinson 1999). If so, 
then any indirect effects on satisfaction also are likely 
to be reasonably small, whatever their direction.15 

The possibility that the political variables in my 
model affect economic factors raises an econometric 
issue. If we conceive of the relationships as a system of 
equations in which economic performance is a function 
of politics, and satisfaction is a function of both, then 
the economic variables are endogenous. Yet, as this 
implicit system of equations is recursive, the usual 


15 The literature focuses almost entirely on spending per se, not the 
more nuanced measures of Esping-Andersen (1990) used here. The 
one study takes the “three worlds of welfare capitalism” as its 
organizing theme and finds that the socialist regime produces better 
economic performance than the liberal or conservative (Goodin et al. 
1999). If so, then the total effect of such a regime on satisfaction
would be larger than the direct effects reported here. 
statistical complications that make OLS inappropriate
do not apply. So long as the equations do not have
correlated errors, the coefficients will be unbiased.
Although there is little reason to be concerned about
the integrity of the reported results, I addressed the
possibility of correlated error by estimating a system of
equations using two-stage least squares. It is not my
main purpose to test the hypothesis that the political
variables affect the economic, but this method allows
such a test.

In this procedure, instruments are created for the
endogenous variables. To estimate growth I use the
model suggested by Granato, Inglehart, and Leblang
(1996, Table 1, model 4); this treats the mean rate of
growth in GDP (1960-89) as a function of per-capita
GDP in 1960, primary school enrollment in 1960, and a
measure of “achievement motivation.” To estimate
unemployment, I rely on variables and data from Hall
and Franzese (1998, Table 2), where the average
unemployment rate (1955-90) is determined by central
bank independence, the extent of coordinated wage
bargaining between unions and employers, the openness
of the economy, and labor union density.16 Further
details regarding these variables are provided in
the Appendix. To each equation I (iteratively) add one
set of the political variables that were significant in the
previous analysis. The 1990 value of per-capita GDP is
treated as an endogenous variable determined by its
1960 level, average economic growth, and the log
population of 1950. Finally, life satisfaction is modeled
as a function of the three endogenous variables, the
culture measure, and the political indicators that
proved important in earlier models. This procedure is
limited to the political variables that were significant in
Table 1. The second-stage models for life satisfaction
are provided in Table 5.

The results are much as before, suggesting that the
relationships between the political variables and
satisfaction are not affected by the potential for endogeneity.

16 Hall and Franzese use a different political variable (the simple
average share of cabinet seats for leftist parties), which I delete
because of redundancy. It should be noted that their variable is not
significant in their reported results.

                     Regime Attributes    Decommodification    Party Control
Socialist              .122**  (.032)        n/a                  n/a
Liberal               -.046*   (.028)        n/a                  n/a
Decommodification       n/a                 .056**  (.019)        n/a
Left Dominance          n/a                  n/a                 .017**  (.053)
GDP                    .087    (.005)       .089    (.068)       .100    (.068)
Unemployment          -.222**  (.042)      -.257**  (.079)      -.198*** (.057)
Individualism          .032    (.052)       .082    (.075)       .003    (.081)
Constant              4.82     (.610)      2.83*    (1.29)      5.17***  (.800)
Adj. R²                .786                 .515                 .533
N                      15                   15                   15

Note: The dependent variable is life satisfaction. Entries are unstandardized two-stage least-squares coefficients. Standard errors are in parentheses. *p = .05, **p = .01, ***p = .001.





For the sake of brevity, I do not report the other 
equations in the system that take the economic vari- 
ables (rather than satisfaction) as their dependent 
variable. The central conclusion that emerges from 
these models, however, can be summarized readily: 
None of the political variables that affected life satis- 
faction is significantly related to the state of the 
economy. This clearly suggests the absence of endogeneity.17
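The logic of the two-stage procedure described above can be illustrated with a minimal sketch. It is not the estimation code behind Table 5: it assumes a single endogenous regressor and a single instrument, the function names are hypothetical, and any data passed to it would be supplied by the reader. In the first stage the endogenous variable (e.g., an economic indicator) is regressed on the instrument; in the second stage the outcome (e.g., satisfaction) is regressed on the first-stage fitted values.

```python
from statistics import mean


def slope_intercept(x, y):
    """Bivariate OLS: return (slope, intercept) for y = intercept + slope * x."""
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


def two_stage_least_squares(y, x_endog, z_instr):
    """Hypothetical helper: 2SLS with one endogenous regressor and one instrument."""
    # Stage 1: regress the endogenous regressor on the instrument.
    b1, a1 = slope_intercept(z_instr, x_endog)
    x_hat = [a1 + b1 * z for z in z_instr]
    # Stage 2: regress the outcome on the first-stage fitted values.
    return slope_intercept(x_hat, y)
```

When the errors of the two equations are correlated, the second-stage slope remains consistent while the simple OLS slope does not. With a sample as small as N = 15, such estimates are of course imprecise.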


CONCLUSION 


Turning from the practical to the theoretical, compar- 
ison theory clearly fails. Aggregate levels of satisfaction 
vary systematically in response to the political factors 
that the theory predicts to be irrelevant. This study 
adds to the choir of voices suggesting that happiness is 
not relative (e.g., Diener et al. 1993; Veenhoven 1991). 
The culture theory fares rather better. Although the 
individualism/collectivism variable is not significant in 
some of the preliminary models, on the whole it acquits
itself nicely. The simplistic interpretation of satisfac- 
tion as driven largely or entirely by immutable national 
traits is not supported by the data for the same reason 
that comparison theory fails (countries differ systemat- 
ically and consistently in response to political condi- 
tions), but the contention that “culture matters” cer- 
tainly is sustained. 

Livability theory is also vindicated. Its fundamental
premise that the subjective appreciation of life varies directly
with national aspects that make life more comfortable
and secure is uniformly supported. One of those aspects
appears to be reduced dependence on market
forces that are, whatever their positive attributes,
“indifferent” to human needs (Lane 1978).





17 I do not wish to make too much of this. As noted above, it is
beyond the scope of this article to make strong claims about the
economy-politics relationship. One wishing to salvage a place for
politics in affecting the economy might be able to do so with other
data or variables, more cases, or different modeling techniques.
Caution is especially warranted given the small size of my sample in
the context of two-stage least squares.
In sum, as Inglehart might argue, there seem
to be sustained cultural differences in subjective ap- 
praisal of life, but national levels of satisfaction also 
vary in response to political factors. Party control of 
government and the quality and extent of welfare 
provisions affect a nation’s aggregate level of satisfac- 
tion regardless of cultural differences. Subjective well- 
being is affected meaningfully by the political choices 
of voters and the governments they elect. Democrats of 
all ideological persuasions will surely welcome this 
conclusion. 


APPENDIX 
Dependent Variable 


The dependent variable is the national mean on the following
question: “All things considered, how satisfied are you with
your life as a whole these days?” There are ten response
categories, and higher values indicate greater satisfaction.
Data are from the 1990-93 World Values Survey (WVS) 
(ICPSR #6160). National values are provided in Table A-1. 
A large literature assesses the validity and reliability of
self-reported measures of subjective well-being in general
and life satisfaction in particular. Satisfaction reports are not
particularly troubled by social-desirability bias and display
stability over time (see Myers and Diener 1995), although
they do vary in a nonrandom fashion in response to positive
and negative life events (e.g., Headey and Wearing 1992).
Individuals who purport to be more satisfied with their lives
also demonstrate the behavioral and attitudinal characteristics
this might imply, such as laughing and smiling more than
others, as well as scoring higher on other self-reported
measures of well-being (e.g., Myers 1993; Myers and Diener
1995; Watson and Clark 1991). Individual assessments of
well-being also correlate highly with external evaluations
from friends and family members as well as with clinical
evaluations (e.g., Myers and Diener 1995). Veenhoven (1993,
1994, 1996, 1997a, 1997b) examines in detail a large number 


of concerns about the scientific utility of self-reported
satisfaction and concludes that most doubts “can be discarded”
(1996, 4). According to Veenhoven (1976, 157), the “literature
on this point can be summarized as saying that simple
questions on happiness and life-satisfaction measure subjective
appreciation of life quite validly.” Both validity and
reliability are improved by the use of national averages rather
than individual-level data, because imprecision and response
error in individual judgments tend to balance out in large
samples.

National averaging raises the issue of the comparability of
data across countries. Three objections can be made, and
Veenhoven (1993, 1996, 1997a, 1997b; see also Ouweneel
and Veenhoven 1991) addresses each. The first relates to
linguistic problems because words such as “satisfaction” or
“happiness” may have subtleties and connotations that differ
across languages. Veenhoven sees little reason for concern,
since the rank ordering of countries is virtually identical for
national means on questions about “life satisfaction,”
“happiness,” or a rating between “best and worst possible lives.”
This also is further evidence in support of the validity and
reliability of the survey items themselves. Veenhoven (as well
as Inglehart 1990) finds, too, that average levels of satisfaction
within multilingual countries do not differ by language
group.

A second issue relates to social desirability. If the social
pressure to overstate happiness differs across countries,
meaningful comparison would be impossible. Among other
tests, Veenhoven compared average satisfaction in
countries where surveys indicate that happiness ranks high in
value hierarchies and countries where it does not. He found
no differences. The third issue is the reverse phenomenon: In
some cultures it may be immodest or unseemly to admit
satisfaction with one’s life, which would lead to underreporting.
Veenhoven (1996) found no evidence of this in multiple
empirical tests.

In sum, we can both measure life satisfaction with reasonable
accuracy and compare levels of satisfaction across
nations without great difficulty. To demonstrate the stability
of the reported results, however, I (see note 5) performed the
analysis with two alternatives to life satisfaction: self-reported
happiness (v18), a four-point ordinal scale recoded so that
higher values indicate greater happiness, and an additive
index of two dummies that assess mood: whether the respondent
had recently “felt on top of the world” or that “life was
wonderful” (v90) and “pleased about having accomplished
something” (v88). The national averages for both items have
high correlations with life satisfaction (.69 for
mood, .71 for happiness) for the present sample (correlations
tend to approach unity when using larger samples; e.g.,
Schyns 1998).

TABLE A-1. Data for Main Variables

                                                    Party Control            Welfare Regimes
                Life                                Left
Nation          Satisfaction   Decommodification   Dominance   Center   Socialist   Liberal   Conservative   Individualism
Austria             8.51             31.1             11.28       0         2          4           8               8
Belgium             7.60             32.4              6.77     24.32       4          4           8               7
Canada              7.89             22.0            -12.91     32.09       4         12           2               9
Denmark             8.16             38.1             16.19     10.07       8          6           2               8
Finland             7.68             29.2             18.16     22.78       6          4           6               8
France              6.78             27.5            -14.88      9.44       2          8           8               7
Germany             7.22             27.7            -13.25      8.34       4          6           8               8
Ireland             7.88             23.3             -7.88     28.72       2          2           4               5
Italy               7.30             24.1              3.88     39.67       0          6           8               6
Japan               8.53             27.1            -44.11       .09       2         10           4               4
Netherlands         7.77             32.4             -1.39     25.29       8          8           4               9
Norway              7.68             38.3             27.52      5.62       8          0           4               7
Sweden              7.97             38.1             36.50      5.81       8          0           0               8
United Kingdom      7.58             23.4            -12.68      (6)        4          6           0               9
United States       7.73             13.8            -26.00     19.00       0         12           0              10

Sources: Life satisfaction is from the World Values Survey II (v96); decommodification is from Esping-Andersen 1990, Table 2.2; the party control variables
are computed from Huber, Ragin, and Stephens 1997; the Liberal, Socialist, and Conservative variables are from Esping-Andersen 1990, Table 3.3;
individualism is from Diener, Diener, and Diener 1995, 856.

Major Independent Variables


The socialist, liberal, and conservative welfare state charac- 
teristics are provided by Esping-Andersen (1990, 55-78), who 
first scores each country on seven ratio-level variables (Table 
3.1). Two variables measure stratification of (and the main- 
tenance of status differentials inherent in) pension schemes: 
the number of occupationally distinct pension programs, and 
the expenditure as a percentage of GDP on pensions to civil 
servants per se (i.e., on programs that fund such workers
separately from workers in general). Three variables measure 
the “residualism” of welfare policy (an index of the reliance 
on means-testing of income maintenance, the percentage of 
pensions that are provided by employers, and the percentage 
of total health care spending that is private rather than 
governmental). Another two measure welfare state “univer- 
salism”: the percentage of the labor force aged 16-65 eligible 
for benefits, computed by averaging the percentage eligible 
for state sickness, unemployment, and pension programs; and 
an index for the level of benefit equality such programs afford
citizens, defined as the percentage of the maximum legal 
benefit that the “standard” citizen tends to receive. 

These three sets of measures correspond to the indicators 
for creating the conservative, liberal, and socialist regime 
variables, respectively. Each country is assigned a value for
each of these summary variables, based upon its scores on the 
individual indicators. The socialist variable, for example, is 
computed as follows. Each country is assigned a value 
between 0 and 4 on each of the two measures of pension 
universalism, and these values are in turn summed to obtain 
the score on the socialist attributes variable. The assignment 
on the four-point scale is based upon a nation’s relative 
position in the distribution for the given measure of univer- 
salism (as defined by mean and standard deviation). The 
socialist variable thus ranges from a minimum of zero for 
countries with low eligibility and low equality of benefits (e.g., 
the United States) to a maximum of eight for nations with 
both high eligibility and high equality (e.g., Denmark).
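The scoring rule for the socialist variable can be sketched as follows. The text specifies only that each universalism measure is scored between 0 and 4 according to a nation's position relative to the mean and standard deviation, and that the two scores are summed. The particular cut points used here (one standard deviation either side of the mean, with scores of 0, 2, and 4) and the function names are illustrative assumptions, not Esping-Andersen's published thresholds.

```python
from statistics import mean, pstdev


def band_score(value, all_values):
    """Score one indicator as 0, 2, or 4 (assumed cut points: mean +/- one SD)."""
    m, s = mean(all_values), pstdev(all_values)
    if value > m + s:
        return 4  # well above the cross-national average
    if value < m - s:
        return 0  # well below the cross-national average
    return 2      # within one standard deviation of the mean


def socialist_score(eligibility, equality, all_eligibility, all_equality):
    """Sum the two universalism scores; the result ranges from 0 to 8."""
    return (band_score(eligibility, all_eligibility)
            + band_score(equality, all_equality))
```

Under these assumed cut points, a country scoring high on both eligibility and benefit equality receives the maximum of eight, and one scoring low on both receives zero, matching the range described above.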

Decommodification is from Esping-Andersen (1990, 35- 
54). It refers to the ability of citizens to decline work without 
the loss of “income or general welfare” (p. 23). The details of 
the rigorous operationalization are not readily summarized. 
Perhaps the most succinct description is offered by Messner 
and Rosenfeld (1997, 1399): 


[The index] encompasses three primary dimensions of the 
underlying concept: the ease of access to welfare benefits, 
their income-replacement values, and the expansiveness of 
coverage across different statuses and circumstances. A 
complex scoring system is used to assess [the amount of 
decommodification provided by] the three most important 
social welfare programs: pensions, sickness benefits, and 
unemployment compensation. The scoring system reflects 
the “prohibitiveness” of conditions for eligibility [e.g., 
means testing], the distinctiveness for and duration of 
entitlements [e.g., maximum period of benefits], and the
degree to which benefits replace normal levels of earning. 
The indices for these three types of .. . programs are then 
aggregated into a combined [additive] index. 


The individual indices are weighted by the percentage of 
the relevant population covered by the respective programs. 
Each dimensional index is built from multiple indicators (e.g.,


five for old age pensions, four each for sickness and unem- 
ployment) that reflect the concerns noted above. The sum- 
mary score, which is the sum of the scores for each of the 
three dimensions, has a mean of 27.2 and a standard devia- 
tion of 7.7. 

The two variables for party control of government are 
computed from data reported by Huber, Ragin, and Stephens 
(1993, 1997). Left dominance is the proportion of cumulative 
leftist party cabinet seats (from 1950 to 1990) less the 
proportion of rightist party seats. Center is the cumulative 
proportion for all centrist parties. Huber, Ragin, and Ste- 
phens divide parties into left, secular center, center Christian, 
center Catholic, secular right, right Christian, and right 
Catholic. I created total center and total Right by collapsing 
the last six categories into two. 
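The construction of the two party-control variables can be sketched in a few lines. The category labels follow the seven-way division described above; the dictionary layout, the function name, and the numbers in the usage note are hypothetical and are not values from Huber, Ragin, and Stephens.

```python
# The six non-left categories, collapsed into total center and total right.
CENTER = ("secular center", "center Christian", "center Catholic")
RIGHT = ("secular right", "right Christian", "right Catholic")


def party_control(cumulative):
    """Return left dominance and center from cumulative cabinet-seat scores.

    `cumulative` maps each party category to its cumulative score for
    1950-90; missing categories are treated as zero.
    """
    left = cumulative.get("left", 0.0)
    center = sum(cumulative.get(k, 0.0) for k in CENTER)
    right = sum(cumulative.get(k, 0.0) for k in RIGHT)
    # Left dominance is the left score less the total right score.
    return {"left_dominance": left - right, "center": center}
```

For instance, a hypothetical country with a left score of 20.0, a secular right score of 5.0, a right Christian score of 2.5, and a center Christian score of 10.0 would have left dominance of 12.5 and center of 10.0.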

Data on the individualism of cultures are based on the 
theoretical perspective articulated by Triandis (1989, 1995). 
The basic distinction is between the prevalence of cultural 
norms that encourage people to define themselves as either 
essentially autonomous (although not necessarily atomistic) 
or members of (typically nested) superordinate groups (e.g., 
family, religion, ethnicity, region, nation). Autonomous indi- 
viduals have relatively greater personal freedom in choosing 
and pursuing their life goals, and they also may attach greater 
importance to personal happiness. In collectivist cultures, 
individual freedom is subordinated to the customs, practices, 
and perhaps needs of the reference group. There is relatively 
more emphasis on conformity and obedience to authority, 
and relatively less on maximizing personal satisfaction. 

Collectivist cultures share an affinity with what are often 
labeled “traditional” societies, and cultures tend to become 
more individualistic as they modernize (Triandis 1989). I use 
Triandis’s ten-point expert rating of the collectivist-individu- 
alist dimension, with higher values indicating greater individ- 
ualism; they are reported in Diener, Diener, and Diener 
(1995, 856). This indicator has become the standard measure 
of what Veenhoven (1997b) calls the “individualization” of 
cultures, and he observes that the validity of the Triandis 
indicator appears high (e.g., it correlates highly with a variety 
of other more narrow survey-derived indicators). 


Additional Variables In the Individual-Level 
Models 


The variables with reference to the 1990 WVS (ICPSR 
#6160) are: life satisfaction (v96), 1-10 scale in ascending 
order; married (built from v181), 1 = “yes” or “living as 
married,” 0 otherwise; number of children (v211), 0-6+; 
gender (v353), 1 = male, 2 = female; education (from v356), 
the age education was completed in seven ascending catego- 
ries; reported quality of home life (v180), 1-10 ascending 
scale; age (v377), six ascending categories; dummy for 
whether chief wage earner is unemployed (built from v358, 
v360, and v361), 1 = “yes,” 0 otherwise; household income 
(v363), in 1-10 ascending income ranges; and church attendance
(built from v147), 1 = once a month or more, 0
otherwise. 


Additional Variables Used In Two-Stage 
Least-Squares Models 


From Granato, Inglehart, and Leblang (1996) I took the 
following: per-capita GDP in 1960 (from Levine and Renelt 
1992); the number of students enrolled in primary school as 
a proportion of the relevant age group in 1960 (Levine and 
Renelt 1992); and motivation, based on their “Four Item 
Achievement Motivation Index,” computed as (Thrift + 
Determination) - (Obedience + Religious Faith). The items
in this equation are based on the percentage of respondents 
in a given country who agree these “are qualities children 
should be encouraged to learn at home,” from the WVS 1990 
(v232, v233, v234, v236).

From Hall and Franzese (1998), I took the following: 
central bank independence, defined as the average of “the 
five most commonly used indexes” (for references, p. 516, n. 
37) for assessing “both the legal status of the central bank 
and its reputation for independence,” 0-1 scale; an index (p. 
516) of the “degree of wage bargaining” between trade 
unions and employer associations, a five-point scale; openness
of the economy as measured by exports plus
imports as a percentage of GDP (p. 518, n. 43); the internationally
comparable unemployment rate in percent (p. 518, n.
43); and labor union density in percent (for sources, see p. 
518, n. 43). All data are averages for 1955-90. 
fluctuations in macropartisanship are responsive


approval nor consumer sentiment is found responsible for the 1980 shift. Realignments aside, macroparti- 
sanship is guided by a stable, not a continuously moving, equilibrium. 


[A]ll those millions of Democrats and Independents who I
know are looking for a cause around which to rally and 
which I believe I can give them. 

—Ronald Reagan 


The study of party identification has entered the
twilight zone. Gone seems to be the day when
partisanship was regarded as an immovable object,
impervious to the turbulence of politics and
guiding everything else in the manner of an irresistible 
force. Instead, the view has dawned that this may be an 
object in perpetual motion, buffeted by the prevailing 
winds of politics and economics. There is no question 
that macropartisanship—the balance between Demo- 
crats and Republicans in the American electorate— 
has exhibited substantial movement over the last half 
century. What is less certain is how seriously to take 
such movement and where it originates. 

We contend that a rare, massive, and enduring shift 
of the electoral equilibrium has occurred, commonly 
known as a partisan realignment. Although familiar to 
students of elections, the concept of realignment has so 
far attracted little interest in efforts to come to grips 
with macropartisanship. This research demonstrates a
marked and lasting shift of the partisan equilibrium 
after 1980. The election of Ronald Reagan triggered a 
systematic growth of Republican identification, which 
cut deeply into the overwhelming Democratic lead that 
dates back to the New Deal realignment. The new 
GOP competitiveness enabled the Republican Party, 
among other things, to crack the solid Democratic 
control of the House of Representatives in the 1994 
midterm election and to deliver unified control in 2000
for the first time in nearly fifty years. 

The identification of a realignment distinguishes our 
model from recent efforts to come to grips with the 
dynamics of macropartisanship: the moving equilib- 
rium (Erikson, MacKuen, and Stimson 1998; also 
MacKuen, Erikson, and Stimson 1989), the constant 
equilibrium (Green, Palmquist, and Schickler 1998), 
and an intermediate type of equilibrium process known 
as “fractional integration” (Box-Steffensmeier and 
Smith 1998). Instead, we see a sharp but rare regime 
change in macropartisanship. There is one equilibrium 
before the Reagan presidency and another following it. 
Even though the shift was not an overnight phenome- 
non but a gradual adjustment, it should not be mis- 
taken for a moving equilibrium. In our model, the 
equilibrium shift is a rare event, not a response to the 
flow of everyday politics. In large part, our account of 
macropartisanship preserves the American Voter para- 
digm of party identification (Campbell et al. 1960). 

We begin by spelling out the case for a major 
realignment of partisanship in the 1980s and then test 
for its existence with data from the National Election 
Studies (NES). We next propose a dynamic model of 
partisan realignment to be tested with a version of 
macropartisanship that matches the NES standard 
(New York Times/CBS News surveys). We control for 
the influence of everyday economic and political 
forces, such as presidential approval ratings and con- 
sumer confidence. Moreover, alternate specifications 
of realignment will be considered. Finally, we discuss 
how our findings relate to changing theories of party 
identification. 


PARTISAN REALIGNMENT 


Students of American elections commonly describe the 
relationship between the major parties as one of dom- 
imation, with Republicans or Democrats enjoying elec- 
toral superiority for long stretches of time. One ob- 
server likens the majority party to the “sun” of the 
electoral universe, orbited by the minority party 
“moon” (Lubell 1952). Like planetary configurations, 
the majority-minority alignment is not expected to 
change often, and a shift is akin to a cosmic upheaval,
called a critical or realigning election (Burnham 1970;
Campbell et al. 1960; Key 1955). Sellers (1965) postu- 
lates an “equilibrium cycle” in American politics: Re- 
alignments that lead to the ascendancy of one party are 
followed by a return to a more equal balance until the 
arrival of the next realignment. The elections of 1860,
1896, and 1932, according to most observers, can be 
designated as realigning. 

The causes and consequences of realignment have 
received much scholarly attention (Clubb, Flanigan, 
and Zingale 1980; Sundquist 1973). A realignment 
requires “the presence of a great national crisis, lead- 
ing to a conflict regarding governmental policies and
the association of the two major parties with relatively 
clearly contrasting programs for its solution” (Camp- 
bell 1966, 76). During these periods politics becomes 
unusually ideological, and electoral defeats of the 
majority party are not simply deviations brought on by 
temporary lapses in judgment or by the personal 
appeal of a candidate. Instead, the vote begins to 
reflect a major reorientation of long-term party attach- 
ments. Some argue that the reorientation occurs 
through conversion of partisanship in the electorate 
(Erikson and Tedin 1981; Sundquist 1973), and others 
emphasize generational change or mobilization 
(Anderson 1979; Beck 1974; Campbell et al. 1960; 
Norpoth 1987). Moreover, the shift alters the compo- 
sition of party coalitions in the electorate, giving some 
groups a stronger voice and weakening that of others
(Petrocik 1981; Stanley and Niemi 1995). 

We contend that the 1980 presidential election, in 
and of itself, exhibited the symptoms of a realignment 
in five respects. First, for the first time since 1932, an 
elected president was defeated. Although the majority 
party had previously lost control of the White House in 
“deviating elections,” Jimmy Carter was repudiated in 
a landslide. He garnered even fewer votes in the
Electoral College than did Hoover in 1932, whose
ouster ushered in the New Deal realignment (Burnham
1981; Pomper 1981).

Second, the national economy was in severe distress, 
suffering from a seemingly incurable combination of 
high inflation and high unemployment (stagflation). 
This adversity, along with humiliation in foreign policy, 
cost Carter the election (Markus 1982). More impor- 
tant, a dismal performance of the economy under- 
mined the Democratic Party’s New Deal reputation as 
the party of prosperity (Pomper 1981, 91). 

Third, the winner in 1980 promised a radical depar- 
ture from traditional Democratic ideology and a reso- 
lution of the national condition diagnosed as “malaise” 
by the losing incumbent. Ideological zeal set Reagan 
apart from the other victorious Republicans before 
him. As the “most conservative president since ... 
1928” (Burnham 1981, 99), Reagan would be able to 
count on a conservative tilt in the American public 
(Markus 1982, 551). 

Fourth, Reagan was an intensely partisan Republi- 
can, determined to make the GOP the majority party. 
He was the “only president of the post-war era who 
took office as an avowed partisan and unvarnished
ideologue” (Miller and Shanks 1996, 168). 

Fifth, the turmoil and discontent of the previous 
fifteen years softened the electoral terrain for a parti- 
san realignment in 1980. Attachments to either major 
party had diminished through a process dubbed 
“dealignment” (Converse 1976; Nie, Verba, and Petro- 
cik 1976; Norpoth and Rusk 1982). By 1980, a huge 
pool of voters with weak or no partisan attachments 
waited to be tapped by a party with a new agenda. 

Hence, there was opportunity, motive, and weapon 
in 1980 to fashion a partisan realignment out of an 
electoral victory. Yet, Republicans in the past had 
found ways to win presidential elections but could not 
crack the solid lead of the Democratic Party in parti- 
sanship. Would Reagan have more success? 


CHANGE OF MACROPARTISANSHIP 


As is common in the study of macropartisanship, the 
variable of interest is the percentage of Democrats 
among those with a major party identification (Demo- 
crats and Republicans, regardless of the strength of the 
identification). Independents, including those who profess
to lean toward either party, are excluded. Macropartisanship
simply registers the relative weight of
Democratic vis-a-vis Republican identification. For ex- 
ample, a figure of 62 indicates that among 100 respon- 
dents expressing a partisan identification, there are 62 
Democrats and 38 Republicans. The focus on partisans 
at the expense of independents is quite congenial for 
the examination of a possible realignment, since we are 
concerned with shifts of the partisan balance. 
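The measure just defined amounts to a one-line calculation, sketched here for concreteness. The function name and the counts in the usage note are hypothetical; only the definition (Democrats as a percentage of major-party identifiers, with independents excluded) comes from the text.

```python
def macropartisanship(democrats, republicans):
    """Percentage of Democrats among major-party identifiers.

    Independents, including leaners, are excluded before this point,
    so only Democratic and Republican identifiers are counted.
    """
    identifiers = democrats + republicans
    if identifiers == 0:
        raise ValueError("no major-party identifiers in the sample")
    return 100.0 * democrats / identifiers
```

A hypothetical survey with 310 Democrats, 190 Republicans, and 250 independents yields `macropartisanship(310, 190)`, or 62.0, the same figure as the example of 62 Democrats and 38 Republicans among 100 identifiers.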

Figure 1 displays the percentage of Democrats 
among party identifiers in all NES surveys, including 
midterm elections, from 1952 to 1998. The imprint of 
the Reagan innovation on the partisan balance in the 
electorate is unmistakable. Between 1980 and 1988, 
Democratic identification fell from 64.5% to 56%,
which is by far the single largest shift in half a century. 
From 1952 to 1980, the Democratic Party maintained 
the overwhelming lead in partisanship forged in the 
New Deal realignment, despite defeats in presidential 
elections. If there exists a steady-state period for
macropartisanship, it extends far beyond 1964, all the
way to 1980. Although partisan identification unrav- 
eled during the 1960s and 1970s, the relative propor- 
tion of Democrats among identifiers was virtually the 
same in the 1980 NES survey (64.5%) as in 1952 
(63.4%). Only in the aftermath of the 1980 election did 
the Democratic share fall sharply, and, while continu- 
ing to fluctuate, remained at a lower level. 

Table 1 provides a statistical account of the change. 
For each presidential election for which there are 
sufficient time points before and after in the NES 
series, we determined how strongly macropartisanship 
shifted after the election. The higher the correlation 
between the binary time variable and macropartisan- 
ship, the stronger is the change; a negative sign indi- 
cates a shift favoring the Republican Party. Consider 
the correlations in the first column of Table 1, where 
no controls are applied. It appears that after all elections
from 1972 to 1988 the GOP made significant
gains in macropartisanship, but not all instances remain
significant once controls are applied. Only two
elections stand out: 1980 and 1984. Clearly, something
important happened to macropartisanship after these
elections. The control for any of the other elections
that initially proved significant (1972, 1976, and 1988)
does not make much difference for the two Reagan
elections. At the same time, controlling for either 1980
or 1984 renders all others nonsignificant. It is safe to

FIGURE 1. Macropartisanship, 1952-96 NES Surveys

TABLE 1. Change in Macropartisanship after Presidential Elections
Correlation between Time and Macropartisanship

Controlling for 
Presidential Election No Control 1972 1976 1980 1984 1988 
1956 “  =.05 16 .16 .22 17 .08 


25 
11 
—.02 
1972 —.44* — .02 .10 —.05 — 22 
1976 -50 -28 = 17 —.04 =27 
1980 —.68"™ —.59™* —.56™ = —.27 —.48* 
1984 —.69"* —.60% —.55™ —.31 — 

1988 —.58™ —.48* —.42* —.24 —.07 — 
Source: 1948-96 cumulative NES file.

Note: Entries are correlations between NES macropartisanship and a binary time variable. For each election, the time variable was scored 0 for all time
points up to and including that election, 1 for all time points afterward. For example, the correlation of -.05 between the 1968 time variable and
macropartisanship indicates that macropartisanship was lower (less Democratic) after the 1968 election than before, although not by much or significantly
so. *p = .05, **p = .01, ***p = .001.

conclude that, in the nearly 50 years of NES polling,
the only marked and lasting shift in macropartisanship 
occurred in the wake of the Reagan elections. All the 
others either made no dent or only appear to do so 
because of their proximity to the Reagan elections. 
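
The logic of Table 1 can be sketched in Python as follows. The series is made up for illustration (it is not the NES data), and the sketch assumes that the "controlling for" entries are first-order partial correlations, which the article does not spell out.

```python
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Plain Pearson correlation between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def step_dummy(years, election):
    """0 for surveys up to and including the election year, 1 afterward."""
    return [0 if yr <= election else 1 for yr in years]


def partial_corr(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y, controlling for z."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Made-up biennial series: a seven-point drop after 1980 plus a small wiggle.
years = list(range(1952, 2000, 2))
series = [64.0 if yr <= 1980 else 57.0 for yr in years]
series = [v + (1 if i % 2 else -1) for i, v in enumerate(series)]

d1972 = step_dummy(years, 1972)
d1980 = step_dummy(years, 1980)
r_1972 = pearson(d1972, series)
r_1980 = pearson(d1980, series)
r_1972_given_1980 = partial_corr(r_1972, pearson(d1972, d1980), r_1980)

print(round(r_1972, 2), round(r_1980, 2), round(r_1972_given_1980, 2))
```

In this toy series the 1972 dummy correlates negatively with the index only because it overlaps with the post-1980 period; once the 1980 dummy is controlled, its correlation collapses toward zero, which is the pattern the text describes for the non-Reagan elections.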

These findings are based on an analysis that includes 
midterm as well as presidential elections. Some of the
midterm readings for macropartisanship in Figure 1, to 
be sure, do not boost our claim. The first Reagan 
midterm (1982) shows no decline whatsoever in Dem- 
ocratic partisanship relative to 1980. Had realignment 
not yet begun? Or was this a temporary delay in 
Republican consolidation due to an unusually bad mix 
of economic and political forces that obscured an 
on-going realignment? The analysis below provides an 
answer. Although some of the pro-Republican shift 
was left intact by the second Reagan midterm (1986) 
and the Bush midterm (1990), those midterms raise doubts about
realignment claims (Hurley 1991). Whatever the eco- 
nomic and political fallout in those elections, the 
pronounced fluctuation in macropartisanship between 
on-year and off-year elections is a novelty. This is not 
the place to delve into a resolution of this complication. 
Suffice it to say that the erratic behavior of partisanship 
in midterms notwithstanding, the Reagan years are the 
only significant instance in nearly half a century when 
macropartisanship moved with lasting consequence. 

We agree that “between 1980 and 1988, at least a 
limited version of the long-heralded partisan realign- 
ment took place” (Miller and Shanks 1996, 166; also 
Miller 1991, 564). It was limited in that it did not 
entirely sweep the Democratic Party from majority 
status (among identifiers) in the American electorate. 
And it did not give the GOP immediate control of both 
houses of Congress, although that finally materialized 
in 1994. In some way, the 1980s realignment is the 
mirror image of the one in 1896, which turned a highly 
competitive balance between the two parties into a 
decided advantage for the Republicans. Almost a 
century later, a decided advantage for the Democrats 
that dated back to the New Deal was turned into a 
highly competitive balance. What type of realignment 
model can explain such a marked shift of the partisan 
equilibrium in the American electorate? 


A MODEL OF REALIGNMENT 


By definition, realignment presumes durable, not tran- 
sient, change. That does not mean that the new condi- 
tion must prevail forever, but it must last a reasonably 
long time. The concept of realignment is ambivalent, 
however, about whether such change takes place at a 
single critical moment or occurs more gradually. A 
model of gradual change with permanent conse- 
quences has the form (minus the error term): 


$$Y_t = \left[\frac{\omega}{1 - \delta B}\right] X_t.$$


In the case at hand, Y denotes the macropartisanship
index and X the onset of realignment; ω and δ refer to
parameters that capture the dynamic of change (explained
below), and B is the backshift operator, so that
B X_t = X_{t-1}. The gradual-permanent model is
rooted in transfer function analysis (Box and Jenkins
1976; Box and Tiao 1975; Clarke, Norpoth, and White- 
ley 1998). Carmines and Stimson (1989) apply this 
model with much success in their analysis of issue- 
evolution realignments. In their “dynamic growth” 
model, the central features are as follows (p. 143): 


The model is dynamic because it presumes that at some 
point the system moves from a fairly stationary steady- 
state period to a fairly dramatic change; the change is 
manifested by a “critical moment” in the time series. 
Significantly, however, the change—the dynamic growth— 
does not end with the critical moment. Instead, it continues
over an extended period, albeit at a much slower pace.


In other words, a critical shock gradually moves a 
system from one steady state to another. The effect is 
enduring for the foreseeable future, but it is not 
accomplished in a single step. Two parameters govern 
this process: ω captures the initial shock, and δ summarizes
the aftershocks. Think of ω as initiating either
a climb or a descent to a new equilibrium. Either way,
the initial effect may be positive (a gain) or negative (a
loss), but the rate at which it continues to build
invariably has a positive sign. In the long run, the total
accumulation of change predicted by the gradual-permanent
model will add up to the ratio ω/(1 - δ). It is
by such an amount that the aftermath equilibrium will 
differ from the one prevailing before the realignment. 
Technically, the period of change is infinite, but in 
practical terms most of the change will register within 
a foreseeable time frame. 
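
The long-run ratio can be checked with a short derivation. It assumes, as the later table notes indicate, that X is a step variable switching from 0 to 1 at the onset period T, and it reads the model as a first-order transfer function in which each period's shift is δ times the previous period's shift plus a fresh ω. The symbol ΔY for the shift relative to the pre-realignment level is introduced here for illustration only.

```latex
\begin{aligned}
\Delta Y_{T+k} &= \delta\,\Delta Y_{T+k-1} + \omega, \qquad \Delta Y_{T-1} = 0,\\[4pt]
\Delta Y_{T+k} &= \omega \sum_{j=0}^{k} \delta^{j}
               = \omega\,\frac{1-\delta^{k+1}}{1-\delta},\\[4pt]
\lim_{k\to\infty} \Delta Y_{T+k} &= \frac{\omega}{1-\delta}
               \qquad (0 \le \delta < 1).
\end{aligned}
```

With δ = 0 the sum collapses to ω after a single period, which is the abrupt case; the closer δ is to 1, the longer the adjustment takes and the larger the eventual shift relative to the initial shock.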

The requirement that change is supposed to occur 
over some longer period sets the gradual-permanent 
model apart from the theory of critical elections, which 
posits a relatively sudden conversion (Burnham 1970; 
Key 1955). Furthermore, the gradual model is not the 
same as a secular realignment framework, which views 
change as proceeding incrementally (Converse 1966; 
Key 1959), on southern eee The gradual- 
permanent model captures the story of a rapidly un- 
folding change during a “realigning electoral era” 
(Campbell 1966, 75). Even if the change turns out to be 
abrupt, the model still proves useful. The aftershock 
parameter will drop to zero, and all the weight will rest 
on the initial shock. The difference between a gradual 
and an abrupt model is in the delta parameter. We can 
let the parameter estimate settle this issue, but we need 
guidance as to the onset of change. How certain is it 
that the equilibrium shift commenced with the 1980 
election? Why not with the policy success of Reagan’s 
first year in office? Or with the recovery following the 
1981-82 recession? Or with the “It’s morning in Amer- 
ica, again” reelection campaign of 1984? 

To settle these questions, we need a more frequent 
sounding of partisanship than the biennial NES but 
one that nonetheless relies on the same instrument. 
The NES party identification question, with its “gener- 
ally” and “usually” qualifiers, aims to determine not 
just any partisan but especially the “long-term identi- 
fier who is momentarily piqued at his own party, or 
tempted to defect temporarily to vote for a charismatic 
candidate of another party” (Converse 1976, 35).

TABLE 2. Annual Shifts in Macropartisanship during the Reagan Era,
New York Times/CBS News Surveys

Independent Variable    Estimate
1980                    -6.72*** (1.15)
1981                     2.82    (1.48)
1982                    -1.35    (1.48)
1983                    -2.02    (1.48)
1984                    -4.22**  (1.48)
1985                     1.07    (1.48)
1986                      .57    (1.48)
1987                      .20    (1.47)
1988                    -1.97    (1.09)
Constant                65.60*** (.47)

Source: New York Times/CBS News polls, quarterly aggregates, 1976:01-1998:04.
Note: The independent variables are binary time variables scored 0 from the first quarter of 1976 to the fourth quarter of the
respective year and 1 afterward through the fourth quarter of 1998. *p = .05, **p = .01, ***p = .001.



Among available surveys that probe this sentiment with 
some regularity, the New York Times/CBS News polls 
offer by far the best alternative: “Generally speaking, do 
you usually consider yourself a Republican, a Democrat,
an Independent, or what” (emphasis added)?1
The New York Times/CBS News polls are conducted
often enough during a year to permit quarterly, but not
monthly, aggregation. These polls are thus well suited
to construct a valid measure of traditional party iden- 
tification and to probe the dynamic properties of the 
partisan realignment. In short, this is the best available 
measure. Its advantages compared to the Gallup ver- 
sion have not escaped the attention of scholars (Green, 
Palmquist, and Schickler 1998), although none have 
probed for realignment effects. 

Our efforts to identify the onset of the realignment 
focus on annual shifts in macropartisanship during 
Reagan’s presidency, beginning with the 1980 election. 
It would be tedious to search for this type of needle in 
a quarterly haystack; as a practical shortcut, we limit 
the search to annual aggregates. Table 2 presents the 
estimates for each of those years. Consider 1982: The 
estimate (—1.35) indicates that at the end of that year, 


1 In contrast, the Gallup Poll measure (“In politics, as of today, do
you consider yourself a Republican, a Democrat, or an Independent?”)
probes a more short-term form of partisanship. The use of
this measure by MacKuen, Erikson, and Stimson (1989) has sparked
heated controversy (Abramson and Ostrom 1991, 1992, 1994;
Bishop, Tuchfarber, and Smith 1994; MacKuen, Erikson, and Stimson
1992). Considerable wording confusion reigns in the literature on
(macro)partisanship. MacKuen, Erikson, and Stimson (1989) did not
report the wording of the Gallup question in their original article and
later (1992) provided an incomplete version of the New York
Times/CBS News question, omitting the “usually” qualifier. Such
slippage, however, has an honored precedent. The American Voter
(Campbell et al. 1960, 122) itself left out “usually,” although The
Voter Decides (Campbell, Gurin, and Miller 1954, 90, 217) got it
right.

macropartisanship fell by 1.35 percentage points (becoming
less Democratic). Note that this is a partial
estimate that controls for the effects of all the other 
annual shifts included in the table. Although it points 
in the right direction (change of macropartisanship 
toward the Republican side), the estimate for 1982 falls 
short of significance, as do many of the others in the 
table. 
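
What a "partial estimate" means for nested step variables can be sketched in Python. The sketch assumes plain least squares, which the article does not state for Table 2, and the readings are made up rather than taken from the New York Times/CBS News series.

```python
from statistics import mean


def step_shift_estimates(values, breakpoints):
    """Coefficients from regressing `values` on a constant plus one 0/1 step
    dummy per breakpoint (dummy k is 0 up to index breakpoints[k] inclusive
    and 1 afterward).

    Because the dummies are nested, least squares fits every segment by its
    own mean, so each coefficient equals the difference between the means of
    two adjacent segments.
    """
    edges = [0] + [b + 1 for b in breakpoints] + [len(values)]
    segment_means = [mean(values[lo:hi]) for lo, hi in zip(edges, edges[1:])]
    constant = segment_means[0]
    shifts = [after - before
              for before, after in zip(segment_means, segment_means[1:])]
    return constant, shifts


# Made-up quarterly readings: eight quarters at one level, four lower,
# four lower still.
series = [66, 65, 66, 65, 66, 65, 66, 65, 60, 59, 60, 59, 58, 57, 58, 57]
constant, shifts = step_shift_estimates(series, breakpoints=[7, 11])
print(constant, shifts)  # 65.5 [-6.0, -2.0]
```

Under that assumption, each annual estimate captures only the additional movement from one year's level to the next, net of every earlier shift, which is why a year can point in the Republican direction and still fall short of significance.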

Only two entries in Table 2 clearly pass the test: the 
two presidential years. The 1980 election estimate 
packs by far the largest wallop, cutting the Democratic 
edge in macropartisanship by 6.7 percentage points. If 
there was a realignment during Reagan’s presidency, it 
began right after the election. Yet, although less im- 
pressive, the significant effect for 1984 suggests a 
second shock. Is the one-shock realignment model too 
simple? Did both Reagan elections shape the realign- 
ment, each perhaps with a rationale and dynamic of its 
own? In the next section, we test the rival realignment 
specifications, but first we note the fundamentals of 
economics and politics introduced by the theory of 
macropartisanship. Those fundamentals pose serious 
competition for any realignment account of partisan 
change. 


THE DYNAMICS OF MACROPARTISANSHIP 


Although many scholars concede that a partisan re- 
alignment of some form may have taken place in the 
1980s, it is argued that this only proves the sensitivity of 
partisanship to prevailing political and economic con- 
ditions. Contrary to the classical theory of party iden- 
tification, which leaves little, if any, room for systematic 
short-term variation in party identification itself (as 
opposed to short-term forces affecting the vote choice 
at any given election), the work of MacKuen, Erikson, 
and Stimson (1989), in particular, documents fluctua- 
tions in partisanship that are neither random nor 
negligible. These variations appear to go hand in hand 
with political and economic circumstances (also Fio- 
rina 1991; Weisberg and Kimball 1995). 

Without a doubt the incumbent in the White House 
is the most conspicuous standard-bearer of his party. 
His standing with the mass public, among all possible 
conditions, should be able to translate into support for 
the party he represents. Few issues can match the 
potential of the economy to affect partisan choices, as 
a vast domain of research documents (e.g., Norpoth, 
Lewis-Beck, and Lafay 1991). This is not the place to 
sort out some of the numerous disputes over the ways 
in which the economy affects politics, but the specifics 
of the claims about macropartisanship merit close 
attention. A critical replication of those claims shows 
that, without numerous control variables to represent
the effects of presidential administrations and political
events, macropartisanship moves far less strongly with
the economic and political winds (Green, Palmquist, 
and Schickler 1998). Apparently, those factors exert 
considerable influence, even though their role is not 
explicated in the theory of macropartisanship. 

In response to that challenge, Erikson, MacKuen, 
and Stimson (1998) reformulated their theory. The new
version argues that macropartisanship is a moving
equilibrium, subject to the accumulation of past political
and economic shocks. The key factors are still the
familiar variables of presidential approval ratings and
consumer sentiment, but the revised hypothesis states
that changes in approval and sentiment, however slight
in the short run, cumulate to generate substantial shifts
in aggregate partisanship. At the same time, other
possible influences are dismissed: “Changes in macropartisanship
from other sources that do not register in
the approval and sentiment series (such as election
campaigns and their aftermath) are mostly transient
and of little long-term consequence” (Erikson, MacKuen,
and Stimson 1998, 901). Seen from this point of
view, any realignment in the 1980s must have been the
work of an unusually popular president and good
economic times. Did short-term elements somehow
fuse into a long-term compound?

TABLE 3. Models of Macropartisanship

Independent Variable                       Model 1
1980
  Initial effect                           -1.00 (0.15)
  Dynamic rate                              0.91 (0.01)
(Presidential Approval x Reagan), t-1
  Initial effect
  Dynamic rate
Consumer Sentiment, t-1
  Initial effect
  Dynamic rate
Constant                                   65.60*** (0.46)
Adj. R2                                     0.84
Standard error of estimate                  2.05
LBQ                                        16
N                                          92

Source: New York Times/CBS News polls, quarterly aggregates, 1976:01-1998:04 (N = 92).
Note: Cell entries are transfer function estimates for New York Times/CBS News macropartisanship; standard errors are in parentheses. The variables
“1980” and “1984” are binary time variables scored 0 from 1976:01 to 1980:04 and 1984:04, respectively, and 1 afterward through the fourth quarter of
1998. Given the Democratic direction of the macropartisanship measure, the observations for presidential approval and consumer sentiment were
inverted around their overall means for periods of Republican control of the presidency. ***p < .001.

Our first attempt to answer that question focuses on
a realignment specification that chooses as onset the
Reagan election in 1980. The variables Presidential
Approval and Consumer Confidence enter the macropartisanship
equation with a lag of one quarter so as to
avoid problems of simultaneous causation.2 The results
in the first column of Table 3 provide strong evidence 
for a dynamic model of realignment. Sparked by the 
1980 election, the decline of (Democratic) macropar- 
tisanship proceeds at a rapid pace. The estimate for the 
initial effect (-1.0) may seem puny,3 but with a dynamic
rate of 91%, the aftershocks are nearly as strong
as the initial tremor. Quarter after quarter, they chip 
away the Democratic edge and boost the Republican 


2 Since the macropartisanship measure has a partisan direction
(percentage Democrat), both presidential approval and consumer
sentiment must be inverted when Republicans control the White
House. To do so we centered each series around its mean and
multiplied the observations for each of those variables by (-1)
whenever a Republican was in the White House.

3 This estimate should not be confused with the one for the 1980
election in Table 2. That was an estimate of the difference in average
macropartisanship before and after the election, without considering
a dynamic parameter.


FIGURE 2. The Trajectory of Realignment
Source: New York Times/CBS News polls.
Note: The observed series represents the quarterly macropartisanship
index (1976:01-1998:04). The predicted series is derived from the
parameters of the 1980 realignment (the initial effect and dynamic rate of the “1980” variable of model 2 in Table 3, with all other independent variables
held constant).


side. It is reassuring to note that the prerealignment
level of macropartisanship in the New York Times/CBS 
News series provides a constant (65.4%) that closely 
matches the long-run average of Democratic identifiers 
in NES surveys up to that time (63.7%). Given the 
virtually identical measurement instruments, that is 
how it should be. The baseline for testing the dynamic 
model of realignment with New York Times/CBS News 
surveys is not a deviant case that would produce an 
exaggerated shift in the subsequent period. 
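
The inversion of presidential approval and consumer sentiment described in the notes (centering each series on its mean and multiplying by -1 under Republican presidents) can be sketched in Python. The function name and the sample values are hypothetical.

```python
from statistics import mean


def invert_for_republicans(series, republican_in_office):
    """Center a series on its overall mean, then flip the sign of every
    observation that falls under a Republican president."""
    center = mean(series)
    return [
        -(value - center) if gop else (value - center)
        for value, gop in zip(series, republican_in_office)
    ]


# Hypothetical approval ratings (percent) and party of the incumbent.
approval = [40, 50, 60, 50]
gop_president = [False, False, True, True]
print(invert_for_republicans(approval, gop_president))
```

After the transformation, a positive value always means conditions that favor the Democrats: above-average approval of a Democratic president or below-average approval of a Republican one. That matches the Democratic direction of the macropartisanship index.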

Aside from the 1980 realignment, presidential ap- 
proval ratings, as the estimate in the first column of 
Table 3 indicates, also move macropartisanship in a 
highly significant fashion. Consumer confidence, how- 
ever, does not appear to have a direct effect. When 
jointly considered with presidential approval, this eco- 
nomic measure loses out as a predictor of macroparti- 
sanship. Still, given the fairly strong correlation (.45) 
between those two predictors, the economy has an 
indirect effect on macropartisanship, mediated by pres- 
idential approval. Economic assessments, not surpris- 
ingly, drive presidential approval ratings and through 
that vehicle manage to affect macropartisanship. Such an
indirect influence could be parceled out of the overall 
effect attributed to approval, but in the end that does 
not give us any better purchase on macropartisanship 
than does presidential approval alone. Therefore, the 
consumer confidence measure was dropped from sub- 
sequent specifications. 

To expand the event horizon of the key short-term 
variable, we added a dynamic factor for presidential 
approval ratings. According to the results presented in 


index (1976:01-1998 04) The predicted series 1s derived from the 





the second column of Table 3, the dynamic rate for 
approval proves strong and significant. In other words, 
the president’s party benefits from high ratings of his 
job performance, and the effect cumulates at an im- 
pressive rate. The question, however, is what that does 
to the realignment specification. The answer is that the 
1980 realignment model continues to perform power- 
fully even in the presence of the economic and political 
fundamentals. Given the estimate for the initial effect 
(—1.03) and a dynamic rate of 91%, the cumulative 
effect reaches 11.4 percentage points. That is the 
long-term change in macropartisanship triggered by 
the Reagan victory in 1980. This shift in the electoral 
equilibrium occurred independently of the on-going 
fall-out from presidential performance as recorded in 
job approval ratings. 
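
The arithmetic behind the cumulative effect can be reproduced in a few lines of Python. The two parameter values are the model 2 estimates quoted in the text; the function name and the choice of horizons are illustrative assumptions.

```python
def cumulative_shift(omega, delta, quarters):
    """Shift accumulated `quarters` periods after a permanent step input:
    omega * (1 + delta + ... + delta**(quarters - 1))."""
    return omega * (1 - delta ** quarters) / (1 - delta)


OMEGA, DELTA = -1.03, 0.91        # initial effect and dynamic rate from the text
long_run = OMEGA / (1 - DELTA)    # limit as the number of quarters grows
print(round(long_run, 1))         # -11.4

# Share of the long-run shift reached after 1, 4, 16, and 32 quarters.
for quarters in (1, 4, 16, 32):
    shift = cumulative_shift(OMEGA, DELTA, quarters)
    print(quarters, round(shift, 2), round(shift / long_run, 2))
```

With these values, a little more than three quarters of the long-run shift has accumulated after 16 quarters, that is, within a single presidential term.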

Before we accept the conclusion that the 1980 
election triggered a realignment, let us consider the 
alternative of a second shock in 1984, associated with 
Reagan’s fourth year in office, that culminated in his 
reelection. The results in the third column of Table 3 
fail to support the two-shock model. The initial effect 
for 1984 appears quite strong and has the right sign 
(—1.8), but it falls woefully short of significance, as 
does the dynamic rate. At the same time, the specifi- 
cation for the 1980 election survives unscathed. What 
took the wind out of the apparent effect for 1984? 

We believe the 1980 election set in motion a shift in 
the partisan equilibrium that was well under way by 
1984. Figure 2 maps the predicted trajectory of the 
shift, based on the parameter estimates for the 1980 
election in Table 3 (second column). According to that
trajectory, most of the realignment effect was concentrated
in Reagan’s first term. The pace of change slows
markedly in his second term and flattens out afterward.4
What obscures this long-term shift in the observed
series of macropartisanship, however, are short-term
setbacks for the Reagan administration,
especially the 1981-82 recession, which dropped the
president’s approval ratings to a record low in 1982. 
We do not deny that presidential approval moves 
macropartisanship, but that is short-term change 
around a baseline. As Reagan’s ratings tumbled in his 
first term, so did his party’s standing, which led to an 
above-equilibrium rise (given the Democratic direc- 
tion) in the macropartisanship index. When Reagan’s 
popularity rebounded along with the economy in late 
1983 and throughout 1984, macropartisanship inched 
back toward the equilibrium, even overshooting the 
mark. 

By no means a trivial factor for partisanship, presi- 
dential approval ratings nevertheless do not possess the 
leverage, even considering their cumulative effect, to 
generate realignment-size shifts in macropartisanship. 
Given the parameter estimates for model 2 in Table 3 
(an initial effect of .06 and a dynamic rate of .61), a 
one-point change in presidential approval produces a 
cumulative shift of just 0.15 percentage points in macropartisanship. To
engineer the realignment of the 1980s (a long-term
shift of 11 points in macropartisanship) would require
a boost of nearly 70 points in approval ratings above
the average level, which is a statistical impossibility. In
other words, the realignment of the 1980s was not the
work of exceptional prosperity or a hugely popular
president, at least not in the way those conditions, 
according to our estimates, normally affect partisan- 
ship. 
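
The same long-run ratio applies to presidential approval, and a short Python check shows why approval cannot carry the realignment. It uses the rounded coefficients quoted in the text, so the result is only approximate; the variable names are illustrative.

```python
initial_effect = 0.06   # approval, model 2 (rounded value from the text)
dynamic_rate = 0.61     # approval, model 2 (rounded value from the text)

# Cumulative shift in macropartisanship per one-point change in approval.
per_point = initial_effect / (1 - dynamic_rate)

# Approval change needed to produce an 11-point long-run shift on its own.
required_boost = 11 / per_point

print(round(per_point, 2), round(required_boost, 1))
```

With the rounded inputs the required boost comes out on the order of 70 points above the average approval level, far more than an approval series bounded between 0 and 100 can sustain.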

But realignment periods are not ordinary times. 
They are moments of qualitative, not just quantitative,
change. The upheaval surrounding an electoral realign- 
ment may very well be associated with mutations in the 
way factors such as presidential popularity and the 
economy affect partisanship. Structural change in 
model parameters cannot be ruled out (Maddala and 
Kim 1998, chaps. 13 and 15). In view of the realignment 
that unfolded during Reagan’s tenure, one could sup- 
pose that his approval rating had a structurally different 
grip on partisanship than was true under his predeces- 
sors and successors. The same could be said for eco- 
nomic sentiment, but the failure of this variable to 
prove significant in the presence of approval precludes
its further use.

A model that allows parameters to vary for the 


4 This analysis relies on the New York Times/CBS News measure for
frequent soundings of partisanship, but it should be noted that the
Gallup measure also registers an enduring shift in the wake of the
1980 election. An analysis of the Gallup series going back to 1953
and leading up to 1998 confirms a realignment effect that is similar in
nature to the one based on the New York Times/CBS News data for
1976-98. The magnitude of the immediate effect is smaller (-.49) in
the Gallup series, but the dynamic parameter (.94) is very similar,
and the cumulative shift still reaches -8.3 percentage points. At the
same time, the strong autocorrelation remaining in the Gallup series
testifies to the keener sensitivity of that survey item to short-term
disturbances.


Reagan years, during which most of the realigning shift 
accumulated, leads to the estimates shown in the 
fourth column of Table 3. To be sure, the initial effect 
estimate for Reagan’s approval (.12) is impressive, 
double the normal effect (.06). The dynamic rate looks 
formidable as well, but the negative sign hints at a 
puzzling trajectory of the cumulative effect. There is no 
point in pondering this puzzle, however, since the 
initial shock lacks significance. It also can be noted that 
the overall fit of the model including the Reagan 
interaction is no better than the model without. Which- 
ever way one looks at it, the results are disappointing 
for the structural-change hypothesis. Presidential ap- 
proval ratings did not move macropartisanship any 
more during the Reagan years than at other times 
between 1976 and 1998. The realignment in the 1980s 
was not the work of especially strong presidential 
approval. Of course, it did not hurt the GOP that, by 
1984, economic times were good and Reagan’s ratings 
were high, but those conditions only added a wiggle 
around a shifting baseline. The partisan equilibrium in 
the American electorate had begun moving before 
either prosperity or popularity took hold. 


THE NATURE OF PARTISAN EQUILIBRIUM 


The identification of a realignment may go a long way 
toward resolving a knotty puzzle about macropartisan- 
ship. On the one hand, it is undeniable that aggregate 
partisanship has not stood still during the half-century 
span of observation. It has moved, and not just back 
and forth around a stable baseline. On the other hand, 
the macropartisanship index has stayed within a narrow 
range of barely 20 percentage points. The latter points 
to the existence of equilibrium, but the former raises 
doubts about it. What kind of equilibrium, if any, does 
macropartisanship exhibit? 

The pioneers in this field make a strong case for the 
notion of a moving equilibrium (Erikson, MacKuen, 
and Stimson 1998; MacKuen, Erikson, and Stimson 
1989). The thrust of their argument is that there is no 
unique equilibrium to which partisanship will necessar- 
ily return after some departure due to external forces. 
Instead, the equilibrium is whatever is produced by the 
accumulation of political and economic conditions. In a 
sense, politics and economics constantly move macro- 
partisanship to a new equilibrium. 

In contrast, Green, Palmquist, and Schickler (1998) 
see great merit in the notion of a constant equilibrium. 
They make a compelling argument by asking hypothet- 
ically whether the fluctuations in macropartisanship 
would stay within such narrow boundaries without the 
gravitational pull of a stable force, which they believe is 
very doubtful. Indeed, the bounded nature of the 
macropartisanship movement is hard to reconcile with 
the notion of an ever-changing equilibrium. We agree 
that the familiar political and economic variables lack
sufficient leverage to shift equilibrium, even with cu- 
mulative effects taken into account. 

To be sure, it is not easy to identify a constant level
for macropartisanship over a half century even though
its movement appears to be bounded. This suggests an
intermediate type of equilibrium where shocks do not
dissipate quickly, yet they do not remain at full strength
forever (Box-Steffensmeier and Smith 1996, 1998).
Such a dynamic fits uneasily into a zone between
autoregressive adjustment and random drift. If macropartisanship
possesses equilibrium, then it must be one
that can tolerate prolonged departures or is of the
moving variety.

We take the position that macropartisanship exhibits 
a realignment-induced equilibrium. At any point there 
may be movement, but the level of macropartisanship
is fairly stable until disrupted by conditions powerful 
enough to warrant the label “realignment.” Then the 
baseline changes, and for a long time. The adjustment 
is gradual, which should not be confused with a moving 
equilibrium. The point of distinction is that in our 
model an equilibrium shift is rare, not a daily possibility 
depending on the flow of everyday politics. If the 1980 
realignment is seen as a “structural break” (Maddala 
and Kim 1998), we can demonstrate beyond a reason- 
able doubt that macropartisanship behaves as a station- 
ary process. Every version of the structural-break test, 
by a wide margin, supports that conclusion.5 At the
same time, these test results cast doubt on the latest 
technology employed by Erikson, MacKuen, and Stim- 
son (1998) in defense of their macropartisanship the- 
ory. The presence of a structural break, if not taken 
into account, will lead to erroneous evidence for a 
cointegrating relationship (Maddala and Kim 1998). 
Ultimately, it is a backhanded confirmation of the 1980 
realignment that it makes such a powerful difference: 
Ignore this intervention, and macropartisanship is a 
bundle of contradictions; take it into account, and we 
have a well-behaved specimen of equilibrium. Coming 
to grips with a realignment effect solves a major puzzle 
regarding the nature of aggregate partisanship. 
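
The decision rule behind the unit-root tests reported in the accompanying note can be written out in Python. The statistics and critical values are those quoted in the note; the dictionary labels and the function name are illustrative.

```python
# (test statistic, critical value) pairs quoted in the note on unit-root tests
TESTS = {
    "A: intercept break": (-5.26, -3.77),
    "B: slope break": (-5.81, -3.80),
    "C: intercept and slope break": (-5.23, -3.99),
    "standard Dickey-Fuller, no break": (-2.79, -2.89),
}


def rejects_unit_root(statistic, critical):
    """Left-tailed test: reject the unit-root null when the statistic
    is more negative than the critical value."""
    return statistic < critical


for name, (stat, crit) in TESTS.items():
    print(name, rejects_unit_root(stat, crit))
```

All three break-adjusted versions reject the unit root by a wide margin, whereas the standard test narrowly fails to reject it. This is the contrast the text draws between taking the 1980 break into account and ignoring it.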


CONCLUSION 


Much of the movement in aggregate party identifica- 
tion (macropartisanship) in the American electorate 
during the last half century can be traced to a major 
realignment. Our analysis, based on both the NES and 
the New York Times/CBS News surveys, demonstrates a 
substantial and lasting shift of the partisan balance. 
The 1980 election sparked a growth in Republican 
identification that shrank the overwhelming Demo- 
cratic lead dating from the New Deal realignment. 
Although the GOP did not achieve outright majority 
status, the new balance meant that the two major 


parties would compete on nearly equal terms in national


5 We examined three versions of a modified unit-root test with the
New York Times/CBS News macropartisanship series, assuming a
structural break in 1980. Version A (crash), which allows for a
one-time change of intercept, produces a test statistic of -5.26 (with
a critical value of -3.77). Version B (changing growth), which allows
for a change of slope, produces a test statistic of -5.81 (with a critical
value of -3.80). Version C (both intercept and slope changes) has a
test statistic of -5.23 (with a critical value of -3.99). Thus, the
unit-root hypothesis is firmly rejected in all three instances. In
comparison, the standard unit-root test without the specification of a
structural break is a close call: a Dickey-Fuller test statistic of -2.79,
with a critical value of -2.89.

politics, which paved the way for the Republican
victory in the 1994 congressional elections. The success 
of the Democrats in the presidential contests of 1992 
and 1996 failed to reverse the Republican gains in 
partisanship, let alone restore the pre-1980 Democratic 
lead. 

The 1980 realignment fits the pattern of a dynamic 
model with a gradual-permanent set of parameters. A 
critical moment interrupts the partisan equilibrium and 
sets in motion an evolutionary change that eventually 
settles at a new steady state. This is no overnight 
phenomenon. The process triggered by the 1980 elec- 
tion took most of Reagan’s first term before yielding 
noticeable gains in Republican partisanship. However 
much this suggests that the realignment was perfor- 
mance driven, our analysis shows otherwise. The key 
factors—presidential approval ratings and consumer 
sentiment—may generate wiggles around the partisan 
baseline, but they are not capable of triggering an 
avalanche. Reagan’s popularity did not generate the 
1980s realignment. If popularity were the key to such 
shifts, the opportunities were far better under Eisen- 
hower and even Nixon (before his disgrace). What is 
more, given the relationship between approval ratings 
and partisanship, it would require impossible, not just 
improbable, surges in popularity to account for parti- 
san realignments. 

The key to the 1980 realignment lies in the ideolog- 
ical reorientation of national policy during the Reagan 
years. Swept into office by an economic crisis of 
uncommon severity, Reagan pursued a version of 
conservatism (Reaganomics) that departed sharply 
from New Deal liberalism. His party was able to exploit 
an ideological division in the electorate, in which 
conservatives outnumbered liberals. As in previous 
realignments, the agenda of the federal government 
underwent a change that to a large extent endured 
under control by the opposition party. It would be 
foolish, of course, to claim that the 1980 shift would 
have taken hold even if the economy had not recov- 
ered, just as the New Deal realignment might have 
been doomed had the Depression continued. Exactly 
how the Reagan agenda may have helped bring about 
the 1980 realignment is a question for future research. 
nonvoting, has been increasing since 1972. During the 1960s the turnout rate trended downward both
nationally and outside the South. Although the average turnout rates for presidential and congressional 
elections are lower since 1972 than during 1948-70, the only pattern since 1972 is an increased turnout rate 
in southern congressional elections. While the voting age was lowered to 18 in 1971, the lower turnout rate 
of young voters accounts for less than one-fourth of reduced voter participation. 


The decline in voter participation is “the most
important, most familiar, most analyzed, and
most conjectured trend in recent American political
history” (Rosenstone and Hansen 1993, 57).
Researchers, theorists, pundits, politicians, and reformers
are searching for causes or cures. It is claimed
that less expensive campaigns, loftier political rhetoric,
weightier journalism, public financing, easier registration,
online voting, or more distinctive party platforms
will bring voters back to the polls.

Beginning with the work of Walter Dean Burnham
(1965, 1982, 1985, 1987), an extensive literature seeks
to explain the decline in turnout rates (e.g., Abramson
and Aldrich 1982; Ansolabehere and Iyengar 1995;
Cassel and Luskin 1988; Cavanagh 1981; Franklin and
de Mino 1998; Miller and Shanks 1996; Putnam 1995,
2000; Rosenstone and Hansen 1993; Shaffer 1981;
Teixeira 1992). The paradox is that while certain
factors favor higher turnout, notably increased education
levels, the removal of structural impediments such
as poll taxes and Jim Crow laws, and less restrictive
voter registration, turnout rates still dropped (Brody
1978). Echoing those who attempt to uncover biases in
turnout (Leighley and Nagler 1992; Shields and Goidel
1997; Wolfinger and Rosenstone 1980), Rosenstone
and Hansen (1993, 248) warn “the more recent decline
of citizen involvement in government has yielded a
politically engaged class that is not only growing
smaller and smaller but is also less and less representative
of the American polity.”

Is electoral participation declining? Contrary to conventional
wisdom, voters are not “disappearing” (Teixeira
1992). We show that although the turnout rate
outside the South is lower than in the 1950s and early
1960s, there has been no downward trend during the 
last 30 years. The apparent decline since 1972 is an 
artifact of using the voting-age population (VAP) to 
calculate the turnout rate. As Bureau of the Census 
documentation clearly states, the VAP includes people 
who are ineligible to vote, such as noncitizens, felons, 
and the mentally incompetent, and fails to include 
those living overseas but otherwise eligible (Day 1998). 

It is widely acknowledged that the VAP is substan- 
tively different from the eligible population (Andrews 
1966; Bruce 1997; Burnham 1985; Gans 1997; Plissner 
and Mitofsky 1981; Wolfinger 1993; Wolfinger and 
Rosenstone 1980), but no one has collected the data 
necessary to estimate the turnout rate over time among 
eligible voters. We calculate an accurate estimate of 
the voting-eligible population (VEP) from the VAP 
and show that, since 1972, the ineligible population is 
growing faster than the eligible population, which gives 
rise to the perception that voter participation is de- 
creasing. 


CONSTRUCTING THE TURNOUT RATE 


The turnout rate equals the total number of votes cast 
divided by the eligible electorate. As straightforward as 
this calculation may seem, a variety of measures for the 
numerator and denominator are used around the world 
(Lijphart 1997). In the United States, researchers 
primarily rely upon Census statistics of the VAP for the 
denominator. This is the most readily available num- 
ber, but it does not constitute the eligible electorate by 
any but the most extreme definitions. More important, 
the errors that result are neither random nor constant 
over time. 

We construct the turnout rate for post-World War II
elections, 1948 to 2000, by carefully distinguishing the 
VEP from the VAP. We provide the numbers used in 
our adjustments so that anyone can redefine the VEP 
and recalculate the turnout rate to test whether a 
change in definitions or estimation procedures alters 
our conclusions. 
Numerator: Total Votes Cast


Even a figure as apparently simple as the number of 
votes cast in an election must be constructed out of 
disparate data sources. Because the Constitution 
grants states the authority to regulate elections, there is 
no requirement of uniform reporting of comparable 
election data, and there is no national election com- 
mission to collect these data. We use information 
provided by the Congressional Research Service, which 
in turn contracts Election Data Services to collect 
voting statistics from each state (see note 8 below). 

The ideal numerator for the turnout rate is the total 
number of voters who cast any ballot for any office, but
this measure is not available for all states. Only 17 
states reported total turnout in 1948, and 13 still do not 
report this measure. Using this number where available 
would bias historical as well as interstate comparisons. 
Historical studies avoid this problem by using a number 
reported by all states for all years, the vote for highest 
office. In presidential election years, this is simply the 
total number of persons who voted for the presidential 
candidates. In other election years, this is the largest 
number of people who cast a vote in a statewide race, 
usually either for governor or U.S. senator. If there is 
no statewide race, the vote cast in all U.S. House 
elections in a state is combined (Crocker 1997, 6). 

Using total turnout instead of the vote for highest 
office would, of course, increase the level of turnout. 
Our data and the historical analysis of Burnham (1985) 
suggest that total turnout is on average 2.3% greater
than the vote for highest office in presidential elections 
and 2.6% greater for congressional elections. Never- 
theless, if we compare elections or states using the vote 
for highest office, we are not distorting any compari- 
sons between election years. Until total turnout is 
routinely reported by all states, researchers concerned 
with exact turnout figures, either for comparisons with 
other countries or because of the normative impor- 
tance attached to a turnout rate of more than 50%, 
should multiply (not add) the reported turnout rate by 
1.023 or 1.026. 


Denominator: Voting-Age Population versus 
Voting-Eligible Population


The turnout rate is highly sensitive to the specification 
of the total eligible population; seemingly insignificant 
changes in the denominator can reverse conclusions 
about the turnout rate. Nearly all reports are based on 
the VAP from the Bureau of the Census. Both the 
Congressional Research Service (Crocker 1996, 1997, 
1999) and the widely cited Center for the Study of the 
American Electorate (Gans 1997) rely on the P-25 
series of the bureau’s Current Population Reports, 
entitled “Projections of the Voting-Age Population for 
States,” for November of each election year. Although 
the VAP is commonly treated as the “true” error-free 
denominator, it is an estimate, albeit the best available, 
of the number of persons of voting age who reside in 
the 50 states. For non-Census years, the bureau estimates
the VAP by adjusting the last full Census to
account for deaths, the number of persons who reach
voting age, immigration, and the number of people 
entering and leaving overseas military service. As de- 
fined by the bureau in 1998, “the voting-age population 
includes all U.S. residents 18 years and over. This 
consists of both people who are eligible to vote and 
those not eligible to vote, such as non-citizens, convicted
felons, and prison inmates. These projections do
not cover Americans living overseas who may vote” 
(Day 1998, 1). 


Eligibility. Who is an eligible voter? Who should be 
included in the denominator? Is an eligible voter a 
registered citizen, a citizen who could register, any 
citizen at all, or any person in the country who could be 
made eligible to vote? We do not believe that there is 
a good argument for including only the registered, but 
all other possibilities have proponents. 

Although registration figures are widely used for the 
denominator in Europe, few in the United States 
defend studies of turnout based on registered voters. 
Turnout based on registration is used in Europe be- 
cause registration is synonymous with eligibility: It is 
generally done by the government or required by law 
(Powell 1986, 21). There is widespread agreement that 
such a restricted definition of eligibility gives a mislead- 
ing picture of the turnout rate. If registered voters were 
to be used as the denominator in the United States, 
comparisons between elections and among states 
would be confusing, because registration laws vary 
substantially. Besides, it is virtually impossible to 
gather accurate registration figures due to outdated 
registration rolls. 

The inclusion of everyone of voting age in the 
denominator has proponents on both normative and 
practical grounds. Teixeira (1992, 6) argues for the 
VAP because each person in the country of voting age 
could be allowed to vote, should the already eligible so 
decide: “At the most basic level, the voting-age popu- 
lation is the eligible electorate. Although it is little 
known, citizenship is not a constitutional requirement 
for voting in the United States. Both the time it takes 
to become a citizen (national) and the actual restric- 
tions of suffrage to citizens (states) are matters of 
legislation.” Teixeira (1992, 6) also makes a pragmatic 
argument: Adjusting the VAP to remove ineligible
voters is a “difficult and imprecise process.” Gans 
(1997, 46) defends the VAP on scholarly grounds, 
because “consistency and comparability are the only 
way that students and scholars of voting research can 
do comparative research.” 

Nevertheless, consistency and comparability do not 
force scholars to use the VAP. Burnham’s (1985, 1987) 
pioneering work, for example, is based on straightfor- 
ward methods for estimating the number of eligible 
voters in the country for every election. To be sure, 
collecting the data necessary for correcting the VAP is 
a difficult and onerous task, but there is little justifica- 
tion for making policy and normative claims on the 
basis of a statistical measure that we shall show is 
confounded in some surprising ways. 
ing the VEP. The VAP both includes ineligible and
excludes eligible voters. It includes noncitizens,
disenfranchised felons, mental incompetents, and people
who do not meet residency requirements. It excludes
military personnel and civilians living outside
the United States. Our more accurate VEP uses a
variety of government statistical series to adjust the 
VAP. We remove noncitizens using estimates found in 
the Current Population Survey (CPS) and the full 
Census of Population. We remove persons who are 
ineligible due to criminality based on Department of 
Justice statistics on the correctional population. We 
add military and civilian personnel living overseas 
using statistics from the Bureau of the Census, Depart- 
ment of Defense, Office of Personnel Management, 
and the United States Consular Service. 
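
The adjustment just described can be sketched in a few lines. This is only an illustration, not the authors' own procedure in code: the function and variable names are hypothetical, and the inputs are the 1992 entries as read from Table 1 (in thousands), so they carry whatever rounding that table carries.

```python
def voting_eligible_population(vap, noncitizens, ineligible_felons, overseas_eligible):
    """Illustrative VEP: VAP minus ineligible residents plus eligible citizens abroad.

    All arguments are counts in thousands. Residency requirements and mental
    incompetence are not adjusted for, mirroring the text.
    """
    return vap - noncitizens - ineligible_felons + overseas_eligible


def turnout_rate(votes, denominator):
    """Turnout as a percentage of the chosen denominator."""
    return 100.0 * votes / denominator


# 1992 figures as read from Table 1 (thousands); illustrative inputs only.
votes_1992 = 104405
vap_1992 = 189687
vep_1992 = voting_eligible_population(
    vap=vap_1992,
    noncitizens=17826,
    ineligible_felons=2117,
    overseas_eligible=2418,
)

vap_rate = turnout_rate(votes_1992, vap_1992)
vep_rate = turnout_rate(votes_1992, vep_1992)
print(f"VEP (1000s): {vep_1992}")
print(f"VAP turnout: {vap_rate:.1f}%  VEP turnout: {vep_rate:.1f}%")
```

Keeping the denominator as a separate function argument makes it easy to redefine the VEP and recompute the rate under a different eligibility definition.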

We do not make two adjustments. We do not 
remove the number of people ineligible due to state 
residency requirements because the CPS question on 
mobility does not employ detailed response categories, 
so it is not possible to determine how various state 
residency requirements affect ineligibility. We believe 
that this number has remained approximately 1% of 
the VAP. We also do not remove the number of 
mentally incompetent persons because we lack a reli- 
able source; we estimate that this number is approxi- 
mately one-tenth of 1% of the VAP, or approximately 
250,000 persons in 1995.1 

The statistics we use are not fully reported for every
year, and the various sources occasionally change their
definitions. At times we must draw on different sources 
or impute missing data. The methods we employ are 
detailed in the Appendix. In every case we make a 
conservative adjustment to the VAP so that our cor- 
rections do not overstate the turnout rate. 


Further Correction: The Census Undercount. The VAP 
estimate does not correct for the undercount of the 
population in the Census. The undercount is the net 
product of two errors: Some people are counted more 
than once (overcoverage), and others are not counted 
(undercoverage) (GAO 1997).2 We do not correct for 
undercounting because we are not aware of a good way 
to determine how much this affects the VAP estimate 
generated between censuses. Correcting for undercov- 
erage would actually strengthen the case that there is 
no ongoing decline in voter participation, as a more 
accurate count reduces the turnout rate, ceteris pari- 
bus. Therefore, by not making this correction we can be 
confident that we do not overstate trends. 


1 Mentally incompetent persons can be found in high-level care
nursing homes that address their special needs. Our estimate is
drawn from the 1995 National Nursing Home Survey, the most
recent of four conducted in 1974, 1977, 1985, and 1995 by the
National Center for Health Statistics. The exact number of mentally
incompetent residents of voting age is unknown, since there is no
breakdown of residents by type or age.
2 Overcoverage and undercoverage are estimated by the Bureau of
the Census in a postenumeration survey. A sample of persons in
representative areas are interviewed following the Census to determine
whether they were recorded once, twice, or not at all. The
results are extrapolated to the entire country to derive estimates
of the net errors.


The degree of undercoverage has declined since 
1940. According to the U.S. General Accounting Office 
(GAO 1997), the estimated net undercoverage was 
5.8% of the total population in 1940; 4.1% in 1950, 
3.1% in 1960, 2.7% in 1970, 1.2% in 1980, and 1.8% in 
1990. This decline would create the impression of 
declining turnout rates. For example, suppose that 
among a constant adult population of 100 million, 50 
million voted in both 1940 and 1990. The 5.8% undercount
in the 1940 Census would result in a reported
turnout rate of 50/94.2 or 53.1%. The same turnout in
1990, with a smaller undercount of 1.8%, would result
in a reported turnout rate of 50/98.2 or 50.9%. The
increased accuracy of the more recent Census would
result in a decline of 2.2 percentage points in the reported turnout rate.
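
The hypothetical above is easy to reproduce. The sketch below assumes, as the example does, that the net undercount is expressed as a share of the true population; the function name and arguments are illustrative and not taken from any Census Bureau procedure.

```python
def reported_turnout(votes, true_population, net_undercount):
    """Turnout rate (%) computed against a census count that misses a share of people.

    net_undercount is the fraction of the true population not counted,
    e.g. 0.058 for a 5.8% net undercount.
    """
    counted_population = true_population * (1.0 - net_undercount)
    return 100.0 * votes / counted_population


votes = 50.0        # millions of votes cast in both years (hypothetical)
population = 100.0  # millions of adults in both years (hypothetical)

rate_1940 = reported_turnout(votes, population, 0.058)
rate_1990 = reported_turnout(votes, population, 0.018)
apparent_decline = rate_1940 - rate_1990

print(f"1940 reported rate: {rate_1940:.1f}%")
print(f"1990 reported rate: {rate_1990:.1f}%")
print(f"Apparent decline:   {apparent_decline:.1f} percentage points")
```

Actual participation is identical in both years; the whole apparent decline comes from the denominator becoming more complete.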


The VAP and VEP Turnout Rates 


In Table 1 we report the data necessary to construct the 
national turnout rate from 1948 to 2000 using the VAP 
and the VEP as the denominator. In addition, after 
1971 we report an estimate of persons age 18-20 and
the number who voted in order to account for the 
effects of the 26th Amendment. Below, we replicate 
our analysis within and outside the South to control for 
the dramatic rise in participation in that region. 

In recent decades the two groups removed in our major
corrections to the VAP, noncitizens and ineligible felons,
have grown faster than the population as a whole.
The percentage of noncitizens among the voting-age 
population has risen steadily, from 2% in 1966 to 8.0% 
in 2000. As for ineligible felons, the historical average 
before 1982 was 0.5% of the voting-age population, 
and the figure rose to 1.4% in 2000. The number that
needs to be added to the VAP (eligible voters living
abroad) remained at nearly the same percentage
throughout our analysis, about 1.5%, relative to the
resident voting-age population; the percentage was 
higher during the Korean and Vietnam conflicts and at 
the peak of the Cold War in the 1980s. During the 
1990s the overseas percentage slightly decreased as the 
U.S. military presence declined more than the number 
of civilians living abroad increased. 

Figure 1 plots the VAP and VEP turnout rates for 
presidential elections since World War II. A glance at 
the VAP line shows why analysts who take the VAP 
turnout rate at face value are understandably worried 
about civic erosion, a possible dearth of social capital, 
and the decline of the public sphere in America. 
During the 1970s and 1980s there was a steady drop of 
nearly 10 percentage points from the high in 1960, and 
the lowest point in the postwar period—indeed, in the 
century—was reached in 1996. The post-1972 VEP 
turnout rate does not decline as much. Adjustments to 
the VAP do not simply move the VEP turnout rate 
upward a constant amount across time. The lines 
diverge after 1972, when the ineligible population 
began growing faster than the total population. 

Although a casual look at the 1972-2000 points
in Figure 1 may suggest continuing decline, a prudent
and statistically sound assessment is that there is no
TABLE 1. National Turnout Rates

     Vote for    Voting- Turnout                                                   Turnout Citizens  Voters        Turnout
      Highest        Age    Rate                   Ineligible       Overseas          Rate      Age     Age       Rate VEP
       Office Population     VAP Noncitizens  Adj.     Felons  Adj.      VEP  Adj.     VEP    18-20   18-20  Adj.  Age 21+
Year  (1000s)    (1000s)     (%)     (1000s)   (%)    (1000s)   (%)  (1000s)   (%)     (%)  (1000s) (1000s)   (%)      (%)
1948    48839      95573    51.1        2198  +1.2        348  +0.2      440  -0.2    52.2
1950    41984      98134    42.8        1880  +0.8        372  +0.2      391  -0.2    43.6
1952    61552      99920    61.6        1899  +1.2        379  +0.2     1131  -0.7    62.3
1954    43854     102075    43.0        1839  +0.8        411  +0.2      987  -0.4    43.5
1956    62027     104515    59.3        1988  +1.1        428  +0.2      981  -0.6    60.2
1958    47203     106447    44.3        2129  +0.9        464  +0.2       91  -0.4    45.0
1960    68838     109672    62.8        2193  +1.3        481  +0.3      912  -0.6    63.8
1962    53141     112962    47.0        2259  +1.0        401  +0.2     1113  -0.6    47.7
1964    70645     114090    61.9        2282  +1.3        478  +0.3     1212  -0.7    62.8
1966    57188     118638    48.2        2363  +1.0        448  +0.2     1621  -0.7    48.7
1968    73213     120285    60.8        2768  +1.4        421  +0.2     1856  -0.9    61.5
1970    58014     124498    46.6        3148  +1.2        443  +0.2     1765  -0.7    47.3
1972    77719     140777    55.2        3840  +1.5        443  +0.2     1581  -0.6    56.2    10726    4819  +1.0     57.2
1974    55944     146338    38.2        4148  +1.1        496  +0.1     1510  -0.4    39.1    11288    2126  +1.7     40.8
1976    81556     152308    53.5        4568  +1.7        588  +0.2     1582  -0.5    54.8    11706    4322  +1.6     56.4
1978    58918     155809    37.8        5760  +1.5        629  +0.2     1753  -0.4    39.0    11870    2062  +1.7     40.7
1980    86515     163945    52.8        6827  +2.3        803  +0.3     1803  -0.6    54.7    11538    4068  +1.5     56.2
1982    67616     166724    40.6       10554  +2.7        832  +0.2    18682  -0.5    43.0    10873    2028  +1.8     44.8
1984    92653     173995    53.3       13252  +4.4       1153  +0.4     2361  -0.7    57.2    10802    3799  +1.4     58.6
1986    64981     177922    36.5       12223  +2.7       1308  +0.3     2218  -0.4    39.0     9839    1625  +1.4     40.4
1988    91685     181956    50.3       13942  +4.2       1633  +0.4     2527  -0.7    54.2     9732    3206  +1.3     55.5
1990    67859     185888    36.5       16297  +3.5       1845  +0.4     2669  -0.5    39.8     9644    1629  +1.4     41.2
1992   104405     189687    55.0       17826  +5.7       2117  +0.6     2418  -0.7    60.6     8546    3445  +1.1     61.7
1994    75106     193163    38.9       13205  +2.9       2385  +0.5     2229  -0.4    41.8     9464    1502  +1.4     43.2
1996    96263     196928    48.9       13948  +3.7       2545  +0.6     2499  -0.6    52.6    10021    3081  +1.3     53.9
1998    72637     200928    36.1       15070  +2.9       2822  +0.5     2837  -0.5    39.0    10416    1378  +1.5     40.5
2000   105326     205813    51.2       16500  +4.6       2851  +0.7     3008  -0.7    55.6    10786    3208  +1.6     57.1

Sources: Vote for Highest Office: Congressional Research Service, "Voter Registration and Turnout 1948-1994" and memorandum, "Voter Registration
and Turnout 1996." For 1998, provided by Royce Crocker at Congressional Research Service; 2000 provided by ABC News Polling Unit, compiled by
Associated Press. Voting-Age Population: Bureau of the Census Current Population Report P25-1132, “Projections of the Voting-Age Population for
States: November 1998" and previous issues of same series; 2000 projection from Census Bureau, http /Anww.cersus.gov/populatonserenw,
voting-htm!, accessed 25 May 2001. Noncitizens: Current Population Survey, Voter Supplement File (various years), Bureau of the Census publication
series P-20, “Voting and Registration in the Election of [various years]” and Census of the Population (various years). Ineligible Felons: Correctional
Populations of the United States (various years, as of December 31), Department of Justice reports (Beck 2000, 2001; U.S. Department of Justice,
Department of Justice Statistics 2000), and ICPSR Study #6912 “Historical Statistics on Prisoners in State and Federal Institutions, Yearend 1925-1988."
Overseas VEP: Statistical Abstract of the United States (various years), unpublished reports from United States Consular Services (various years), Military
Personnel Historical Reports (various years), and Office of Personnel Management Manpower Reports (various years). Persons Age 18-20 and turnout
estimates: Current Population Survey Voter Supplement File (various years).

Note: VAP is the voting-age population, a Bureau of the Census estimate of people of voting age living within the United States. VEP is the
voting-eligible population, our estimate of people eligible to vote in U.S. elections. Numbers are subject to rounding. Data include Washington, DC, in 1964, 1968, and
1972-2000 as well as Alaska and Hawaii as of 1960. Excluded is Louisiana 1978 and 1982 (no statewide November election).





post-1971 trend in the presidential turnout rate among 
those eligible to vote. Regressing the 1972-2000 VEP 
presidential turnout rate on a linear trend variable 
produces estimates of a minuscule decline of 0.007
percentage points per election, or a total drop of 0.05 
percentage points for the period. Yet, the standard 
error is so large (0.40) relative to the estimated trend 
that we cannot reject the null hypothesis of no trend in 
the presidential VEP turnout rate from 1972 to 2000. 
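
As a rough check on this kind of trend test, the sketch below fits an ordinary least squares line to the eight presidential VEP turnout rates for 1972-2000 as read from Table 1. Those values are rounded to one decimal, so the output can only approximate the estimates reported in the text; the helper name `linear_trend` and the 0, 1, 2, ... coding of elections are assumptions made for illustration.

```python
import math


def linear_trend(values):
    """OLS slope of values on an election counter 0, 1, 2, ... and its standard error."""
    n = len(values)
    t = list(range(n))
    t_mean = sum(t) / n
    y_mean = sum(values) / n
    sxx = sum((ti - t_mean) ** 2 for ti in t)
    sxy = sum((ti - t_mean) * (yi - y_mean) for ti, yi in zip(t, values))
    slope = sxy / sxx
    intercept = y_mean - slope * t_mean
    # Residual sum of squares, then the usual slope standard error with n - 2 df.
    sse = sum((yi - (intercept + slope * ti)) ** 2 for ti, yi in zip(t, values))
    std_err = math.sqrt(sse / (n - 2) / sxx)
    return slope, std_err


# Presidential VEP turnout rates (%), 1972-2000, as read from Table 1.
vep_presidential = [56.2, 54.8, 54.7, 57.2, 54.2, 60.6, 52.6, 55.6]

slope, std_err = linear_trend(vep_presidential)
print(f"trend per election: {slope:.3f} points (standard error {std_err:.2f})")
print(f"ratio of trend to standard error: {slope / std_err:.2f}")
```

A slope this small relative to its standard error is what leads to the conclusion that the null hypothesis of no trend cannot be rejected.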

Recent rates are inconsistent and appear susceptible 
to short-term forces. The VEP turnout rate in 1992 is no longer a
minor deviation in a period of continual decline but is 
rather within the range of turnout rates in the 1950s 
and 1960s. Whatever else one may conclude about 
media, parties, campaigns, and civil society in the 
current era, turnout rates today can still attain their 
former level. 
In 1996 the VAP rate reached its low since World War
II, and the VEP rate dropped as well, to 52.7%, but that
is slightly higher than the low of 52.2% in 1948. This is a 
small difference, but when the two VEP calculations are 
further corrected for the Census undercount, we find a 
difference of 2.5 percentage points. 

The low voter participation in 1948 is not given the 
attention it merits since the 1952 election is the starting 
point for most scholarly analyses. Although the decline 
from the 1950s and early 1960s buttresses the case for 
reform, the 1948 figure is not merely an aberration. 
Burnham (1987) adjusted turnout rates for the pres- 
ence of noncitizens among the VAP before 1948, and 
his results show that voter participation from 1920 





3 After correcting for the undercount the 1948 turnout rate is 49.3%
(52.2%/1.058) and the 1996 rate is 51.8% (52.7%/1.018), a difference
of 2.5 percentage points.
FIGURE 1. National VAP and VEP Presidential Turnout Rates, 1948-2000
through 1948 is strikingly similar to the turnout rate
after 1972. The highs in the 1950s and 1960s are 
actually quite unusual for the period after the weaken- 
ing of political machines (Burnham 1987) and perhaps 
were more momentary than usually supposed. 


The Twenty-Sixth Amendment. The drop in turnout 
between 1968 and 1972 is usually attributed to expan- 
sion of the franchise from age 21 to age 18 (Rosenstone 
and Hansen 1993, 57).4 This is a plausible assumption, 
since turnout rates for younger persons are lower than 
for older persons. Table 1 shows the turnout rate 
excluding persons under age 21. The numerator is 
derived by using the CPS figures to calculate the 
proportion of all votes reported by persons age 21 and 
older, and then multiplying that proportion by the total 
number of votes cast for highest office; the denomina- 
tor is derived by removing the number of citizens age 
18-20 from the VEP (see the Appendix). The average 
effect of removing this group since 1971 is an increase 
of 1.3 percentage points in presidential elections and 
1.7 percentage points in congressional elections. That 
is slightly more than one-fourth the decrease of 4.7 
percentage points for the VEP turnout rate between 
1948-68 and 1972-2000. Because a large drop in voter
participation occurred between 1968 and 1972, it is 
assumed that the new voters were a significant factor in 
the decline. Yet, the downward trend of the 1960s 
carries through to the 1972 election even though 
people age 18-20 are not included prior to 1972. And 
the turnout rate of voters under age 21 was 49.2% in 


4 Before 1971, four states allowed persons under age 21 to vote: 
Georgia since 1944 (18+), Kentucky since 1956 (19+), Alaska since 
1960 (19+), and Hawaii since 1960 (20+) (GAO 1997). 





1972, according to the CPS, so their presence in that 
election only lowered the rate by a single percentage 
point. 
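
The age 21+ correction can be illustrated the same way. The sketch below treats the "Voters Age 18-20" column of Table 1 as already scaled to the vote for highest office, which is an assumption about how that column was built; the function name is hypothetical, the inputs are the 1992 entries in thousands, and the VEP is derived here from the VAP, noncitizen, felon, and overseas columns of the same row.

```python
def turnout_age_21_plus(votes, vep, voters_18_20, citizens_18_20):
    """VEP turnout rate (%) after removing citizens age 18-20 from both
    the numerator (their votes) and the denominator (their number)."""
    return 100.0 * (votes - voters_18_20) / (vep - citizens_18_20)


# 1992 figures as read from Table 1 (thousands); illustrative inputs only.
votes = 104405
vep = 189687 - 17826 - 2117 + 2418   # VAP - noncitizens - felons + overseas

rate_all = 100.0 * votes / vep
rate_21_plus = turnout_age_21_plus(votes, vep, voters_18_20=3445, citizens_18_20=8546)
adjustment = rate_21_plus - rate_all

print(f"VEP turnout, all ages: {rate_all:.1f}%")
print(f"VEP turnout, age 21+:  {rate_21_plus:.1f}%")
print(f"Adjustment:            {adjustment:+.1f} points")
```

Because the youngest voters are a small share of the eligible electorate, even a large gap between their turnout and that of older voters moves the overall rate by only about a point.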

Southern and Nonsouthern Turnout Rates. We need to 
account for the dramatic rise in southern turnout rates 
during the 1960s to make comparisons of the aggregate 
turnout rate over time. The civil rights movement led 
to the Voting Rights Act, which effectively enfran- 
chised blacks and poor whites in the South. Accord- 
ingly, voter participation rose dramatically in that 
region during the 1960s (Kousser 1999). 

The increase in southern turnout masks some of the 
decline in the rest of the nation. The corrections 
performed in Table 1 are repeated in Tables 2 and 3 for
nonsouthern and southern states, respectively. Figure 2 
plots the national, southern, and nonsouthern age 21+ 
VEP turnout rate for presidential elections since 1948, 
thereby controlling for the effects of the extended 
franchise and the elimination of Jim Crow laws. Turn- 
out rates in the South rose precipitously as the elector- 
ate mobilized, but elsewhere the electorate contracted 
(DeNardo 1998).5 

Nationally and regionally, using either VAP or VEP 
in the denominator of the turnout rate, there are two 
distinct eras of post-World War II turnout divided by 
1971. The national VAP presidential turnout rate is an 


5 Southern states are Alabama, Arkansas, Florida, Georgia, Louisiana,
Mississippi, North Carolina, South Carolina, Texas, and Virginia.
The contribution of the southern turnout rate to the national
figure increased over the last half of the twentieth century. In 1960,
approximately one in four eligible voters resided in the South. As
in-migration to that region increased, the number rose to almost one
out of three in 1996. Thus, decline in nonsouthern voter participation
is offset by the shift in distribution of eligible voters across regions.
TABLE 2. Nonsouthern Turnout Rates

     Vote for    Voting- Turnout
      Highest        Age    Rate                   Ineligible
       Office Population     VAP Noncitizens  Adj.     Felons
Year  (1000s)    (1000s)     (%)     (1000s)   (%)    (1000s)
1948    44129      76182    57.9        1752  +1.4        131
1950    39393      77254    51.0        1468  +1.0        134
1952    53890      77710    69.3        1476  +1.3        151
1954    40386      79734    50.7        1515  +1.0        159
1956    54343      81823    66.4        1555  +1.3        178
1958    43750      82649    52.9        1653  +1.1        184
1960    59618      86289    69.1        1726  +1.4        190
1962    47431      88711    53.5        1774  +1.1        178
1964    59476      89221    66.7        1784  +1.4        169
1966    47803      91018    52.5        1820  +1.1        147
1968    59658      93594    63.7        2153  +1.5        180
1970    48488      96520    50.2        2413  +1.3        212
1972    63485     108494    58.5        3038  +1.7        242
1974    47171     111878    42.2        3244  +1.3        293
1976    64689     115823    55.9        3475  +1.7        326
1978    48490     119832    40.5        4673  +1.6        361
1980    67453     123573    54.6        5314  +2.5        473
1982    54595     126707    43.1        8869  +3.2        517
1984    71034     129436    54.9       10484  +4.8        632
1986    49490     131904    37.5        9636  +3.0        706
1988    69977     134951    51.9       11066  +4.6        847
1990    51326     136703    37.5       12850  +3.9       1020
1992    78842     138934    56.7       14171  +6.4       1187
1994    57023     141438    40.3       10042  +3.1       1307
1996    71871     142889    50.3       10161  +3.9       1419
1998    55123     145081    38.0       11461  +3.3       1567


average 7.1 percentage points lower during 1972-2000 
than during 1948-68. The VEP turnout rate is also on 
average lower, but by 4.7 percentage points. The gains 
in southern presidential turnout rates (an increase on 
average of 8.9 for VAP and 9.9 for VEP) are offset by 
losses outside the South (a decrease on average of 11.1 
for VAP and 8.6 for VEP). 

As shown in Table 2, the nonsouthern VAP turnout 
rate exhibits continued decline of greater magnitude 
than the national rate (Table 1), although both pat- 
terns are interrupted by the high turnout in the 1992 
election. The most noticeable disjunctures between 
the VAP and VEP nonsouthern turnout rates begin in 
1972 (Table 2). For nonsouthern elections we estimate 
a small downward trend in VEP of 0.22 percentage 
points per election, with a standard error of 0.44, which 
implies no statistically significant trend. The southern 
presidential VEP turnout rate (Table 3) trends upward 
an estimated 0.87 percentage points per election, with 
a standard error of 0.37, which gives some confidence 
(90%) in the trend.7
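
The confidence attached to these trend estimates can be checked with the ratio of each estimate to its standard error. The Python sketch below is illustrative only. It assumes the trends are fitted to the eight presidential elections from 1972 to 2000 with an intercept and a slope (six degrees of freedom), which the text does not state explicitly, and the name `t_ratio` is hypothetical.

```python
# Two-sided critical values of Student's t with 6 degrees of freedom.
# ASSUMPTION: 8 presidential elections (1972-2000) minus 2 estimated parameters.
T_CRIT_90 = 1.943
T_CRIT_95 = 2.447


def t_ratio(estimate, standard_error):
    """Return the ratio of a trend estimate to its standard error."""
    return estimate / standard_error


# Estimates quoted in the text (percentage points per election).
southern_vep = t_ratio(0.87, 0.37)       # upward trend
nonsouthern_vep = t_ratio(-0.22, 0.44)   # downward trend

for label, t in [("southern VEP", southern_vep),
                 ("nonsouthern VEP", nonsouthern_vep)]:
    if abs(t) >= T_CRIT_95:
        verdict = "significant at 95%"
    elif abs(t) >= T_CRIT_90:
        verdict = "significant at 90% but not at 95%"
    else:
        verdict = "not significant at 90%"
    print(f"{label}: t = {t:.2f} ({verdict})")
```

Under that assumption the southern ratio falls between the two critical values, which is consistent with the 90% confidence reported, while the nonsouthern ratio is well below either.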





6 For nonsouthern states, we estimate a downward trend of 0.79
percentage point per election, with a standard error of 0.31.

7 We estimate an upward trend for the southern VAP presidential
turnout rate of 0.38 percentage point per election, but with a
relatively large standard error of 0.30.
2000  77995    148696      52.5  12289        +4.7  1620  +0.5  2174  -0.8  56.9  7703  2322  +1.6  58.5
Note: Nonsouthern is the District of Columbia plus all states except those listed in Table 3 as “southern.”
                            Turnout  Citizens  Voters          Rate
            Overseas        Rate     Age       Age             VEP
      Adj.  VEP       Adj.  VEP      18-20     18-20     Adj.  Age 21+
Year  (%)   (1000s)   (%)   (%)      (1000s)   (1000s)   (%)   (%)
1948  +0.1  356       -0.3  59.1
1950  +0.1  313       -0.2  51.9
1952  +0.1  899       -0.8  70.0
1954  +0.1  785       -0.5  51.2
1956  +0.1  781       -0.6  67.2
1958  +0.1  751       -0.5  53.6
1960  +0.2  717       -0.6  70.1
1962  +0.1  874       -0.5  54.1
1964  +0.1  948       -0.7  67.4
1966  +0.1  1265      -0.7  52.9
1968  +0.1  1444      -1.0  64.4
1970  +0.1  1368      -0.7  50.9
1972  +0.1  1219      -0.6  59.6     8621      4000      +1.2  60.8
1974  +0.1  1155      -0.4  43.1     9080      1840      +2.0  45.1
1976  +0.2  1188      -0.6  57.1     9284      3429      +1.8  58.9
1978  +0.1  1326      -0.4  41.8     9408      1746      +2.1  43.8
1980  +0.2  1354      -0.6  56.6     9659      3238      +2.0  58.7
1982  +0.2  1478      -0.5  46.0     8657      1638      +2.1  48.0
1984  +0.3  1752      -0.7  59.2     7931      2012      +1.6  60.7
1986  +0.2  1638      -0.5  40.1     7899      1237      +1.5  41.6
1988  +0.3  1868      -0.7  56.0     7372      2449      +1.4  57.5
1990  +0.3  1857      -0.5  41.1     7117      1180      +1.5  42.8
1992  +0.5  1772      -0.7  62.9     6268      2802      +1.1  64.0
1994  +0.4  1628      -0.5  43.3     6847      1140      +1.5  44.8
1996  +0.5  1784      -0.6  54.0     7055      2223      +1.4  55.3
1998  +0.4  2121      -0.5  41.1     7366      1047      +1.6  42.6








CONCLUSION 


We attribute the apparent decline in turnout rate since 
1972 to an increasing number of ineligible persons 
being counted among the VAP, which is the denomi- 
nator for the calculations by the Bureau of the Census. 
We disagree with Abramson, Aldrich, and Rohde 
(1998, 68), who contend that correcting the denomina- 
tor leads only to “relatively small differences in the 
overall estimate of turnout.” Correcting the data 
changes the level of the turnout rate sufficiently to 
come to some different conclusions about the trends. 
Contrary to Teixeira (1992, 25), we agree with Wolfin- 
ger (1993, 7) that use of the VAP leads to errors that 
are unevenly distributed geographically and chronolog- 
ically and produces “quite misleading distortions.” 
The great divide in the turnout rate is the 1972 
election. Based on the VEP, with or without adjust- 
ments for the inclusion of younger voters, nationally 
and outside the South there are virtually no identifiable 
turnout trends from 1972 onward, and within the 
South there is a clear trend of increasing turnout rates. 
Our analysis points to a surge in nonsouthern voting 
in the 1950s, followed by a decline during the 1960s. 
Since then, turnout is lower, but there is no “down- 
ward trend.” Absent further decline, divided govern- 
TABLE 3. Southern Turnout Rates

      Vote for Voting-     Turnout                                                       Turnout  Citizens  Voters          Rate
      Highest  Age         Rate                        Ineligible        Overseas        Rate     Age       Age             VEP
      Office   Population  VAP      Noncitizens  Adj.  Felons      Adj.  VEP       Adj.  VEP      18-20     18-20     Adj.  Age 21+
Year  (1000s)  (1000s)     (%)      (1000s)      (%)   (1000s)     (%)   (1000s)   (%)   (%)      (1000s)   (1000s)   (%)   (%)
1948  4132     17888       23.1     411          +0.5  132         +0.2  84        -0.1  23.7
1950  2590     1%17        13.3     369          +0.3  137         +0.1  79        -0.1  13.6
1952  7681     20016       38.3     380          +0.7  155         +0.3  232       -0.4  38.9
1954  3469     20489       16.9     389          +0.3  163         +0.1  202       -0.2  17.2
1956  7684     20919       36.7     397          +0.7  184         +0.3  200       -0.3  37.4
1958  3453     21934       15.7     439          +0.3  191         +0.1  1989      -0.1  16.1
1960  9220     23383       39.4     468          +0.8  19          +0.3  184       -0.3  40.2
1962  5710     24238       23.6     485          +0.5  182         +0.2  239       -0.2  24.0
1964  11168    24869       44.9     497          +0.9  171         +0.3  264       -0.5  45.6
1966  8385     25620       32.7     512          +0.7  147         +0.2  356       -0.4  33.1
1968  13565    26734       50.7     615          +1.2  181         +0.3  412       -0.8  51.4
1970  9526     27980       34.0     700          +0.9  215         +0.3  397       -0.5  34.7
1972  14234    32282       44.1     613          +0.9  271         +0.4  383       -0.5  44.8     2668      797       +1.4  46.2
1974  8772     34458       25.5     896          +0.7  329         +0.2  356       -0.3  26.1     2822      307       +1.4  27.5
1976  18888    36486       46.2     1095         +1.4  3864        +0.5  374       -0.5  47.6     3010      877       +1.8  49.4
1978  10427    35781       29.1     1048         +0.9  390         +0.3  427       -0.3  30.0     2927      334       +1.7  31.7

1980  19062    41024       46.5     1518         +1.8  329         +0.4  449       -0.5  48.1     3000      858       +1.6  49.7
1982  13020    40312       32.3     1896         +1.6  4146        +0.3  604       -0.3  33.8     2714      352       +1.7  35.4
1984  21618    45030       48.0     2792         +3.2  621         +0.6  609       -0.6  51.1     2872      686       +1.5  52.6
1986  15501    46571       33.3     2855         +2.0  602         +0.4  578       -0.4  35.3     2764      403       +1.4  36.7
1988  21617    47827       45.2     2870         +2.9  688         +0.7  681       -0.6  48.1     2694      757       +1.3  49.4
1990  16533    49109       33.7     3438         +2.5  828         +0.6  703       -0.5  36.3     2821      446       +1.4  37.7
1992  25563    50595       50.5     3693         +4.0  930         +0.9  645       -0.6  54.8     2471      844       +1.2  56.0
1994  18082    52212       34.6     3237         +2.3  1058        +0.7  601       -0.4  37.3     2760      362       +1.5  38.7
1996  24382    53608       45.5     3108         +2.8  1126        +1.0  715       -0.6  48.7     2645      732       +1.2  49.9
1998  17414    55848       31.2     3618         +2.1  1255        +0.7  816       -0.4  33.6     3054      279       +1.5  35.1
2000  27331    57118       47.9     4209         +3.8  1330        +1.1  834       -0.7  52.1     3083      883       +1.5  53.6

Note: Southern states are Alabama, Arkansas, Florida, Georgia, Louisiana, Mississippi, North Carolina, South Carolina, Texas, and Virginia.
ment (Franklin and de Mino 1998), negative campaigning
(Ansolabehere and Iyengar 1995), and declining
civic engagement (Putnam 2000) have not depressed 
turnout. Whether due to the Cold War, a popular 
military hero as president, the emergence of national 
television news media not yet cynical and distrustful of 
power, or other factors, it was the 1950s that were 
unusual in being the high point of twentieth-century 
voter participation outside the Jim Crow South. Old 
theories will have to be transformed or new theories 
will have to be developed to explain the decline in the
turnout rate during the 1960s. 

An explanation for lower turnout in America than in 
most other industrial democracies, we suggest, must 
begin with the institutional structure of the political 
system, not the psychology of the voters or the tactics 
of the parties and candidates. Powell (1986) notes that 
in contrast to parliamentary systems, which foster 
strong national parties and clear lines of responsibility 
for government performance, responsibility is divided 
between state and national governments in the United 
States, and between two legislatures and an executive 
at each level. Federalism and the separation of powers 
increase the costs to voters to gather and process the 
information about which vote, for which candidate, for 
which office, on which date, matters for a given issue, 
and registration is neither done by the government nor 
compulsory. Furthermore, the frequent primary and 
general elections required to fill the many elected 
offices increase the burden of democracy for the voter. 
Indeed, the other two industrialized democracies with 
chronically low turnout rates are Switzerland and Ja- 
pan, countries with diffused lines of authority and 
responsibility. 


APPENDIX 


The data and methods we used to construct our measure of 
the turnout rate among eligible voters are described below. 
Our sources do not report all the data we needed to construct 
fully each component measure for every year. When possible, 
we developed procedures for imputing the missing data, as 
detailed below. 
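
As a rough illustration of how the components described in this appendix combine, the following Python sketch computes a VEP turnout rate from the columns of Table 3. The function and argument names are hypothetical, and the formula (VAP minus noncitizens and ineligible felons, plus eligible citizens overseas) is a reading of the table layout rather than a procedure quoted from the text.

```python
def vep_turnout_rate(votes, vap, noncitizens, felons, overseas):
    """Return the turnout rate (percent) among the voting-eligible population.

    All counts are in thousands, as in Tables 2 and 3.
    """
    # Remove ineligible groups from the VAP and add eligible citizens abroad.
    vep = vap - noncitizens - felons + overseas
    return 100.0 * votes / vep


# Southern figures for 1948 as printed in Table 3.
rate_1948 = vep_turnout_rate(votes=4132, vap=17888, noncitizens=411,
                             felons=132, overseas=84)
print(round(rate_1948, 1))
```

With these inputs the sketch gives a rate that rounds to the 23.7 shown in the VEP column for that year.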

When we analyze the effect of the 26th Amendment in 
1971, we further adjust the VAP using estimates of the age 
distribution of the total population from the P-25 Current 
Population Reports. We use the Current Population Survey 
Voter Supplement File to determine the proportion of 
persons age 18-20 among eligible voters and remove this 
proportion from the vote for highest office. 


Turnout


The numerator of the turnout rate is the number of persons 
who vote in a given election. Our national and state data are 
drawn from a 1996 Congressional Research Service (CRS) 
report on turnout in the 1948-94 elections, a 1997 memo- 
randum on 1996, and a 1999 memorandum on the 1998 
election (Crocker 1996, 1997).8 As described in the text, two 





8 Royce Crocker of the CRS provided the 1998 numbers to us as an 
Excel spreadsheet. Since 1986 the CRS has contracted with an 
outside vendor, Election Data Systems, to collect turnout and 
registration information from the states. Election Data Systems, in
numbers found in these reports are commonly used as the
numerator in studies of turnout: vote for highest office and
total vote. We use the vote for highest office in our analysis.


Voting-Age Population 


The base number for the denominator of the turnout rate is 
the national and state VAP estimates drawn from the P-25 
series Current Population Reports. The Bureau of the 
Census compiles these adjustments to the Census from 
various sources that report preliminary numbers. The bureau 
then releases final estimates. The 1994, 1996, 1998, and 2000 
VAP estimates we use are preliminary figures still subject to 
minor changes. 


Noncitizens


We use two methods to estimate the number of noncitizens 
among the VAP. From 1948 through 1966, we use a tech- 
nique proposed by Burnham (1985) in his study of turnout 
rates in the nineteenth century to interpolate the number of 
noncitizens for inter-Census years. We interpolate the per- 
centage of noncitizens reported in the 1940 and 1950 Census 
and between the 1950 Census and the 1966 Current Popula- 
tion Survey (the 1960 Census did not include a citizenship 
question). Between 1950 and 1966 the number of noncitizens 
in the VAP was virtually unchanged, rising from 1.9% in 1950 
to 2% in 1966. We are confident that this simple procedure 
does not miss an intervening wave of immigration, since legal 
inflows reported by the U.S. Immigration and Naturalization 
Service (1997) were small and stable from 1950 to 1966. 
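
The interpolation step for inter-Census years can be sketched as follows. This is a minimal illustration that assumes simple linear interpolation between benchmark years, which the text does not spell out; the function names and the VAP figure of 80,000 thousand are hypothetical, while the 1.9% and 2% benchmarks come from the paragraph above.

```python
def interpolate_share(year, year0, share0, year1, share1):
    """Linearly interpolate a percentage between two benchmark years."""
    if not year0 <= year <= year1:
        raise ValueError("year must lie between the two benchmark years")
    weight = (year - year0) / (year1 - year0)
    return share0 + weight * (share1 - share0)


def noncitizen_count(vap_thousands, share_percent):
    """Apply a noncitizen share (percent) to a VAP figure in thousands."""
    return vap_thousands * share_percent / 100.0


# Benchmarks quoted in the text: 1.9% in 1950 and 2% in 1966.
share_1958 = interpolate_share(1958, 1950, 1.9, 1966, 2.0)

# HYPOTHETICAL VAP of 80,000 thousand, used only to show the arithmetic.
count_1958 = noncitizen_count(80000, share_1958)
print(f"{share_1958:.2f}% of the VAP, about {count_1958:.0f} thousand noncitizens")
```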

Beginning in 1966, we estimate the number of noncitizens 
in the VAP from the Current Population Survey Voter 
Supplement Files.10 The CPS allows us to avoid the problems
of interpolation during the wave of immigration that began in
the 1970s and peaked in 1991.11 Yet, use of the CPS
introduces the sampling and measurement errors associated 
with surveys. In particular, there is a reported decline of 4.6 
million noncitizens between the 1992 estimate of 17.8 million 
and the 1994 estimate of 13.2 million that is puzzling. We 
believe the change in CPS methodologies is primarily respon- 
sible for the difference. 

The CPS has used three different sets of citizenship 
questions. From 1966 to 1976, respondents were not directly 
asked if they were citizens but whether they were registered 
to vote. If not, they were asked why, and “not a citizen” was
one of the options. From 1978 to 1992, respondents were 
queried directly about their citizenship status, and their “yes” 
or “no” answers were recorded. Between 1976 and 1978 the 
CPS revealed an increase in the number of noncitizens, from 


consultation with the CRS, compiled data from the CRS records, 
America Votes, and information supplied by Curtis Gans to construct
turnout figures dating back to 1948 (personal correspondence with 
Royce Crocker, April 22, 1999). 

9 There is a slight discrepancy between the modern published reports
of the 1948 presidential vote totals and historical reports, such as in
P25-185. We use the slightly larger P25-185 number.

10 The first CPS Voter Supplement File questionnaire, in 1964, did 
not include a citizenship question. 

11 In 1991, the U.S. Immigration and Naturalization Service (1997)
reports 1.8 million legal immigrants entered the country, the highest
number in American history. Immigration has since declined slightly
but still remains at historical 

12 There is a spike in noncitizens between 1980 and 1982. Some of
this is due to a flood of immigrants from Cuba who participated in
the Mariel boatlift, as well as to new asylum laws that were
introduced in late 1980.




4.5 million to 5.8 million. This is probably a true increase and 
not an artifact of changing survey methods. Although the 
increase is slightly larger than the 1970s trend, the U.S. 
Immigration and Naturalization Service (1997) reports a rise 
in legal immigration in 1978. 

In 1994, the CPS citizenship item was changed again to 
determine possible types of status. Respondents were given 
five options: “Native, Born in the United States”; “Native, 
Born in Puerto Rico or U.S. Outlying Area”; “Native, Born 
Abroad of American Parent or Parents”; “Foreign Born, U.S. 
Citizen By Naturalization”; and “Foreign Born, Not a Citizen 
of the United States.” Moreover, there were significant 
methodological changes that may affect the CPS estimates. 
The 1994 CPS was the first in our analysis to use the 1990 
Census as a baseline to construct weights and the first to use 
computer-aided interviews. 

Is it possible that the number of noncitizens declined
between 1992 and 1994? We think not. In 1991 and 1992, 
there were 3.4 million legal immigrants who entered the 
country and 550,000 naturalizations. In 1993 and 1994, there 
were 1.9 million legal immigrants and 750,000 naturalizations 
(U.S. Immigration and Naturalization Service 1997). Since 
the net changes in legal immigration and naturalization 
cannot account for the decline, could a decrease this large 
occur solely due to large changes in the rates of out-migration 
and illegal immigration? There was heightened emphasis on 
controlling illegal entry under the Clinton administration, as
well as more anti-immigration legislation. There could have
been a true decline in illegal entry or greater reluctance of
noncitizens to acknowledge their status, but we cannot say 
whether the 1994 number is more or less accurate than 
previous numbers. We do know, however, that legal immi- 
gration statistics clearly reveal a rise in the number of legal 
entrants that began in the 1970s, peaked in 1991, and 
declined slightly thereafter, consistent with the overall CPS 
trend. The numbers are consistent with two possibilities: less 
overreporting of citizenship in 1994 than in 1992, or more 

in 1994 than in 1992. Thus, while our calcu- 
lation of the VEP for any one election is susceptible to errors 
in the CPS survey methodology, we are confident that the 
overall character of the trends in noncitizens we discuss is 
correct. 


Ineligible Felons 


Depending upon state law, felons may not vote if in prison, 
on probation, or on parole, and they may even be permanently
disfranchised.14 Four states, Maine, Massachusetts,
Utah, and Vermont (now three, following the 2000 adoption 
of a restrictive constitutional amendment in Massachusetts), 
did not disfranchise even prisoners during the period of 
study. The first three categories of disfranchised felons (in 
prison, on parole, or on probation) are compiled for 1986-96 
from the U.S. Department of Justice, Bureau of Justice 
Statistics, annual Correctional Populations of the United
States.15 Nearly all prisoners and parolees and half of all
probationers have been convicted of a felony (U.S. Depart- 



14 See the Department of Justice report, “Civil Disabilities of Con- 
victed Felons: A State-by-State Survey” (Love and Kuzma 1996). 
15 Footnotes in Correctional Populations detail that state agencies
report their information with varying levels of accuracy, particularly 
the number of felons on parole and probation. States that do not 
report these data tend to be the same states that grant felons on 
probation or parole the right to vote. Additional error arises from the 
practice by some states of combining their prison and jail populations 
into one reported number. These five states (Alaska, Connecticut, 
ment of Justice, Bureau of Justice Statistics 2000, 2). We
assume that all prisoners, all parolees, and half of all probationers
are felons. For pre-1986 data, we used ICPSR Study
#8912, “Historical Statistics on Prisoners in State and Fed- 
eral Institutions, Year End 1925-1986” (U.S. Department of 
Justice, Bureau of Justice Statistics, 1997). Some felony 
prisoners are noncitizens, although the exact number over 
time is unknown, so an adjustment is not made.17

We do not have a source for the number of felons on 
probation or parole before 1986, so we had to estimate this 
number. We calculate national and regional estimates for the 
number of ineligible felons using the ratio between the 
number of prisoners and the number on probation and parole 
for 1986 to 1996, for which all data are available. With the 
year as the unit of analysis, from 1986 to 1996 a regression 
estimates that nationally there were 1.50 ineligible felons on 
parole or probation for every ineligible felon in prison. The 
southern states had 2.18 ineligible probationers and parolees 
for every ineligible prisoner, and nonsouthern states had
1.13.18 
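
The imputation for years before 1986 can be illustrated with a short Python sketch. The three ratios are those reported in the paragraph above; the function name and the prisoner count of 100 thousand are hypothetical and serve only to show the arithmetic, not to reproduce any table entry.

```python
# Ineligible probationers and parolees per ineligible prisoner (from the text).
NATIONAL_RATIO = 1.50
SOUTHERN_RATIO = 2.18
NONSOUTHERN_RATIO = 1.13


def impute_ineligible_felons(prisoners, ratio):
    """Estimate total ineligible felons from a prisoner count.

    The result is in the same units as `prisoners`.
    """
    on_probation_or_parole = ratio * prisoners
    return prisoners + on_probation_or_parole


# HYPOTHETICAL input: 100 thousand ineligible prisoners nationally.
example_total = impute_ineligible_felons(prisoners=100.0, ratio=NATIONAL_RATIO)
print(example_total)
```

With the national ratio, every prisoner implies 2.5 ineligible felons in total, so the imputed count scales directly with the prison population.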

Before 1972, we make one further adjustment to the 
estimate of the number of ineligible felons. We must calcu- 
late the percentage of felons age 18-20; otherwise, when we 
subtract felons from the VEP, we will overcorrect the denominator
and overstate the true rate of turnout. We must
estimate the unreported age distribution of felons. We begin 
by assuming that felons on probation and parole are older 
than 20. We then assume that 18-to-20-year-olds are 15% of 
the prison population. 

It is more difficult to determine the numbers of felons 
permanently disenfranchised despite having completed pro- 
bation or parole, and we have not included this group in our 
measure of ineligible felons. 

The Sentencing Project and Human Rights Watch (Fellner 
and Mauer 1998) conducted an exhaustive study of state data 
and estimated that in 1996 there were approximately 1.39 
million permanently disfranchised felons. They compiled the 
number of felons released since 1970 within states that have 
such laws and adjusted for recidivism. The study does not 
account for felons who vote despite being ineligible, moved 
or died. Our measure of ineligible felons is most certainly
an underestimate that probably is correlated with the actual 
number of disfranchised felons, since a felon must first go 
through the correctional system before becoming perma- 
nently disfranchised. If these 1996 reports are accurate, the 
permanently disfranchised number slightly more than one 
half of the 2.85 million disfranchised felons in prison, on 
probation, or on parole. 


Eligible Overseas Voters


Eligible voters living overseas are comprised of military 
personnel and their dependents, nonmilitary government


Delaware, Hawaii, and Rhode Island), plus the District of Columbia,
tend to have small prison populations.

16 No breakdown of federal prisoners among the states is available.
We divide federal prisoners among states by assuming that states
have the same share of federal prisoners that they have of state
prisoners.
17 In 1994, about 30,000, or 12%, of the felony prisoners were
identified as noncitizens (U.S. Department of Justice, Bureau of 
Justice Statistics, 1996). 

18 The regression estimate of this ratio is very consistent, with a 
standard error of less than .01. Unless major changes in incarceration
patterns occurred in the past, we believe this imputation method is 
sound. 

19 A small number of permanently disfranchised felons have been 
granted the right to vote through pardons. 
personnel, and nongovernment civilians. Our primary
sources for these figures are the Statistical Abstract of the 
United States and the Census. Because the Statistical Abstract 
does not regularly report the total overseas population, let 
alone its age distribution, we must turn to other sources and 
make a number of assumptions. We supplement the Statisti- 
cal Abstract and the Census with Department of Defense 
records, various Office of Personnel Management reports,
and estimates provided by the U.S. Consular Services of the 
total number of civilians abroad (which includes those work- 
ing for private companies). 

The estimation procedures differ for the periods 1948—66, 
1968-82, 1984-92, and 1994-96 because of changes in 
reporting. For some time periods there are separate numbers 
for military dependents, nonmilitary government personnel, 
and “other civilians,” while for other years all nonmilitary 
categories are combined. For all years, we have accurate 
numbers on overseas military personnel, available on-line 
from the Department of Defense web site.20 The post-Vietnam
high was 609,000 overseas personnel in 1988, while
today there are 240,000. We assume the age distribution of 
soldiers overseas is the same as the domestic population. We 
must use different techniques, however, for the number of 
overseas 


1948-66. To arrive at the number of eligible voters living 
overseas from 1948 to 1966, we estimate the nonmilitary 
number for the three years for which we have data: 1950, 
1960, and 1968. We then interpolate that number for the
years with missing data and add the number of eligible 
military personnel overseas (these data are available 
throughout the series).21

The 1950 Census reports the number of military personnel 
and their dependents living overseas. We do not know the 
age distribution of these groups, so we assume the relative 
proportions younger and older than 21 are the same as in the 
domestic population and that military personnel are at least 
18 years old. We use the P-25 series report, which summarizes
yearly estimates of the age distribution of the domestic
population, to adjust the overseas population by removing 
the estimated number of persons under age 21. 

The 1960 Census reports similar data and an additional 
category, the number of civilians living overseas. We make 
the same assumptions and adjustments as before for military 
personnel and their dependents. We assume that the age 
distribution of civilians living overseas is the same as the 
domestic population and remove persons under age 21. 


1968-82. The overseas eligible voters from 1968 to 1982 
are estimated from the 1984 Statistical Abstract, Table 4, 
“U.S. Population Living Abroad: 1968 to 1981.” Data are
persons who would need to be evacuated in the event of a
crisis. We use this as the total of overseas civilians. Again, we 
make the same assumptions and adjustments for the age 
distribution of this group as before, and we use similar 
military statistics. We deflate these figures by an estimate of 
the number of persons under age 18. 


1994-98. We estimate the number of eligible voters over- 
seas for 1994 and 1996 using unpublished reports of the 
overseas civilian population provided by the U.S. Consular 
Service. We again deflate this number by the proportion of 
the domestic population under age 18. We then add the 
number of military personnel abroad, provided by the De- 
partment of Defense, and the number of nonmilitary govern- 
ment personnel, from Office of Personnel Management 
reports (we assume these employees are at least 18 years old)
to arrive at an estimate of the eligible voters overseas. 


2000. For 2000 we used the 1998 proportion of overseas 
citizens multiplied by the 2000 voting age population since 
the 2000 numbers were not available as of this writing. 
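
The 2000 extrapolation amounts to carrying the 1998 overseas share forward to the 2000 VAP. The sketch below is a hedged illustration: the function name is hypothetical, and applying the rule to the nonsouthern figures from Table 2 (rather than to national totals) is an assumption made only to have concrete inputs.

```python
def extrapolate_overseas(overseas_base, vap_base, vap_target):
    """Scale an overseas count by the ratio of two VAP figures (thousands)."""
    proportion = overseas_base / vap_base
    return proportion * vap_target


# Nonsouthern figures as printed in Table 2: 1998 overseas and VAP, 2000 VAP.
estimate_2000 = extrapolate_overseas(overseas_base=2121, vap_base=145081,
                                     vap_target=148696)
print(round(estimate_2000))
```

With these inputs the result rounds to 2174, the overseas figure printed in the 2000 row of Table 2.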

The number of military and nonmilitary government em- 
ployees living abroad is likely to be accurate, since these data 
are compiled and reported by the government. The estimate
of civilians abroad is based on reports by consulates, which 
vary in the accuracy of their reporting. The total number of 
overseas civilians reported in 1998 was 3.1 million, a high 
since World War II. Footnotes that accompany data origi- 
nating from the U.S. Consular Service (for 1968-81 in the 
1984 Statistical Abstract, and unpublished data from 1987 
through the present provided by the Consular Service) warn 
that the civilian population overseas is almost certainly 
underestimated. A similar warning accompanies the 1960 
Census, which stresses that participation in counting was 
voluntary. We are highly confident of the overseas military 
and government employee figures, but we are less confident 
about our estimates of overseas civilians. 


Persons Age 18-20 


In order to control for the effect of the 26th Amendment, we 
calculate a turnout rate for persons at least 21 years old. 
From the CPS we calculate the proportion of voters age 
18-20 and the proportion age 21 and older. We use this latter 
figure and the total votes for highest office to calculate the 
number of votes cast by persons older than 20. We remove 
the number of citizens age 18-20 from our measure of the 
VEP. We similarly remove estimates of persons in that age 
group among felons and the overseas population, where 
appropriate, in order to avoid double counting. We then 
calculate a new turnout rate using the adjusted numerator 
and denominator. 
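
A minimal sketch of the age 21+ adjustment follows. It starts from the counts of citizens and voters age 18-20 printed in Table 3 rather than from CPS proportions, and it assumes those columns are already net of felons and overseas persons in that age group; the function name is hypothetical.

```python
def turnout_rate_age_21_plus(votes, vep, voters_18_20, citizens_18_20):
    """Return the VEP turnout rate (percent) for persons age 21 and older.

    All counts are in thousands.
    """
    adjusted_votes = votes - voters_18_20      # adjusted numerator
    adjusted_vep = vep - citizens_18_20        # adjusted denominator
    return 100.0 * adjusted_votes / adjusted_vep


# Southern figures for 2000 as printed in Table 3.
vep_2000_south = 57118 - 4209 - 1330 + 834     # VAP - noncitizens - felons + overseas
rate_21_plus = turnout_rate_age_21_plus(votes=27331, vep=vep_2000_south,
                                        voters_18_20=883, citizens_18_20=3083)
print(round(rate_21_plus, 1))
```

For these inputs the rate rounds to the 53.6 shown in the final column of Table 3 for 2000.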
subsamples of the CPS to make all the same adjustments
using the CPS from 1966 onward, as detailed above. 

For the remainder of our adjustments, data are missing, 
and we must make some assumptions. Before 1966, we use 
the national estimate of the proportion of noncitizens among 
the VAP as the regional estimate. In 1966, when regional 
numbers became available, both the southern and nonsouth- 
ern proportions of noncitizens were 2% of the regional VAP, 
so we believe this is a reasonable assumption. Because we do 
not know the home state of eligible voters living abroad, we 
allocate the overseas population between the two regions in 
proportion to the VAP of each region. 
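
The proportional allocation described above can be sketched in a few lines of Python. The function name is hypothetical, the regional VAP values are the 2000 figures from Tables 2 and 3, and the national overseas total of 3,000 thousand is a made-up round number used only for illustration.

```python
def allocate_by_vap(total_overseas, vap_by_region):
    """Split a national overseas count across regions in proportion to VAP."""
    total_vap = sum(vap_by_region.values())
    return {region: total_overseas * vap / total_vap
            for region, vap in vap_by_region.items()}


# 2000 VAP (thousands) by region; HYPOTHETICAL national total of 3000.
allocation = allocate_by_vap(3000, {"nonsouthern": 148696, "southern": 57118})
for region, count in allocation.items():
    print(f"{region}: {count:.0f} thousand")
```

Because each region receives the same share of the overseas population as it holds of the VAP, the regional counts always sum back to the national total.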
Culture and Equality: An Egalitarian Critique of Multiculturalism.
By Brian Barry. Cambridge, MA: Harvard University
Press, 2001. 399p. $35.00.


Monique Deveaux, Williams College 


Barry is well known for his hard-nosed, no-nonsense defenses 
of traditional liberal principles, as evidenced in his Theories of 
Justice (1989) and Justice as Impartiality (1995). Here he turns 
his attention to arguments by proponents of multiculturalism 
in political life, liberal and nonliberal alike. The book is 
divided into three main sections: Multiculturalism and Equal 
Treatment, Multiculturalism and Groups, and Multicultural- 
ism, Universalism, and Egalitarianism. The central argument 
does not so much unfold gradually as it is asserted at the 
outset and then defended intermittently thereafter. 

Barry’s task is twofold: Offer a thorough and critical 
analysis of recent political theory that advocates cultural 
pluralism, and defend core liberal principles and a concep- 
tion of liberalism as equal justice for all against what the 
author plainly sees as the misguided efforts of multicultural- 
ists. Barry seeks to wrest liberal principles from the hands of 
so-called liberal proponents of multiculturalism (such as Will 
Kymlicka and Chandran Kukathas) as well as to show why 
efforts to discredit liberalism as inadequate to the demands of 
culturally plural societies are mistaken. Rather than eschew 
liberal norms or reform them beyond all recognition, Barry 
argues, we need to seek solutions from within liberalism. 
Specifically, we need to extend equal rights and treatment to 
all, and we must rectify the pervasive social and economic 
disadvantages that multiculturalists wrongly impute to 
groups’ distinctive cultural identities. 

Barry makes no attempt to obscure his thoroughly skeptical
view of this literature. Indeed, he states at the outset that,
in his “naively rationalistic way, I used to believe that 
multiculturalism was bound sooner or later to sink under the 
weight of its intellectual weaknesses and that I would there- 
fore be better employed writing about other topics” (p. 6). 
Only multiculturalism’s inexplicable persistence persuaded 
him to undertake the task of rebutting the arguments of 
leading proponents. As befits this motivation, his central aim 
is to demonstrate that much of the normative argumentation 
underpinning proposals for greater accommodation of cultural
minorities in liberal democratic states is fundamentally
flawed. Barry also seeks to show that many if not most of the 
specific proposals advanced by contemporary political theo- 
rists for special rights, exemptions, and other arrangements 
for cultural minorities are poorly defended and potentially 
disastrous from the vantage point of politics. 

Among the thinkers Barry takes on in his thorough and 
highly critical survey of the literature are Bhikhu Parekh,
Chandran Kukathas, Will Kymlicka, Charles Taylor, James 
Tully, and Iris Young. In his discussion of the various 
arguments of these and other proponents of cultural plural- 
ism, Barry brings his analytical skills to bear in exposing 
vague, contradictory, and poorly argued aspects of their 
work, and he employs his political savvy to speculate about 
the potential effects of concrete proposals for accommodat- 
ing cultural minorities. At times this strategy is remarkably 
insightful, as in his exploration of the implications of the 
suggestion (which he attributes to Iris Young) that liberal 
cultural minorities should have disproportionate power in 
shaping the public policies that apply to their own practices 
(p. 303). 
At other times, Barry too readily engages in sarcastic
dismissal of the works he discusses. His writing style is highly 
rhetorical and often witty, but the witticisms frequently come 
at the expense of authors whose arguments or political 
visions are forced through Barry’s analytical grinder. Exam- 
ples include Iris Young’s Justice and the Politics of Difference 
(1990), which Barry discusses at length and whose central 
arguments he rejects on the basis of some very persuasive 
reasons. Nonetheless, punctuating his analysis are derisive 
asides—Young ultimately advances a “perverse thesis about 
the assimilationist impulse behind liberalism” (p. 69)—and he
twice quotes a passage from Young’s book that discusses an 
idea he finds particularly absurd, namely, the notion of a 
distinct women’s culture as including traditional women’s arts 
and rituals deriving from witchcraft (pp. 94 and 278). 

Similarly, it is difficult to avoid the conclusion that Barry is 
merely poking fun at a central aboriginal image from James 
Tully’s Strange Multiplicity: Constitutionalism in an Age of 
Diversity (1995)—the black canoe, employed by Tully as a
metaphor for his vision of constitutionalism amid diversity— 
even though Barry hastens to assure readers that “there is 
nothing intrinsically absurd in any of the argumentative 
strategies that Tully attributes to the mythological bear” (p. 
262). References to the Politically Correct Thought Police 
(p. 271) and the Commissioners of Political Correctness (p.
328) abound, reinforcing Barry’s view that cultural pluralism 
as a political strategy is the concoction of a handful of 
posturing intellectuals who rely on dogmatism to support 
their cause. 

As ever, Barry’s strong suit is his talent for articulating the 
essence of liberal principles and presenting persuasive (al- 
though not, in my view, ultimately convincing) arguments to 
support his case against self-styled liberal and postliberal 
theories of multiculturalism. A key strength of the book is 
Barry’s knack for demonstrating that often political theorists 
who purport to amend flaws within liberal theory simply 
misconstrue or fail to understand what liberal norms, princi- 
ples, and conceptual distinctions actually entail. For example, 
the liberal distinction between public and private has, Barry 
argues, been profoundly misconstrued by critics. Urging us 
back to John Stuart Mill (whom he discusses at length), Barry 
persuasively argues that such a division is indispensable for
justice. 

Barry also unpacks and holds up to critical scrutiny much 
of the rhetoric of recent political theory that endorses 
cultural pluralism, sometimes in quite useful ways. His dis- 
cussion in the final chapter, “The Politics of Multicultural- 
ism,” effectively demonstrates the dangers of the presump- 
tion that liberal democracies should move toward a 
conception of the “special interests” of ethnic and cultural 
minorities, in part because such a conception obscures the 
ways in which public institutions and resources (like educa- 
tion) are a matter of critical concern for all citizens, regard- 
less of their ethnicity. 

Barry’s claims that “a politics of multiculturalism under- 
mines a politics of redistribution” (p. 8) and that “pursuit of 
the multiculturalist agenda makes the achievement of 
broadly based egalitarian policies more difficult” by “divert- 
ing political effort away from universalistic goals” (p. 325) 
combine to shape a provocative and sobering thesis. The 
book is, however, a bit short on solutions to the very social 
and economic disadvantages and inequalities that are, Barry 
claims, misattributed (by proponents of multiculturalism) to 
culture and group identity. It is hoped that a follow-up book 
is in the offing, in which Barry will address precisely these 
problems from his trenchant liberal social justice perspective. 
The Postmodern Marx. By Terrell Carver. University Park:
Pennsylvania State University Press, 1999. 240p. PANY 
cloth, $19.95 paper. 


Ernesto Laclau, University of Essex 


Two main features of this remarkable book deserve special 
mention. The first is its attempt to reflect on Marx’s texts in
the light of present social and political concerns. This is the 
reason for “postmodern” in the title. The present is not seen 
by Carver as organized around a unified perspective but as a 
true plurality: He does not take refuge in any ready-made 
coherence or in easy labels. The complexities of the present 
are projected into the Marxian text—or, rather, rediscovered 
in it—so that they lead to its authentic deconstruction. One 
ends with the feeling that there is much more to be found in 
the texts of Marx, despite their anachronisms and limitations,
than conventional wisdom would have us believe. 

The second important feature is that the Marx Carver finds 
at the end of his exploration is also plural. A great deal of 
literature has tried to discover the “true” Marx, but we get 
from Carver a fascinating unveiling of the multiplicity of the 
Marxian text, of the various discursive sequences whose unity 
results from contingent articulations rather than from any 
underlying univocal principle. Seen from this angle, the work 
of Marx appears as a sort of microcosm in which we find, in
nuce, all the potential and often contradictory trends of the
history of Marxism in the century following the founder’s 
death. Being partly inside and partly outside that history, 
Carver sees this multiplicity of crossroads already operating 
in the founding gestures of Marx himself and gives a pene- 
trating analysis of its constitutive dimensions. 

In the introduction Carver states the different paradigms, 
both theoretical and political, that have governed the reading 
of Marx’s texts. Later, he successively analyzes the role of 
metaphor in theoretical writing, the link between Marx’s 
approach and the idealist philosophy immediately preceding 
him, the shortcomings of the rational choice reading of his 
texts, the framework of his political economy approach, his 
conception of communism, the main dimensions of his 
political theory and strategy, the interpretative strategies to 
be found in the translations of his work, his relations with 
Hegelian philosophy, and even those aspects that Marx did 
not touch: women and gender. 

Carver is very much aware of the main contemporary 
developments in textual analysis: from rhetoric to decon- 
struction in its Derridean version and from the hermeneutics 
of a Gadamer or a Ricoeur to the contextualism of Quentin 
Skinner and the Cambridge School. The traces of these 
methodological approaches (or, rather, of these reading 
practices, given that the notion of “method” would be very 
much contested by some of these currents) can be found 
everywhere in Carver's book, and their employment is always 
careful and rigorous. Carver has written a work that can be 
considered a model of contemporary textual analysis. 


Deliberative Democracy and Beyond: Liberals, Critics, and 
Contestations. By John S. Dryzek. Oxford: Oxford Univer- 
sity Press, 2000. 195p. $29.95. 


Bob Pepperman Taylor, University of Vermont 


John Dryzek identifies the “deliberative turn” in democratic 
theory in the past decade and wishes to “establish what a 
defensible theory of democracy would look like” (p. 2). The 
fundamental features of the theory he develops are: a critical 
hostility to what he takes to be liberalism’s view of democracy 
as simple interest aggregation; a subsequent defense of
deliberative democracy’s educative and transformative potential;
and an argument for distancing deliberative democracy
from the state and formal voting mechanisms and
emphasizing instead the democratic importance of a plural, 
politicized, and vocal civil society. 

In elaborating these ideas, Dryzek hopes to reinvigorate a 
radical critical theory that he fears has become too closely 
allied with liberalism in the past few years. This allows him to 
distinguish, on the one hand, between what he takes to be the 
full critical potential of deliberative democracy and, on the 
other hand, conventional liberalism, rational choice theory, 
and the work of “difference” democrats. Having carved out 
the terrain of his own theory, he then applies this theory to 
the consideration of two interesting and important cases: the
democratic potential in international politics and the rela- 
tionship between democracy and Green politics. Throughout, 
he is deeply concerned with what he calls democratic “au- 
thenticity,” or “the degree to which democratic control is 
engaged through communication that encourages reflection 
upon preferences without coercion” (p. 8). 

There is much to like about and learn from this small but 
densely argued volume. At the most general level, all scholars 
interested in the relationship between democracy and volun- 
tary organizations in civil society will find important food for 
thought. Although Dryzek focuses primarily on politicized 
organizations in civil society rather than the whole range of 
voluntary associations (Greenpeace, say, rather than Robert 
Putnam’s bowling leagues and choral societies), he makes a 
strong case for thinking that many such organizations may 
make their greatest contributions to democratic deliberation 
by maintaining their independence from the state, by viewing 
their contribution to democracy as flowing from the deliber- 
ation they stimulate in the polity more generally rather than 
the degree to which they either share formal political power 
or aim primarily to influence the outcome of elections. 

This idea allows Dryzek to offer a novel argument about 
the role of an international “civil society” in promoting 
transnational democracy. The model he has in mind is of 
networks of organizations, perhaps like those working and 
demonstrating in recent years against free trade, or the 
universe of international environmental organizations. In the 
course of making these arguments, Dryzek discusses social 
choice theory, the recent liberal turn of critical theory (most 
notably Habermas), and “difference” theory, all of which will
be welcomed by theorists in these fields as well as by those 
involved with debates about “public reason” and the proper 
constraints on political debate (Dryzek defends a moderately 
broad understanding of these constraints). 

As in most richly argued and thoughtful books, there is 
much with which to quarrel. I shall mention three of the main
issues that may generate discussion. First, a strong case can 
be made that Dryzek’s portrait of “liberalism” is much too 
narrow and constrained. We should not forget, for example, 
that one of the most important “deliberative” democrats was 
John Dewey, perhaps the dominant liberal voice from the 
first half of the twentieth century in the United States. It is 
true that for much liberal political theory democracy is 
narrowly construed as the aggregation of interests. This by no 
means exhausts the liberal literature or position, however, as 
works from John Stuart Mill to Amy Gutmann and Dennis 
Thompson demonstrate. Nor do all liberals “fail to recog- 
nize... that getting constitutions and laws right is only half 
the battle” (p. 21); one need only think of liberal feminism,
such as that developed by Susan Moller Okin, to see how 
unpersuasive this claim is. Critics will suspect that Dryzek’s 
liberalism comes dangerously close to being a straw man. 
More generously, others may suggest that his constrained 
view of liberalism unnecessarily restricts his access to literatures
and arguments that would be most useful to his own


A second worry concerns the connection Dryzek makes 
between democracy and civil society. He recognizes that 
many organizations in civil society are neither internally 
democratic nor representative of majority public opinion, but 
he nonetheless believes some of these organizations are 
democratic because they give voice to excluded
groups or interests. By making our understanding of democ- 
racy “discursive rather than electoral” (p. 51), Dryzek severs 
(or at least seriously weakens) the link that legitimizes 
democracy through the empirical act of voting. In order to 
evaluate the democratic integrity of a particular discourse or 
group, he is left only with arguments about authenticity (he 
holds that authenticity in deliberation requires no more than 
that communication be based upon generalizable and non- 
coercive positions) and the substantive degree to which 
communication aims toward the liberation of legitimate
voices. Many readers will suspect that both of these arguments
are more complicated, difficult, and ambiguous than
Dryzek appears to believe. 

A third and related concern is that although Dryzek never 
holds that liberal democracy and deliberative democracy are 
incompatible, he obviously emphasizes the latter and dis- 
counts the former in his discussion. Although many (like 
myself) will be persuaded that democracy cannot be reduced
to voting, many (like myself) will also think that Dryzek is in
danger of losing sight of the fundamental, even primary, 
importance of voting and elections in any democratic system. 

There are other issues, such as Dryzek’s discussion of 
Green democracy, that could be the focus of an entire review. 
Even though this book will raise many questions and criti- 
cisms, these will reflect not Dryzek’s failure as a political 
thinker but his strength as an interesting, provocative, and 
serious democratic theorist. 


Enduring Liberalism: American Political Thought Since the 
1960s. By Robert Booth Fowler. Lawrence: University of 
Kansas Press, 1999. 332p. $35.00. 


Nancy S. Love, Pennsylvania State University 
Has conflict replaced consensus in American political 
thought since the 1960s? This is the central question of 
Enduring Liberalism, and Fowler answers it with a carefully 
nuanced “no.” “To grapple with contemporary interpreta- 
tions ... there is no substitute for understanding their prede- 
cessors. For contemporary views are in constant dialogue 
with them” (p. 34). He reconstructs this dialogue between the 
political thought of post-1960s public intellectuals and the 
general public, on the one hand, and earlier interpretations 
of American values, on the other. 

The first three chapters move quickly from nineteenth- 
century classic analyses (de Tocqueville and Turner), to 
Progressive Era conflict interpretations (Charles Beard and 
Vernon Parrington), to post-World War II consensus theories
(Louis Hartz and Daniel Boorstin), and then to challenges
to consensus—from the Left, feminists, conservatives,
and postmoderns—during and following the 1960s. Fowler 
claims these critiques have congealed around an “interpretive 
orthodoxy” that “a consensus does not exist in the United 
States and never has. A frequent corollary is that 1950s 
consensus advocates were political conservatives devoted to 
the defense of an elitist, inegalitarian past that was not worth
preserving” (p. 36). 

In the rest of the book, Fowler contests both claims and 
offers his own interpretation of American political thought—
scholarly and popular—since the 1960s. Finding “no consensus
favoring liberalism among contemporary American intellectuals,”
he concludes that “diversity is alive and well in
American intellectual life, and it is more and more the 
reality” (p. 61). Public opinion tells a different story, however: 
“Substantial differences of opinion on basic (and liberal) 
political values do not exist within the American public” (p. 
62). 

Fowler examines debates among public intellectuals on 
three topics: the meaning of community (chap. 6), the 
struggle for environmental protection (chap. 7), and the 
decline of civil society (chaps. 8 and 9). His analysis is 
comprehensive, ranging far beyond standard Right/Left ideo- 
logical positions. For example, he discusses proponents of the 
following versions of community: inclusive, national democratic,
participatory democratic, small group and therapeutic,
spiritual, and traditional. Throughout the book Fowler identifies
doubts about liberalism (at least, when defined as
radical individualism, individual rights, and/or market capi- 
talism) shared by public intellectuals, despite their different 
visions of American democracy. 

Fowler sets the intellectual debates against the backdrop of 
a largely privatized American public, a bittersweet triumph of 
the core liberal principle of individual freedom (chaps. 4-5). 
He presents the American public as, in Alan Wolfe’s phrase, 
“one nation after all,” although a nation that lacks robust 
concepts of democratic citizenship or public life. Instead, a 
weak consensus limits even the most conflictual policy de- 
bates to “the current liberal framework” (p. 245). Fowler’s 
discussion of how liberal individualism has pervaded private 
life, especially family and religion, is fascinating, despite his 
questionable association of privatization with 1960s-style 
liberation. Racism, he argues, remains the deepest division 
among American citizens and the greatest threat to national 
unity. 

As even this brief summary suggests, Fowler provides a
comprehensive survey of American political thought, includ- 
ing an invaluable bibliography. Yet, breadth occasionally 
supersedes analytical clarity. For example, the line between 
public intellectuals and public opinion is less clear and less 
clearly drawn than Fowler suggests. He defines the former as 
“a certain kind of reflective participant in American public 
life” (p. x). His discussions include Robert Bellah, Jean 
Bethke Elshtain, Amitai Etzioni, Michael Walzer, William 
Bennett, Seymour Martin Lipset, William Galston, Stephen 
Macedo, and many, many others. Unfortunately, Fowler also 
draws on their ideas and surveys to characterize American 
public opinion. One is left wondering whether Fowler’s claim 
that the American public embraces liberal values is itself an 
artifact of the public intellectuals he studies. 

The author’s definition of liberalism is also rather ambig- 
uous. Fowler acknowledges that diversity exists not only 
outside liberalism but also within it: “Liberalism is a complex 
outlook, after all, with somewhat porous and highly contested 
definitional boundaries and many internal tensions” (p. 250). 
When discussing conservatism, Fowler asks: “How much 
does it illuminate anything to say ‘They are all liberals’?” Yet,
he still tends to interpret distinct intellectual positions as 
variations on liberal themes (p. 123). For Fowler, most 
feminisms fall within liberalism (p. 131), environmentalism 
includes many liberal moments (p. 198), communitarians 
belong to the liberal tradition they criticize (p. 174), and the 
decline of neoconservatism is a partial victory, since its 
proponents endorse the economic values of classical liberals 
(p. 248). The conclusion of Fowler's story of American 
political thought is that “much intellectual criticism of liberal 
institutions or even liberal ideology—often in language that
quite explicitly repudiates liberalism—amounts to much less 
than it appears” (p. 247). 

Early in Enduring Liberalism, Fowler cautions that “telling
a story and endorsing it have no necessary connection” (p. 
23). He also maintains that “at the heart of the rebirth of 
liberal thought since the 1960s has been the courage to do 
liberal theory, rather than only tell its story or cautiously 
reflect on the ideas of others” (p. 57). This book is best read
on both levels, as story and theory. In the conclusion, Fowler 
identifies his personal politics: “part Enlightenment liberal, 
part Burkean conservative, part Emersonian anarchist, and 
part religious existentialist” (p. 243). This is telling. Guiding 
Fowler’s story of American political thought is a desire to 
keep hope for an enduring—and evolving—liberalism alive.
Of course, the question remains for whom liberalism is the 
best, last hope. The story Fowler tells shows that to this 
question there are no easy answers. 


Negotiating Postmodernism. By Wayne Gabardi. Minneap- 
olis: University of Minnesota Press, 2001. 192p. $42.95 
cloth, $16.95 paper. 


Leslie Paul Thiele, University of Florida 


This fast-paced tour of the postmodern condition will repay 
readers who may not have the time for more leisurely reading 
and reflection. It supplies both a useful guide to the theoret- 
ical terrain and a menu of practical responses to the social, 
political, and cultural challenges that postmodernism pre- 
sents. 

Wayne Gabardi’s thesis is that the polemics of the 1970s 
and early 1980s that characterized the initial interaction 
between those modernists indebted to critical theory (such as 
Habermas) and those postmodernists indebted to Heidegger 
and Nietzsche (such as Foucault and Derrida) have given way 
to a period of “critical postmodernism,” marked by dialogue, 
accommodation, and synthesis. This thesis is not wholly 
[...]
Stephen White (Political Theory and Postmodernism, 1991), 
Richard Bernstein (The New Constellation, 1991), and others. 
Yet, the continuing failure of many modernists to distinguish 
between radical and evolved forms of postmodernism, dis- 
missing the second on account of the excesses of the first, and 
the continuing failure of many postmodernists to integrate 
modern critical theory into their own intellectual enterprise 
means that these ships continue to pass unseen in the night. 
Consequently, the project of critical postmodernism receives 
less attention than it deserves. 

Of course, the distinction between radical and critical 
postmodernism is somewhat contrived. It cannot be located 
temporally in any precise fashion. Aspects of radical and 
critical postmodernism are found in both the early and later
works of all but the most extreme postmodernists (such as 
Jean Baudrillard). Still, a drift toward critical postmodernism 
is apparent, and a book that systematically charts this drift is 
well warranted. 

Negotiating Postmodernism is part survey text, part synthetic
research. Gabardi adopts a “profile and critique”
approach. The first half of the book mostly provides well- 
crafted review summaries of the literature along with insight- 
ful, if not sustained, critiques. Spanning the intellectual world 
of postmodernity in less than 100 pages is a monumental feat, 
which Gabardi performs with admirable thoroughness and 
clarity. One marvels at the comprehensiveness. Yet, Negotiating
Postmodernism provides too cursory an assessment of
the literature, pitched at too high a level, to be of much help 
to those wholly unfamiliar with it. Those more at home with
postmodern thought might sift through the review section on 
the basis of the promissory note that Gabardi issues in the 
introduction. He will chart practical options for a democratic 
politics informed by critical postmodern insights. 

Gabardi makes a strong case for critical postmodernism as 
a theoretical and ideological orientation that best captures 
the sociocultural dynamics of late-modern times. Moving
beyond the modernist pursuit of universal principles, he 
attends to the complex cultural pluralism and fragmented 
market globalism of contemporary societies. Moderating 
radical postmodernism, Gabardi refuses to overplay the 
technological burdens and aesthetic potentials of postmodern
life. He is attentive to the merits of transgression in a
world of pastoral power, but he also refrains from the 
dangerous embrace of anarcho-ethical alternatives. He sees 
himself as a skeptical moderate. Mixing Montaigne with (the 
later) Foucault, Gabardi molds a hybrid intellectual hero who 
deftly, if somewhat reactively, negotiates postmodern life. 

The political commitments of the postmodern moderate 
are fundamentally democratic. Democracy, for Gabardi, is 
essentially about citizens’ struggle for freedom. Yet, freedom 
today is not to be won in emancipatory revolutions. Rather, it 
is to be glimpsed through the cracks of disciplinary power, 
tasted briefly in performative transgressions, and shared 
tentatively through efforts to forge new ethical relationships 
of self and other. Gabardi calls for various forms of creative 
resistance. These aestheticized acts of refusal and self-inven- 
tion reveal our individual and collective capacity for freedom 
within an otherwise enveloping neoliberal world of globalized 
media and market relations overseen by a “techno-oligarchy.”

Foremost among the conditions that would allow the 
exercise of such freedom is increased leisure. Leisure allows 
us to take serious pleasure in the life of the mind, body, soul, 
and community. It is meant to foster cultural creation and 
critical engagement in civic life. We are urged not to use our 
extra free time for the increased consumption of commodi- 
ties and simulated experiences (e.g., television, video gaming, 
Internet surfing). Apart from the pedagogical force of “lei- 
sure studies” to be added to the college core curriculum, 
however, we are left without much hope that people will opt 
for civic agonistics and high cultural aesthetics rather than 
more frequent trips to the mall. There is little reason for 
optimism. Perhaps that is why Gabardi acknowledges that 
those who adopt a critically postmodern approach will necessarily
remain peripheral in a lifeworld dominated by production
and, increasingly, by consumption.

For those who take on the challenge of critical postmod- 
ernism, the goal is to fashion the social, political, economic, 
and cultural conditions that allow fugitive experiences of 
freedom to proliferate. Despite all the talk of collective
efforts, however, the creative resistance and self-care advocated
by Gabardi harbor a narcissistic quality. In this respect
Gabardi evidences Foucault’s mentorship. Although in debt 
to Foucault for his genealogical perspective and strategic 
overview, Gabardi also relies on Danilo Zolo (Democracy 
and Complexity, 1992) for his politicocultural analysis. He
accepts Zolo’s chaste vision of postmodern life. Zolo foresees 
the “Singapore model” for democracy: a prosperous, efficient 
society of managed consumers living in an “antipolis.” Theorists
such as Gabardi and Zolo may point to Singapore as
the vision of things to come. But the threat, or promise, of a 
Singaporean future may seem strangely out of touch, or out 
of reach, to the 70% of humankind living in substandard 
housing and unable to read, the 50% suffering from malnu- 
trition, and the 500 million people currently experiencing the 
horrors of war, the loneliness of imprisonment, and the agony
of torture. If there is a glaring shortcoming to Negotiating 
Postmodernism, it is that abstract theorization of the most 
sweeping variety takes place in the complete absence of 
empirical reference, even when practical solutions are proffered.
Gabardi’s encyclopaedic effort provides many succulent 
bones to chew on. Yet, little is discussed in depth or justified 
at length. One is tantalized but left rather hungry. For 
instance, we are at a loss to learn, in light of the dilemmas of 
postmodern life, why or how Gabardi’s economic and polit- 
ical proposals should or could become implemented. Frankly, 
I believe that most of his practical proposals are good ones. 
But that only means he is, like me, a social democrat 
informed by postmodern sensibilities. People with a different 
ideological bent would find the author’s refusal to justify 
many of his proposals rather irritating. They would be 
nonplussed by Gabardi’s abrupt leap from a theoretical 
discussion of Habermas, Heidegger, Arendt, and Foucault
to the practical advocacy of a negative income tax,
universal child care, and a 32-hour work week. 

I am also troubled by the celebration of Foucaultian-style 
transgression. The problem is that such performative action 
is too easily colonized in the postmodern world. One need 
only think of the performers featured on Jerry Springer and 
other day-time talk shows who stimulate the public’s appetite 
for transgressive spectacles. Nietzsche said that decadence 
can be defined as the need for greater and greater stimulation 
to achieve the same level of satisfaction. Performative trans- 
gression is a facet of postmodern decadence that the media 
techno-oligarchy well exploits. I would think that Foucault’s 
dallying with sadomasochism bears the same danger. Bread, 
circuses, and sadomasochistic gladiators—all available on 
pay-per-view! 

To his credit, Gabardi recognizes the capacity of the 
postmodern world to colonize the most creative acts of 
resistance. Its capacity to exploit efficiently (even our subver- 
sive) desires and actions, to integrate minds, bodies, and 
souls into what Heidegger called the Bestand or standing- 
reserve, is perhaps the chief reason to fight for leisure. But 
this fight might best be waged not by spending more time free 
of work, but by learning how not to spend time, whether one 
is engaged in work or play, theory or praxis. 


Containing Nationalism. By Michael Hechter. Oxford: Ox- 
ford University Press, 2000. 256p. $29.95. 


Russell Hardin, New York University and Stanford University 


Michael Hechter focuses on three puzzles about nationalism 
(pp. 3-4). Why is it only a modern phenomenon? Why is it 
more acute in some countries than in others? And can its 
dark side of horrendous violence be contained? The title 
suggests that his principal concern is with the last question, 
but the bulk of discussion focuses on the second. One can 
readily answer the first question by saying that nationalism is 
strictly modern because it is about mass mobilization. Hence, 
it is kin to democracy, revolution, and socialism, none of 
which could arise in large countries before mobilization was 
possible. The democratic revolutions in the United States 
and France were about changing the locus of sovereignty 
from a monarch to the people. This is not strictly Hechter’s 
argument, but it is implicit in many of his claims, such as that 
nationalism requires the existence of organizations that work 
for the national sovereignty of their subgroup (p. 125). 

This answer, of course, merely pushes the question back to
ask why mobilization began to be possible little more than 
two centuries ago. A quick and probably partial answer is that
technology, communication, transportation, and aggregation 
of workforces enabled mobilization as never before. A sec- 
ond quick answer is that Napoleon changed modern warfare 
and modern states by organizing vast armies, almost all of 
them composed of conscripts or volunteers rather than 
mercenaries. (The Romans, Persians, and others had mobi- 
lized large armies, but Napoleon was revolutionary in his 
era.) Military and factory mobilization both tended to pro- 
duce people who could speak a uniform national language 
and thereby de facto created modern nations. The first of 
these nations were driven by territorially inclusive national- 
ism, as in the cases of France and the United States. But they 
provoked nationalisms that were culturally inclusive and 
therefore exclusionary in other respects (pp. 91-2). 
Hechter has a different historical account of the reason 
nationalism arose only recently in Europe. His specific claim 
is that the world of local control was displaced by the rise of 
direct rule, of the intrusive state governing a large population 
(p. 60). When governance was highly local, the very idea of 
nationalism could not occur to or motivate anyone—it would 
have no point. State formation in Europe proceeded by the 
confederation of distinct solidary groups (p. 42). Later, direct 
rule from the center of such confederated states broke the
connection between the nation and the governance unit. 
Nationalism was therefore a response to growing state capac- 
ity for direct rule. Earlier empires had generally ruled 
indirectly, with governance structures and policies that varied 
from one group to another throughout the diverse empire. 
Hechter’s chief answer to the second question is that states 
have a limited span of control and need to organize more or 
less federally to manage large populations. The rise of direct 
rule in large states led to opposition to the center from 
culturally peripheral groups. In earlier work, Hechter argues 
that a multiethnic state would be easier to govern because 
each ethnic group would enforce some behaviors on its 
members, who therefore would have less energy to spend on 
more generally directed efforts that might challenge the state 
(pp. 156-7). Clearly, there is a risk in this arrangement 
because the subunit itself may organize against the state, 
especially if it becomes infected with nationalist fervor. In
Yugoslavia, the federal organization of politics into ethnically 
concentrated regions enabled Croatian and Serbian nation- 
alism. But, Hechter supposes, the greater risk is to attempt 
direct rule that reduces policies to uniformity across a diverse 
population. He argues that decentralization of government to 
a cultural minority may give that minority the resources to 
mobilize protests. Although this may suggest that the move is 
destabilizing, it may simultaneously undercut demands for
sovereignty and, therefore, nationalist fervor (p. 146). 
Despite frequent concern with the possibilities for collec- 
tive action, both in protest actions and in mobilizing nation- 
alist groups, the argument here is almost entirely at the level 
of the nation and the governing structure, not at the level of 
incentives or specific individual motivations. As Hechter 
notes for particular cases, leaders often have motivations of 
self-interest in gaining leadership positions in a newly created 
state that is congruent with some nationality, and therefore 
they may work for the nationalist cause of separate govern- 
ment. The motivations of others in a nationalist movement 
are merely a desire for policies that differ from the larger 
state to which they are subject. Hechter’s main addition to 
this simple account builds on social identity theory, as in the 
work of Henri Tajfel and others, who attempt to explain the 
commonly spontaneous creation of exclusionary groups (pp. 
99-101). 
If we are to identify with a group, that group must already 
be defined. Groups tend to form around characteristics that
are important to individual welfare or around mere location. 
In social identity theory, individuals identify readily with 
high-status groups. If they are in a low-status group, from 
which they cannot exit, they will tend to revalue the group 
and will identify strongly with it. Hechter argues that in many 
multiethnic societies there is a cultural division of labor that 
contributes in just this way to the heightening of national 
identity, especially for the group that is lower in the hierarchy 
of division (chap. 6). Such cultural divisions can be broken 
down by the conditions of urban life, in which internal 
enforcement by group members against one another is too 
weak to sustain the division, which therefore must depend on 
political enforcement (p. 112). 

In sum, nationalism is primarily a result of the irritations of 
centralized direct rule over cultural minorities who seek 
autonomy, the kind of autonomy they might have had in the 
earlier era of indirect rule. They may be placated by grants of 
partial autonomy, as in various devolutions of governmental 
authority in recent times. And, if we may read between the 
lines, they are more likely to be placated if their economic 
prospects are good enough to displace concern with political 
status. 


Radical Enlightenment: Philosophy and the Making of Mo-
dernity 1650-1750. By Jonathan I. Israel. Oxford: Oxford
University Press, 2001. 810p. $45.50.

John Christian Laursen, University of California, Riverside 
This book offers a major challenge to the academic political 
theory establishment in the United States and United King- 
dom. Instead of Hobbes-Locke-Rousseau, the important 
story is Spinoza-Bayle-Diderot. If you are not teaching
Spinoza and his influence in your surveys of early modern 
and Enlightenment thought, you should be. 

This is an epic drama with a cast of dozens. The story 
opens with Cartesianism and its spread around Europe, with 
major implications for society, institutions, women, sexuality, 
and more. Cartesianism is soon replaced by Spinozism, which 
pushes philosophical radicalism even farther. Important fig- 
ures making up a new canon include the previously obscure 
Van den Enden, the Koerbagh brothers, and Lodewijk 
Meyer. Benedict de Spinoza is the key figure, largely, as 
Israel argues, because he systematized the radical philosophy 
advanced since ancient times by less systematic figures, and 
because he was both vilified and followed by so many. In a 
nutshell, revelation, a providential God, freedom of the will, 
and miracles are ruled out on philosophical grounds, and 
immortality of the soul is denied by a theory that everything 
is one substance. Politically, this implies secularization, 
equality, democracy, freedom of expression, and women’s 
liberation. 

None of this was accepted quietly. A three-way battle for 
the hearts and minds of Europe was waged among conserva- 
tives, moderate Enlighteners, and radical Enlighteners. Fa- 
mous names such as Locke, Newton, and Voltaire are only 
moderates, in Israel’s analysis. The radicals are the Spi- 
nozists, such as Adriaan Beverland, Johannes Bredenburg, 
and Balthasar Bekker. 

One of Israel’s purposes is to push back the accepted dates 
for the important developments in early modern philosophy 
and political thought from the high Enlightenment of 1750- 
1800 to the early Enlightenment of 1650-1750. By 1750, it is 
argued, most of the work had been done. In the shadow of 
Spinozism came numerous controversies, from the brouhaha 
over Bayle’s claim that atheists could be good citizens to 
Bredenburg’s fight with Limborch over the proper relations
between reason and religion; from Fontenelle and Van Dale 
on oracles as political frauds to Leenhof on universal philo- 
sophical religion. Not only conservatives but also such mod- 
erates as Locke, Leibniz, Thomasius, and Wolf fought 
rear-guard battles against the growing influence of Spi- 
nozism. “Whig history” is a term that means all historical 
roads lead to the Whigs; here, all roads lead to Spinoza, so
this is presumably Spinozist history. 

This is cosmopolitan rather than nationalist history. Defy- 
ing the trend of studying the Enlightenment in a single 
national context, the book sweeps back and forth across all of 
literate Europe: from Ireland to Naples, from Sweden to
Portugal. A good part of the radical Enlightenment was 
underground, spread by clandestine manuscripts written by 
the likes of Boulainvilliers, Du Marsais, and other deists and 
Spinozists, most often in French. Radical German enlighten- 
ers, such as Tschirnhaus, Stosch, Lau, Schmidt, and Edel- 
mann, receive renewed attention here. Vico, Radicati, and 
Pietro Giannone prove that some Italians were up to date. 
And Israel shows that Spinozism played a role even in Spain 
and Portugal. 

This reassessment is on the order of the major works of 
Peter Gay, Quentin Skinner, John Pocock, and very few 
others. A few years ago, Steven B. Smith (Spinoza, Liberal-
ism, and the Question of Jewish Identity, 1997) gave us a fresh
reading of Spinoza’s political theory. Israel’s book sets that 
theory in context and spells out its implications for the history 
of ideas over a century and more. 

Repeatedly, Israel takes down the inflated reputations of 
Hobbes and Locke. He cites dozens of sources from the late 
seventeenth and eighteenth centuries that claim Spinoza 
raised the real issues, not the English writers. It is indeed 
remarkable how long it takes for insular and nationalist 
canons to be challenged. For example, in France La lettre 
clandestine reached its tenth annual volume without any 
significant circulation among Anglo-American political the- 
orists. For those of us who have been reading a large body of 
German, French, and Italian scholarship on these trends in 
the last decade, it is about time that a book such as Israel’s 
finally is issued by a mainstream English-language publisher. 
Since prestige is so important in the diffusion of scholarship, 
Israel’s position as professor of History at the Institute for 
Advanced Study at Princeton should add to the conviction 
created by his arguments. 

For political theorists who have little idea about what is 
going on in the rest of Europe, this volume is a magnificent 
opportunity to get up to date. What is at stake is the claim, 
now widely recognized elsewhere, that we moderns are not 
the intellectual heirs of the courtier Hobbes or the gentry 
spokesman Locke but, rather, of the former Jewish lens 
grinder Spinoza and his radical Dutch, German, and French 
followers. 

As with any wide-reaching synthesis, specialists will have 
bones to pick. Denis Vairasse’s History of the Sevarambes is 
described as a “French Spinozistic novel” and dated to 1677, 
but it appeared first in English in 1675. Israel asserts several 
times that Bayle was silent on freedom of the press, but what 
was his famous “Clarification concerning Obscenities” 
about? Israel claims that freedom of the press was always and 
only a radical position, but Elie Luzac’s defense of it in 1749 
was rather clearly a moderate stance. 

Specialists will also want to suggest further evidence. For 
example, Boureau-Deslandes’s Reflections on the Death of 
Free-Thinkers (1713) could have been mentioned on page 
298. Something could have been made of Martín Martínez in
Spain. Israel has materials on libraries, learned journals,
encyclopedias, book catalogs, and other modes of diffusion of
ideas, but there is no survey of the recently growing study of 
correspondence networks. The coda on Rousseau is a bit 
underdeveloped; much more has been said elsewhere about 
Spinoza’s reception in the period 1750-1800, and one area 
for future research would be Kant’s Spinozism. But any such
matters of detail would only confirm the overall message of 
this book: Major sectors of English-language political theory 
and history of political thought have been missing a great 
deal of what was important in the seventeenth and eighteenth 
centuries, and it can be found here. 


Lincoln’s Sacred Effort: Religion’s Role in Ameri- 
can Self-Government. By Lucas E. Morel. Lanham, MD: 
Lexington Books, 2000. 251p. $70.00 cloth, $23.95 paper. 


David F. Ericson, Wichita State University 


Lucas Morel presents an excellent survey of Abraham Lin- 
coln’s frequent use of biblical language and allusions. Yet, 
Morel fails the significance test he sets for himself (pp. 1-2): 
Did Lincoln frequently use such language merely because it 
was the most common vernacular of his time, the vernacular
with which his audiences would be most familiar? Or did he 
also frequently use such language because he thought that 
the right ordering of the relationship between religion and 
politics was critical to the maintenance of a democratic 
regime and that he actually had something important and 
original to say about that relationship? 

Although I agree with Morel that the latter is probably the correct
answer, he does not show that it is the correct answer. This is
far from a personal failure on his part, as he probably does 
the best he can with the available evidence. The problem is 
that so little evidence is available. In essence, Morel stretches 
that evidence into a set of arguments that Lincoln might have 
made about the proper relationship between religion and 
politics. He shows considerable ingenuity in developing these 
arguments, but it must be emphasized that he is the one who 
has developed them, not Lincoln. 

Perhaps a useful comparison is between Lincoln and 
Thomas Hobbes. In both cases, scholars have engaged in 
extensive speculation about their personal religious beliefs 
and whether they were atheists or, at most, tepid theists. In 
both cases, a lack of evidence has fueled this speculation. The 
two cases appear very different, however, once we move 
beyond the question of personal religious beliefs and begin to 
look at their views on the relationship between religion and 
politics. Much more evidence is available for Hobbes than for 
Lincoln. Morel tries to analyze Lincoln’s views on the rela- 
tionship between religion and politics as if Lincoln had 
written something equivalent to parts I and IV of The
Leviathan. But of course he did not. 

In making this comparison, my intention is not to stress the 
difference between analyzing the works of a philosopher and 
a statesman so much as it is to emphasize the difference 
between analyzing Morel’s chosen topic and other possible 
topics in Lincoln’s works. The writings and speeches of 
Lincoln can bear a fairly high level of analysis on such 
subjects as democracy and slavery, as has been shown by, 
among others, Harry Jaffa, who is mentioned so prominently 
by Morel (pp. ix, 14). They simply cannot bear the same level 
of analysis on Morel’s chosen topic. There is a very good 
reason that, as Morel claims (p. 11), such a book has never 
been written before. 

The one possible exception to Lincoln’s relative silence on 
the relationship between religion and politics is his famous 
Lyceum speech of 1838 (chap. 2). Yet, as Morel emphasizes, 
the political religion of that speech is not really a political
religion but, rather, a civil disposition of obedience to law that 
religion then might be used to foster (pp. 8-9, 14-5, 31-2). 
Lincoln understands the relationship between religion and 
politics in this speech quite narrowly. But Morel is also very 
interested—and claims Lincoln is as well—in that relation-
ship more broadly defined to include the ways in which
politics should accommodate religion (chap. 3), in which 
religion might be misused politically (chap. 4), and in which 
religion teaches men the limits of politics as well as of religion 
itself (chap. 5). It is on these more strictly religious topics that 
Lincoln says so little and Morel says so much. 

This gap is especially yawning in chapter 4, which is the 
weakest of the book. (Chapter 5, which deftly but still too 
expansively for my taste analyzes Lincoln’s Second Inaugural 
Address, is the strongest chapter.) In chapter 4, Morel 
analyzes Lincoln’s temperance address of 1842 and elabo- 
rates one of the major motifs of his book: The abolitionists 
were Lincoln’s exemplar for the political misuses of religion 
(pp. 9-10, 26, 125-6, 140). Yet, the abolitionists were not 
Lincoln’s explicit targets in this address; self-righteous tem-
perance reformers were. Furthermore, even when the aboli-
tionists were Lincoln’s explicit targets, as in his celebrated
1858 campaign debates with Stephen A. Douglas, his attacks 
seem grounded much more in political expediency than in 
personal disdain for either the principles or tactics of the 
abolitionists. However moderate Lincoln’s own antislavery 
principles and tactics may have been, they eventually coa- 
lesced with those of the abolitionists (pp. 175-80). There is a 
large measure of truth to Wendell Phillips’s gloss on Lin-
coln’s victory in the 1860 presidential election: “Lincoln is in
place, Garrison is in power” (“Lincoln’s Election,” in Wendell 
Phillips, Speeches, Lectures, and Letters, 1864, p. 305; empha- 
sis original). 

Where does this leave us? Morel provides some very 
interesting speculations about Lincoln’s views on the proper 
relationship between religion and politics, but he stretches 
the evidence beyond what it can bear. 


Worlds of Difference: European Discourses of Toleration, 
c. 1100-1550. By Cary J. Nederman. University Park: 
Pennsylvania State University Press, 2000. 157p. $40.00
cloth, $18.95 paper.


Preston King, Birkbeck College, University of London 


This book is novel, attending more to the history than to the 
logic or morality of tolerance. It propounds, against the 
popular grain, a significant presence for tolerance in medi- 
eval Europe. Cases are made for Abelard, Marsilius, and 
others as significant exponents. The result provides students 
with an opportunity briskly to explore work too often ig- 
nored. If this study hits methodological sandbanks, it is hoped 
that will not deter others from voyaging in premodern times 
and in non-European waters. 

Nederman takes aim at two key notions: The doctrine of 
tolerance is exclusively modern, and, more narrowly, toler- 
ance is the lineal progeny of “liberalism.” He is right to target 
the second, but he has invented the first. He is right to 
counter the view that “the Christian Middle Ages has [sic] 
nothing whatsoever to contribute to our understanding . . . of 
tolerance” (p. 3, emphasis added). Except that only one of 
four whom he “counters” arguably takes this view. A traveller 
who is construed to claim “there is no water whatsoever in the 
desert,” is proved wrong by the little rain that will eventually 
fall. An observer who claims that no medieval writer can
“readily” be conceived to oppose tolerance, or that medieval 
religious intolerance was the “norm,” is not to be translated
to mean: The Middle Ages have “nothing whatsoever” to 
contribute to tolerance. 

That aside, the substantive question must turn not round 
whether there is any evidence for tolerance in the medieval
past but round the type, volume, and significance of that 
tolerance. The author dismisses the claim by Ole Grell et al. 
that “religious intolerance was the norm throughout the 
Christian Middle Ages” (Ole Grell, Jonathan Israel, and 
Nicholas Tyacke, eds., From Persecution to Toleration: The 
Glorious Revolution and Religion in England, 1991, p. 3). Yet, 
he later endorses Moore’s view that intolerance was the norm 
in the Middle Ages: The “decided trend” was the “enforce-
ment of orthodox faith against a range of medieval dissent-
ers.” The “Roman Church... pursued a systematic policy of
imposing a unified set of Christian beliefs.” The “Church 
took direct aim at... religious difference, heresy, Judaism, 
intellectual dispute.” One may add that it is questionable how 
far this intolerance “culminated in the Fourth Lateran Coun- 
cil of 1215” (p. 11). After all, the Malleus Maleficarum (1486) 
was the decisive, authoritative handbook—commissioned by 
Rome and used all over Europe by Catholics and Protestants 
alike—governing the examination, torture, and execution of
“witches.” 

The author’s counterargument to assertions that support 
the prevalence of medieval intolerance slips from the idea 
that proponents like Grell and Tierney are simply wrong to 
the idea that this intolerance “depicts only part of the 
terrain” (p. 11). Nederman thus hovers between two claims. 
First, intolerance was not the norm. Second, intolerance was 
the norm, but not omnipresent: “Not every medieval thinker 
was entirely comfortable with repression” (p. 25). It becomes 
the author’s settled view that not everyone, after about 1100, 
favored torture, forced conversion, and dogmatic theology, 
or, after 1492, the abject enslavement of the newly encoun- 
tered “Americans.” This in turn raises the question: What 
sense of tolerance is being deployed to make this sort of 
response relevant? 

Nederman’s basic stipulation for “tolerance” is the idea of 
accepting “a multiplicity of ways of life,” a diversity “of 
beliefs or doctrines” (pp. 1-2). The larger the net, the bigger 
the catch. If tolerance equals diversity, we are not likely to 
run short of tolerance, even in the Middle Ages. Nederman 
does not dispute the persistence of persecution. He says 
expressly there was persecution, heaps, but that “persecution 
did not halt dissent.” Theologians did not stop reading 
Aristotle when the Church ordered them to (p. 15). “Heretics 
flourished even in the face of inquisitorial procedures” (p. 
16). “Excommunication was unlikely to carry much weight 
[since] heretics were relatively unconcerned” about whether 
they were killed or not. It “is simply incorrect” to assume, 
because the Church made “war on heresy,” that it succeeded 
in “stifling all religious dissent” (p. 17). “Careful investigation 
of the historical record shows that forms of religious diversity, 
at an intellectual as well as a practical level, subsisted 
throughout medieval Europe” (p. 12). 

Nederman plainly assumes that, by locating diversity, he 
demonstrates tolerance. But, of course, diversity no more 
establishes tolerance than intolerance; diversity is a formal
condition for both. When there is no notable diversity, there 
is no remarkable Other to despise—or with whom to be 
tolerantly reconciled. Diversity is consistent with despotism; 
and the failure to extirpate difference does not demonstrate 
tolerance. 

The author is too little attentive to definitions and time- 
frame. What is true in general is true as well for the particular 
figures investigated. Consider two of the least familiar of the 
writers Nederman treats. First, in the twelfth century William
of Rubruck is sent by the King of France to the Mongol court 
as a missionary. His job is to win over the alien to the one true 
faith. Instead, William observes an array of ethnic and 
religious diversity, fully tolerated by the Great Khan, a 
toleration which is “in many ways a source of frustration to 
William.” Why? Because there can be no “inclination toward 
conversion” where the Khan sanctions diversity (p. 57). One 
may view William as tolerant because he is intelligent and 
apprised of the impossibility of securing conversions. But, 
unlike Nederman, one may also take the view, overall, that 
William is less a significant tolerator than a supple but failed 
proselytizer. 

Second, Las Casas may be Nederman’s strongest case of a 
significant tolerator, an exponent of racial (not ideational) 
tolerance, and an interesting contrast to William. But does 
Las Casas (d. 1566) really belong in this company? Nederman 
slots him as a “medieval author” (p. 119), as opposed to 
Bodin (d. 1596), who is assigned to “the Reformation” (p. 
36). In fact, they both belong to the Reformation. Is it just to 
claim that injustice has been done to “medieval writers” when 
one cites, as a major exemplar of their virtues, a contempo- 
rary of John Hawkins and John Calvin—and so dubiously 
“medieval”? 

As for Abelard and Llull, they are primarily concerned 
with rational and respectful debate among Jews, Christians, 
and Muslims. The emphasis is on reason rather than revela- 
tion. Although rational dialogue possibly “requires that one 
respect the integrity” of one’s interlocutor (p. 37), it may not. 
(Cannot firm enemies join in rational debate, as in court- 
rooms and boardrooms?) For John of Salisbury, “mortals can 
know very little” (p. 50). But this makes him more a fallibilist 
than a true skeptic; what he does not doubt is the existence of 
God and related notions. Marsilius opposes crusading and 
denies the church authority to excommunicate. Yet, Cathol- 
icism is for him the one true faith, and “heretics and other 
infidels... are to be shunned” (p. 79). The best case for 
medieval religious tolerance may be Nicholas of Cusa, who 
seeks respect for divergent faiths. Even he assumes rational 
argument in the end will reveal the superiority of Christianity 
over all other faiths (p. 88). There is little in any of these men 
to match the audacity of Bodin’s Colloquium Heptaplomeres 
(c. 1593).

The author’s task would have been more challenging under 
a tighter, apter stipulation. For example, if we read “toler- 
ance” as “accepting or putting up with items (behaviors) to 
which we object (disapprove),” then there are three conse- 
quences. First, no agent can tolerate everything. For A to 
tolerate x implies (i) A has the power not to tolerate x, plus 
(ii) A is intolerant of whatever undermines x, given (iii) that 
A qua tolerator is to be conceived as having the capacity to 
resist such undermining, else s/he cannot (properly) be said
to tolerate. Second, we should expect to find cases of 
tolerance, both as thought and practice, in any era or region, 
including medieval Christendom, given a reasonably exten- 
sive body of evidence. It is not conceivable that any people or 
epoch should altogether exclude the possibility of some 
actors sometimes putting up with behaviors or beliefs to 
which they object. Thus, the residual questions that historians 
must answer relate to type, volume, and significance of 
tolerance. Third, formulated as above, one does not think to 
derive tolerance exclusively from “liberalism,” any more than 
from “socialism,” “anarchism,” “Christianity,” or “Islam” 
(consistent with the position taken by Nederman). 


The Play of Reason: From the Modern to the Postmodern. By
Linda Nicholson. Ithaca, NY: Cornell University Press, 
1999. 179p. $47.50 cloth, $17.95 paper. 

Eloise A. Buker, Denison University 

The book is composed of nine essays; all but one have
appeared in earlier publications. They have a timeless qual-
ity, however, and even readers familiar with them may find a
rereading productive, especially in the context of examining
them as a body of work.

Section I, which has five essays, examines “Modernity and 
the Problem of History.” Combining a solid reading of 
political theory from the seventeenth century to postmodern
times, Nicholson traces the development of such key con- 
cepts as the family, “race,” gender, sex, and the body to show 
how history has shaped the kind of politics we imagine 
possible. The first essay, and possibly the least insightful, 
offers a critical review of the analyses of moral development 
by both Lawrence Kohlberg and Carol Gilligan. The next 
examines Marxism and draws on Mary O’Brien and Iris 
Young to illuminate how Marxism can be adapted to examine 
women’s subordination by reframing the categories of anal- 
ysis to include child rearing and sexual relationships. Nichol- 
son explores the key concepts of consumerism, family, kin- 
ship, production, and reproduction and goes beyond dual 
systems theory. She argues that both moral development 
theory and Marxist theory fall short in their ability to develop 
significant cross-cultural analyses. 

The third essay examines the ways in which feminist theory 
has problematized the private/public distinction. Reflecting 
on the work of Young and Rosalind Petchesky, Nicholson 
examines how the personal is political to reveal how the 
economy and state organize family relationships. She ex- 
plains how liberal theory created a public arena composed of 
male household heads that wholly excluded women. Main- 
taining the value of a private/public distinction, Nicholson 
sees the changes in this boundary as a political process and 
not simply a historical fact. 

In the fourth essay, “Interpreting ‘Gender,’” Nicholson 
questions the distinction between sex and gender by showing 
how these concepts are mutually constitutive. She argues that 
radical feminists like Janice Raymond as well as feminists like 
Gayle Rubin retain a kind of “biological foundationalism” 
that has some of the political problems present in biological 
determinism. She suggests reframing the concept of “wom- 
an” as a complex term that can serve a coalition politics based 
on women’s differences. 

The final essay in Section I is a historical review of how 
modernity constructs the family. After demonstrating that 
the so-called nuclear family is a myth emphasized in the 
1950s and 1960s, Nicholson argues that understanding fami-
lies as living arrangements rather than as kinship networks
permits feminists to undo the distinction between the tradi- 
tional family and the alternative family. She maintains that 
families provide social insurance and that the variety of 
family obligations suggested by cross-cultural analyses 
offers a broader understanding of family. The argument is 
persuasive, but she fails to show how this offers new ways of 
viewing the family as a political institution that shapes public 
life.


Section II, composed of four essays, examines “Postmod- 
ernism and the Problem of Connection.” The first essay, 
published in 1988 and coauthored by Nancy Fraser, is 
philosophically dated. It was important in initiating dialogue 
about postmodernism among feminists, but the issues raised 
have been more elaborately developed in subsequent works, 
including an important book Nicholson edited, Feminism/
Postmodernism (1990). For example, postmoderns have be-
come much more articulate about the way in which politics is
a part of their analysis; so the point that postmodernism lacks 
social criticism and is therefore politically “anemic” no longer 
holds (p. 100). Furthermore, this essay skirts the central 
epistemological concerns raised by postmoderns, although 
Nicholson does address some of these in “Bringing It All 
Back Home.” In this essay, the one not previously published, 
she argues for a context-dependent understanding of reason 
based on pragmatism. She deals with the problem of relativ- 
ism and shows the importance of social theory that acknowl- 
edges the limits of history and culture. In the conclusion of 
this essay, however, she compares philosophy to religion and 
suggests that “salvation does not coexist well with diversity” 
(p. 128). This seems to be an unnecessarily limited view of 
salvation, whether it is constituted by either a religious or a 
philosophical discourse. 

The third essay in Section II offers a lucid argument about 
the limits of Charles Taylor’s politics of recognition but 
acknowledges the strength of his critique of liberalism. The 
final essay takes on the issue of emotion in public spaces and 
argues for a balance between emotion and reason as the basis 
for politics. Emphasizing the work of Freud, Nicholson 
maintains that the psyche needs to be taken into account as 
a factor in public life. She does not draw on either Jane Flax’s 
work in the ical or Judith Butler’s work on the 
psyche, The Psychic Life of Power (1997). Nicholson’s work is 
part of the same conversation, and it might have been useful 
to draw these thinkers into the discussion (pp. 156-61). 

“Bringing It All Back Home” comes closest to fulfilling the 
promise of the title, “the play of reason,” but Nicholson does 
not discuss “play,” and one wonders how this title emerged 
and what it means to her. Placing these essays in the context 
of her current thinking would have underscored her point 
about the importance of paying attention to context and 
would have offered a view of how she understands the 
connections and disconnections in these earlier essays. 

These important essays would serve well to introduce 
graduate students to some key issues in political theory that 
emerged in the United States from 1980 to 1999. The 
historical insights are especially relevant for those engaged in 
the history of modern political thought and its connection 
to postmodern concerns. Because the language of the text 
draws from the work of key philosophers rather than 
offering concrete examples, the book may not be useful 
for undergraduates. Scholars unfamiliar with postmodern
political theory will find this a lucid introduction to some 
of the central issues. Certainly, political theorists will 
want to have this volume, as it offers ready access to the 
important contributions Linda Nicholson has made to polit- 
ical theory. 


The Values Connection. By A. James Reichley. Lanham, 
MD: Rowman & Littlefield, 2001. 304p. $35.00. 


Michael P. Federici, Mercyhurst College 


The connection between morality and politics is the core of 
political theory. The founders of political philosophy, Plato 
and Aristotle, demonstrated the indissoluble nexus between 
the life of the soul and the life of politics. Political theory was 
engendered as a response to the spiritual and political decline 
of Athens. Plato and Aristotle diagnosed the spiritual cor- 
ruption of Athens and provided a prescriptive response to it. 
Since then, political theorists and social scientists have tried 
to determine to what extent ethical behavior and moral 
principles matter to political and social order. Whereas the 
ancient and Judeo-Christian tradition recognized the moral
foundations of politics, modern thinkers such as Machiavelli 
and Hobbes depreciated the importance of transcendent 
moral values to the formation of a just political and social 
order. With some exceptions, modern political theory has 
attempted to transform the foundations of politics. Rather 
than a transcendent foundation for political order, modern 
thinkers have founded politics on self-interest, power, liberty, 
and natural rights. 

A. James Reichley sides with the ancient and Judeo- 
Christian tradition in the debate about moral values and 
politics. He is primarily concerned with determining which 
moral values are necessary for a free society. He identifies ten 
essential values for the creation and maintenance of a free 
society: the unique value and significance of each human life, 
the rule of law, continuity between individuals and the 
historical human community, moral equality, social justice, 
popular sovereignty/majority rule in a constitutional frame- 
work, tolerance of different behaviors and beliefs, honor, 
compassion for the troubled and needy, and moral realism. A 
free society is impossible unless these values are accepted as 
moral ideals. He also explains why six particular “isms” (e.g. 
egoism, collectivism, monism, absolutism, ecstasism, civil/ 
secular humanism) are destructive to a free society. In short, 
they fail to balance order and liberty. They either represent 
excessive individual freedom or excessive state authority. 
Moreover, they do not balance individual rights and social 
responsibility. 

The bulk of the book is expository. It describes the six 
inimical ideologies in a textbook or encyclopedic fashion and 
sketches their theoretical development. Whereas these ideological 
systems are insufficient, Reichley determines that 
“transcendent idealism,” because it embodies the ten essen- 
tial values, is the appropriate moral foundation for a free 
society. Transcendent idealism is the “source for moral 
regeneration and guidance in the twenty-first century” (p. 
173). It is a value system primarily shaped by the classical and 
Judeo-Christian cultural tradition, although its characteris- 
tics can be identified in the traditions of Buddhism, Confu- 
cianism, and other non-Western cultures. It accepts the 
existence of original sin and consequently the fallibility of 
human institutions. Because human institutions are fallible, 
individuals need freedom to question and challenge them. 

The efficacy of transcendent idealism stems, in part, from 
its continuity with constitutional democracy. Reichley be- 
lieves that constitutionalism evolves from Judeo-Christian 
insights that form the foundation for free societies, such as 
the unique and equal value of each human life and the rule 
of law. He associates a wide range of thinkers and artists with 
transcendent idealism, including the Hebrew prophets, So- 
phocles, Aeschylus, Augustine, Aquinas, Luther, Calvin, 
Rembrandt, Locke, John Adams, Jefferson, Jane Austen, 
Matisse, and Walker Percy. Explicitly excluded from the list 
are Plato, Hobbes, Rousseau, Hegel, Marx, Rorty, and 
Rawls. The compatibility of ideas represented by Reichley’s 
list of transcendent idealists is not self-evident. He fails to 
address important differences between such thinkers as Aqui- 
nas and Locke on property or Locke/Jefferson and Madison 
on representation. A broad net is cast in defining transcen- 
dent idealism that obscures important theoretical differences 
between specific thinkers. 

Reichley’s argument hinges on his definitions of key con- 
cepts. What he means by “transcendent” or “justice” matters 
more than the categories (e.g., transcendent idealism) used 
to construct his argument. At first glance Reichley, a senior 
fellow at the Public Policy Institute at Georgetown University 
and former Brookings Institution fellow, appears to be 
making a conservative argument, that is, liberty and justice 
would be more secure in America if genuine religion and the 
Judeo-Christian tradition exerted more cultural influence. 
He is concerned that mainstream religion in America is on 
the decline and being replaced by secular (civil) humanism. 
Without connections to religious institutions, Americans will 
lose a sense of and commitment to the values that keep a free 
society vibrant. He rejects the moral relativism of the post- 
modern Left and subscribes to a theory of moral values that 
recognizes the transcendent as the necessary ordering force 
for a free society. He argues that transcendent idealism “finds 
no route to either freedom or justice that does not pass 
ultimately through reverence for God’s grace and love” (p. 
175). 

Arguments like this are common in recent scholarship. The 
idea that the transcendent is the source of existential, polit- 
ical, and social order is central to the works of such seminal 
scholars as Leo Strauss, Eric Voegelin, Irving Babbitt (to 
name a few), and their followers. The argument for a 
restoration of genuine religion in American culture has been 
made by Richard John Neuhaus, James Davison Hunter, 
William Bennett, and a host of neoconservatives. The argu- 
ment for a restoration of genuine spirituality as the cultural 
impetus for a renewal of the American political tradition is 
commonly found in such journals as First Things, Commen- 
tary, Modern Age, and Humanitas. A growing and diverse 
body of scholarship argues in one way or another that a free 
society depends on attunement to transcendent reality. Upon 
review of this literature, however, it becomes apparent that 
what each author means by moral universality and terms like 
“transcendent” varies significantly. Placing Reichley’s con- 
ception of universality in one of the variegated schools of 
thought that argue for a restoration of the classical and 
Judeo-Christian tradition is not clear-cut. In most instances, 
however, his argument can be classified as neoconservative as 
opposed to paleoconservative. 

Although Reichley covers significant historical and theo- 
retical material, ultimately his argument is based on social 
science studies and surveys; he does not make a philosophical 
argument. An example of his reliance on social science is his 
use of Guenter Lewy’s work to demonstrate an inverse 
relationship between religious activity and criminal behavior. 
He recognizes that the social, cultural, and political benefits 
of Judeo-Christian values to a free society are due to its 
insights about the nature of reality. But the book neither 
searches for spiritual insight and growth nor provides a 
conception of the ethical life. 

Reichley tries to convince readers that transcendent ideal- 
ism is responsible for the formation and maintenance of free 
societies, but he treats transcendence in a rather abstract way. 
He is vague about the philosophical basis for ethical respon- 
sibility. His argument is justified by social pragmatism. The 
values of transcendent idealism are best because social 
science data verify lower rates of crime, divorce, illegitimate 
births, abortion, and other social maladies among those who 
accept those values. It does not occur to Reichley that 
spiritual insights can actually undermine social and political 
order. 

With these reservations aside, The Values Connection does 
address an important problem. It provides a good survey of 
the classical and Judeo-Christian tradition and uses it to 
discuss a pressing contemporary problem. Those interested 
in the contemporary relevance of old ideas and traditions will 
find Reichley’s book useful. 


Obligations of Citizenship and Demands of Faith: Religious 
Accommodation in Pluralist Democracies. Edited by 
Nancy L. Rosenblum. Princeton, NJ: Princeton University 
Press, 2000. 438p. $72.50 cloth, $19.95 paper. 

Surviving Diversity: Religion and Democratic Citizenship. 
By Jeff Spinner-Halev. Baltimore, MD: Johns Hopkins 
University Press, 2000. 246p. $36.50. 


Robert K. Fullinwider, University of Maryland 


These two books—one a collection of essays and the other a 
sustained treatise—dwell on the problems posed for liberal 
theory and democratic practice by religious commitment. 
Both supply a rich menu of arguments and insights. 

Let us start with a central dispute. Michael McConnell, in 
his contribution to Obligations, insists that the terms of 
Supreme Court jurisprudence and the norms of liberal 
political/legal culture treat Americans with strong religious 
convictions as second-class citizens. Religiously based argu- 
ments are unwelcome in the public square, and religious 
believers must bend their practices to fit the law, not the 
other way round. These sentiments echo a common com- 
plaint that liberal “secularism” has achieved de facto “estab- 
lishment” in American law, public institutions, and elite 
culture. In place of the privatizing imperatives of that estab- 
lishment, McConnell offers an alternative vision, one of 
“religious pluralism”: All citizens are free to make, accept, or 
reject public arguments without limitation (p. 104), and the 
law bends to accommodate believers’ needs, imposing “the 
least possible violence” on their religious life (p. 103). 

McConnell comes under attack in Obligations from two 
directions. On the one side, Graham Walker derides his 
pluralism as just another form of liberalism, subject to the 
same complaints McConnell makes against the secularist 
version. What is needed, in Walker's view, is something 
altogether different from liberalism, namely, some form of 
open constitutional establishment of church. Such an estab- 
lishment would be more honest for being above-board and, 
properly limited, should prove more supportive of real 
religious diversity than the prevailing covert secularist estab- 
lishment (pp. 117-21). 

From a different direction, Amy Gutmann takes issue with 
McConnell’s one-sided treatment of religious freedom and 
argues instead for “two-way protection.” Separation of 
church and state (not separation of religion and politics) is 
necessary to protect the church from state interference, to be 
sure, but equally, she claims, to protect citizen and state from 
inappropriate political aggrandizement by the church. Fur- 
thermore, proper separation “denies religious, anti-religious, 
and nonreligious citizens alike a general right—based on 
conscientious objection—to disobey laws that serve legiti- 
mate public purposes” (p. 143, emphasis original). The state 
can grant exemptions in cases in which doing so does not 
create “runaway precedents” that subvert legitimate public 
law (p. 144), but except in special cases, those in which 
nonreligious parallels do not exist, exemptions based on 
conscientious scruples should be extended to the religious 
and nonreligious alike, writes Gutmann. 

Nancy Rosenblum further explores this last issue by con- 
sidering a piece of important legislation, Title VII of the Civil 
Rights Act of 1964, which gives “preferential treatment” to 
religious associations by letting them discriminate in their 
employment practices (p. 172). The Supreme Court, in 
Corporation of the Presiding Bishop v. Amos (1987), interpreted 
this exemption broadly to allow a religious organization to 
condition employment on a religious test no matter how 
remote the employment from any actual religious function. 
Although Rosenblum is keen to protect the autonomy of 
private associations from demands for “convergence”—demands 
that their internal organization and norms mirror and 
support the larger public values of democracy, equality, and 
nondiscrimination—she sees the Amos rule as too generous. 
Religious associations ought to show some nexus between job 
and religious function before they are allowed to fire a 
jobholder on religious grounds (p. 181). Among the reasons 
against handing them such a blank check is the serious threat 
posed to the religious freedom of individuals adversely 
affected by discrimination (pp. 183-6). 

Jeff Spinner-Halev, in Surviving Diversity, takes as his point 
of departure multiculturalist arguments for valuing and sup- 
porting cultural diversity. Both liberal and “nonliberal” forms 
of these arguments actually narrow the room for diversity, he 
contends (the principal targets here are the views of Joseph 
Raz, Will Kymlicka, Nancy Fraser, and Iris Young). These 
arguments are especially impervious to the “difference” 
constituted by conservative religious belief. As an antidote, 
Spinner-Halev offers a more religion-friendly account of 
liberalism. The liberal state, he contends, should prove a 
commodious place for those religious communities that 
organize themselves around ideas of obedience to authority 
and submission to revealed truth rather than around ideas of 
autonomy, individualism, and self-discovery. The liberal state 
can leave such communities to their own ways because a 
healthy liberal “mainstream” culture provides an “exit” op- 
tion. Giving this option substance, however, may require 
some intervention by the state (pp. 49, 63, 73ff). 

A liberal society should do more than tolerate the reli- 
gious. It should take steps to include them in the public 
square and make efforts, in schools and in law, to exempt 
them from requirements of which they conscientiously disapprove 
(pp. 107, 136-9). Like Gutmann, Spinner-Halev 
favors extending accommodations and exemptions, where 
warranted, to all, not just the religious, who are burdened in 
conscience by public law or policy (p. 207ff). 

This brief survey does not begin to do justice to the depth, 
subtlety, complexity, and power of the works under review. 
They belong on the bookshelf of anyone intellectually engaged 
by the recent renewal of interest in church-state issues. 
(Add to that shelf, as well, Brian Barry’s new book, Culture 
and Equality: An Egalitarian Critique of Multiculturalism, 
2001, whose extensive treatment of accommodation and 
lengthy reflections on the same court cases discussed by 
Spinner-Halev and the essayists in Obligations make it a 
forceful and provocative companion.) 

These two books show that the broad terms of liberalism 
do not dictate particular settlements of the religious question. 
History and circumstance must be given their due, and the 
“inescapability of judgment” (Rosenblum’s phrase, p. 189) 
means there are no formal or mechanical solutions to the 
problems at the heart of church-state relations. A great virtue 
of Obligations is three rich essays whose focus lies outside the 
United States. Gary Jacobsohn, Yael Tamir, and Martha 
Nussbaum ask us to consider religious freedom in the context 
of contemporary India and Israel. Even if we begin with 
broadly liberal values, the circumstances in these two coun- 
tries may lead us nevertheless to favor legal restrictions on 
religious speech and to countenance forms of religious 
establishment. Indeed, in Western European countries with 
histories far different from that of the United States, we may 
find religious establishments conducive to liberty, as Graham 
Walker hopes (see Nussbaum, p. 361). 

Walker puts forward a provocative, self-styled nonliberal 
ideal of a “mixed constitution” that establishes a church but 
protects religious liberties. But if a constitution genuinely 
protects religious liberties, is it not a liberal one? Moreover, 
must not the “mixed constitution” ideal attune itself to 
historical realities just as liberal separationism must? The 
secularist who insists that the Lutheran establishment in 
Sweden must go shows no less imagination than the Walk- 
erite who thinks separation of church and state can be 
cleared off the decks in the United States and replaced by a 
religious establishment. Still, as an ideal to toy with intellec- 
tually, what is it about the “mixed constitution” that ought to 
attract the reader to it (besides its openness)? According to 
Walker, “unlike liberalism, it sanctions truth-seeking ...; it 
does not insist that truth-seekers can never find any truth 
deserving public validation” (p. 121). 

This contrast, whether accurate or not, forces some inter- 
esting issues onto the table. First, there are questions for 
critics of the “secular establishment,” like Michael McCon- 
nell and Stephen Carter, who would measure U.S. policy by 
whether it hinders or supports religious pluralism. Why 
should a particular religious believer, committed to a set of 
specific theological truths, find religious pluralism in any way 
attractive? That is, why should he welcome the spread of 
religious error and theological confusion? What can an 
orthodox Catholic, say, find attractive about a constitutional 
order that facilitates the spread of Mormonism, Santería, 
Bahá’í, the Unification Church, Scientology, the Gospel of 
Wealth, mushy New Age “spiritualities,” and the theologi- 
cally anemic but media-savvy nondenominational “Christian” 
ministries mushrooming everywhere? 

Second, there is a question for Walker. If seeking the truth 
and having it publicly validated is important, can the mixed 
constitution be indifferent to which church is established? 
Otherwise, to meet his ideal it would be sufficient for 
secularism to come out of the closet, divest itself of the garbs 
of neutrality, announce its own truth, and openly luxuriate in 
its already existing establishment in the United States. 

Third, is liberalism really indifferent to seeking and publicly 
validating the truth? If so, what accounts for the 
abundance of public research universities that populate this 
continent, and the reams of government reports, administra- 
tive rules, and legislation that put the public stamp of 
approval on some views over others? Liberalism is hardly 
hostile to the truth, although it does dance a fine line about 
some truths. If we cannot live together as a people under 
certain descriptions (e.g., “God is one substance, not three”), 
we had better not stake our constitutional order on them. 
Still, each of us is committed to the truth or falsity of these 
descriptions, so must we not find our public order diminished 
in some way if it cannot acknowledge vital truths? Or is there 
a higher level truth we might all share that makes such an 
order morally valuable even as it allows—from our various 
perspectives—error to rub shoulders on equal terms with 
truth? If the reader is tired of watching John Rawls’s 
high-wire act on these matters, she might look, then, at 
another very different, yet surprisingly congruent, piece of 
acrobatic artistry, Dignitatis Humanae (Holy See, 1965). 


Modernity’s Wager: Authority, the Self, and Transcendence. 
By Adam B. Seligman. Princeton, NJ: Princeton University 
Press, 2000. 141p. $27.95. 


John R. Hall, University of California, Davis 


Who will read Adam Seligman’s important new book, and 
what will they make of it? The answers to these questions 
may reflect the very issue that Modernity’s Wager examines, 
modern individualism and its discontents. Seligman writes 
with great thoughtfulness and erudition, at the borderlands 
of social theory, political philosophy, theology, and the 
history of religion, and he constructs a theoretical lingua 
franca to draw these disparate discourses into conversation 
with one another. Yet, he takes the substantial intellectual 
risk that his argument will not receive due consideration from 
the very audiences whose attention to it would most advance 
public discussion: rational choice theorists, Enlightenment 
liberals, and postmodern relativists, all of whom, in different 
ways, will question his claim for the importance of external 
sacred authority. In the end, Seligman proposes skeptical 
toleration as the most promising basis for restoring a couplet 
of transcendent authority and authentic (“constitutive”) self- 
hood that has been lost under conditions of modernity. Yet, 
for his proposal to gain a hearing will require more than 
toleration skeptical of its own verities: It will depend upon 
readers’ willingness to engage ideas far outside—and alien 
to—their own frames of reference. They will find the effort 
worthwhile. 

Modernity’s wager, as Seligman describes it, is a bet that 
the autonomous individual of liberal thought can be main- 
tained on the basis of a transcendental philosophy of natural 
rights, without recourse to transcendent sacred authority, and 
that the internalized moral selfhood of this autonomous 
individual, and collectivities of such individuals, will be 
sufficient for an organization of the social that will not 
succumb to totalitarianism of either a Jacobin or fundamentalist 
persuasion. This wager is a risky one, in Seligman’s 
view, because the moral calculus of liberalism’s autonomous 
individual cannot be assured in the absence of a well-defined 
connection of the self to authority that has an external, sacred 
basis. The wager of modernity stands to lose both the 
individual self as a fully moral being (rather than merely a 
utility maximizer) and the possibility of a communal social 
order that has any moral basis to it beyond what Durkheim 
called the precontractual principles necessary to maintain a 
world in which contracts among free individuals undergird 
social life. The wager, Seligman maintains, morally impover- 
ishes both individuals and the modern social order, and it 
leads to a paradoxical “politics of recognition,” but in the 
absence of any moral basis for community that would provide 
an “authoritative basis of value” on which to base recognition 
(p. 120). More important, it leaves modernity open to 
dangerous reversals that potentially threaten the very indi- 
vidual freedoms that liberalism is meant to protect. 

In the end, Modernity’s Wager must reconcile external 
sacred authority with the contemporary realities of religious 
pluralism. To do so, Seligman makes his own wager with the 
sacred. Recognizing that sacred authority has itself often 
been used in coercive ways (in effect acknowledging a certain 
importance of internal freedom of religious conscience that 
he otherwise declaims as a threat to sacred authority), 
Seligman shifts from affirming the formal necessity of sacred 
authority to embracing a particular content of sacred authority 
that might offer a workable resolution to modernity’s 
dilemma. This resolution requires modern liberals and hu- 
manists to take matters of religion seriously, rather than 
dismiss all religion as fundamentalist and regressive. (Indeed, 
as Seligman observes, the resurgence in religious faith of 
many different types creates facts on the ground that cannot 
be ignored.) The moral failure of modern individualism, 
combined with authoritarian and fundamentalist threats, 
requires proponents of both reason and of faith to act with 
greater humility and skepticism concerning their own author- 
itative claims. “What is demanded, then, is a midpoint 
between nihilism and postmodern relativism, on the one 
hand, and absolutist claims of both faith and reason, on the 
other” (p. 129). 

The overall argument is developed on multiple and inter- 
locked fronts. In chapter 1, Seligman contends that the social 
sciences—from rational choice theory to Durkheim, from 
Hobbes to the symbolic interactionists—belie a (perhaps 
inescapable) modernist bias in their conceptions 
of the self and in their divorce of social order from a 
theory of community. A richer conception of the self as a 
moral evaluator rather than simply a utility maximizer can 
only be achieved on the basis of a transcendent sacred 
authority (chap. 2). This authority not only substitutes judg- 
ments about the deeds that fulfill or fail to fulfill moral 
intentions in the face of Fortune but also produces commu- 
nity via ritual. By these two developments authority makes 
possible the symbolic linkage among persons that sustains 
individual moral effort and collective sentiments of shame 
and pride, sentiments that reinforce the individual’s connec- 
tion with transcendent moral authority (chap. 3). But a 
theological and intellectual history is held to demonstrate 
that the particular trajectory of modern individualism out of 
the Reformation eclipses external authority and community 
(chap. 4), and this leads to the challenge of sketching an 
alternative consolidation of moral individualism 
and sacred authority by way of humility, skepticism, and 
toleration (chap. 5). 

This deeply and thoughtfully argued book can easily be 
read as a jeremiad about the “specialists without spirit, 
sensualists without heart” (in both life and utilitarian theo- 
ries) that Max Weber anticipated as the end cultural product 
of modernity. Seligman follows in a venerable if highly varied 
tradition of scholars who seek to reckon the fate of commu- 
nity and the prospects of faith, scholars such as Robert 
Nisbet, Peter Berger, Robert Bellah and his associates, and 
Robert Wuthnow. Like the best of jeremiads, this one rises 
above mere nostalgia. Seligman seeks a way out. Readers 
may disagree with him, and even when they share his 
conclusions, they may doubt the paths of reasoning by which 
he reaches them. Yet, arguments and doubts will not under- 
mine the significance of this book. Seligman’s achievement is 
to consider the prospects of individualism and the transcen- 
dent sacred through a deep reading of social theory and a 
charting of theological history over the longue durée. He is 
thereby able to articulate the sources of modern liberalism’s 
crisis in a particularly incisive way. 

The challenge that now faces Seligman, and us, is to 
explore the potential for skeptical toleration (and mutual 
engagement) of faith and reason. Whether even a fuller 
explication of this project will offer a way to revive transcen- 
dent sacred authority seems to me doubtful. But perhaps the 
effort itself is more important than achieving its goal. Indeed, 
the nature of toleration suggests that the goal is unreachable. 
If we are to have a world with one contradiction, this may be 
the one worth having. 


NOMOS XLII: Designing Democratic Institutions. Edited by 
Ian Shapiro and Stephen Macedo. New York: New York 
University Press, 2000. 331p. $50.00. 


Matthew Festenstein, University of Sheffield 


In keeping with the tradition of this distinguished series, the 
editors have assembled a team of very fine scholars, repre- 
senting a range of perspectives from across the discipline and 
beyond, to explore an important and current aspect of 
political theory. In the introduction, the editors locate the 
impulse to think about institutional design against the back- 
ground of cynicism and disgruntlement about the trajectories 
of the newer democracies, as the euphoria surrounding 
democratic revolutions of the last decade in Europe, Latin 
America, South Africa, and elsewhere recedes. Yet, there is 
no “transitology” here (apart from a rude remark from 
Philippe Schmitter, p. 244), and neither the problems nor the 
examples picked out by the contributors are peculiarly the 
property of this wave of democratization. These recent 
experiences of constitution- (and institution-) building have 
undoubtedly sharpened the sense that the implementation of 
democratic values is not merely a straightforward task of 
“engineering”; other experiences loom larger here, such as 
worries about campaign finance in the United States, new 
forms of nationalism, the growth of global and regional forms 
of governance, and postwar decolonization. Of course the 
question of institutional design, democratic and otherwise, is 
very much part of canonical political thought from The Laws 
onward. 

The essays in the first part explore some familiar anxieties 
about the quality of democratic deliberation and decision. 
Ian Ayres makes an ingenious case for anonymity in political 
campaign finance in the United States, which is rebutted by 
Geoffrey Brennan and Alan Hamlin. John Ferejohn discusses 
the difficulties of institutionalizing deliberation in sociologi- 
cally plural polities. Philip Pettit is also concerned with the 
quality of democratic decision making and offers a fresh and 
very helpful statement of his republican thesis that democ- 
racy, when understood as aiming at the articulation of 
common interests, needs not only to create openings for the 
expression of interests but also to institute devices to chal- 
lenge putative expressions of the common interest. 

The contributions in the next two parts range more widely. 
Iris Marion Young builds on a critique of liberal nationalism, 
as found in the work of David Miller and others, in order to 
reconcile the emphasis on group identity that has been such 
a central feature of her own philosophy with an institutional 
cosmopolitanism. Since the concept of a nation presupposes 
that of a sovereign state, she argues, nationality cannot be 
used as a criterion by which to justify a group’s claim to 
statehood. Furthermore, the very principle of state sover- 
eignty lacks legitimacy, and we should create a system of 
global governance that both supersedes the nation-state and, 
intriguingly, devolves powers to “self-determining peoples.” 

Russell Hardin acerbically argues that this last move 
reproduces the difficulties with nationalism, which errone- 
ously ascribes interests to groups and is itself a fundamentally 
“corrupt” ideology: “Liberal nationalism is too good to be 
true, and ordinary nationalism is too true to be good” (p. 
206). Robert Post argues that Young misconceives sover- 
eignty, which should not be understood as a state’s capacity 
for discretionary action but as the authorizing source of 
collective agency. Transnational institutions do not threaten 
sovereignty in this sense but may constitute its legitimate 
embodiment. This is appropriately followed by a contribution 
from Philippe C. Schmitter on how to remedy the democratic 
deficit in that sui generis institution, the European Union. 

Critics of rationalism in politics may wonder if the schemes 
for the design of democratic institutions are not in any case at 
the mercy of relatively intractable social forces. The third 
part of the book consists of a pithy and pessimistic account by 
Donald L. Horowitz of the prospects for successful constitu- 
tional design in ethnically divided societies, with responses 
from Brooke Ackerly and Philippe Van Parijs. Horowitz 
offers an excellent summary of his criticisms of the consocia- 
tional model of ethnic accommodation as well as a summary 
of his case for constitutions that attempt to lure votes across 
ethnic boundaries, in effect trying to give an institutional 
boost to the development of Lipset’s cross-cutting cleavages. 
But this is framed by a pessimistic argument that deep 
division militates against the acceptance of any “centripetal” 
constitutional package; indeed, “substantial compromise on a 
plan to facilitate interethnic compromise decreases the like- 
lihood of interethnic compromise in the operation of the plan 
once adopted” (p. 272). 

Ackerly responds that neither ethnic cleavage nor institu- 
tional design is as important as Horowitz makes out. In 
particular, she argues (with reference to Nigeria and Malay- 
sia), economic inequality and the immense temptations of 
corruption are more significant. Van Parijs makes the case 
that, when the conditions are right (as in the Netherlands and 
Belgium), a pragmatic mixture of different schemes for ethnic 
compromise can work. In a rejoinder, Horowitz refuses to 
take solace. The ceteris paribus conditions put forth by Van 
Parijs in the case of Belgium are just an appropriate set of 
cross-cutting, nonethnic cleavages that compensate for the 
feebleness of institutional measures; when these do not exist, 
as in Northern Ireland, the institutional measures are precarious. 
(The recent general election results from that province, 
it should be said, bear out Horowitz’s pessimism about 
the dynamics of implementing constitutional accommoda- 
tions.) This is a very illuminating exchange, with (like several 
contributions to this volume) classical resonance. 

Designing Democratic Institutions is a rich and diverse 
volume. It should be of interest not only to political theorists 
but also to readers who skip to the reviews on American 
politics, international relations, or comparative politics. It 
inevitably offers what Brennan and Hamlin call a piecemeal 
rather than a synoptic view of its subject, which is vast, and 
any short statement about democratic institutions is bound to 
raise a host of vulnerable causal claims and contestable moral 
assumptions. But this book offers an excellent survey of some 
current concerns, should encourage concrete thinking from 
political theorists, and is strongly recommended. 


Thomas Hobbes and the Political Philosophy of Glory. By 
Gabriella Slomp. New York: St. Martin’s, 2000. 194p. 
$65.00. 


Ted H. Miller, University of Alabama 


All too often, Thomas Hobbes thought, we are undone by our 
pride. He used reason, cajoled, and even threatened his 
audiences to persuade them to accept this sad fact and to 
accomplish the still harder task of bringing them to swallow 
the strong political medicine he deemed necessary for cor- 
recting our condition. Hobbes himself, however, was not 
immune from the disease of pride. He noted our inability to 
acknowledge that others might be wiser than ourselves, but 
he sardonically recommended we take this inability as evi- 
dence of the equality of human wits: “For there is not 
ordinarily a greater sign of the equal distribution of anything 
than that every man is contented with his share” (Leviathan, 
ed. Edwin Curley, 1994, p. 75). The problem, of course, is
that we demand of others that they see us in the same light as 
we see ourselves. Readers only vaguely familiar with his 
mathematical squabbles are aware that Hobbes felt he failed 
to receive due recognition for his talents from Britain’s 
mathematical authorities. He protested that these same 
authorities were all too satisfied with a deficient share of wit. 
According to Hobbes, we are the inheritors of a mathemat- 
ical tradition that went mad shortly after it rejected his 
contribution. 

Gabriella Slomp suggests that Hobbes’s views on glory 
changed between the earlier statements of his political doc- 
trine (Elements of Law and De Cive) and the later (Leviathan 
and De Homine). According to Slomp, the earlier Hobbes 
made vainglory (a person’s inflated estimation of self-worth) 
the root cause of man’s pleasures: Our various passions are 
rooted in the satisfaction we derive from seeing ourselves 
honored. By the time he wrote Leviathan and De Homine (a 
work rightly emphasized in this analysis), Hobbes no longer 
gave vainglory all the credit. It emerges as one passionate 
motivation among many. Glory-related passions remain, but 
admiration of novelty, compassion, and love are given new 
emphasis (p. 91). Hobbes also offers greater acknowledgment 
that some individuals are not driven by vainglory. This is 
concomitant with another change. Hobbes’s subject, in 
Slomp’s view, becomes more malleable and educable, al- 
though this claim in chapters 7 and 9 and in the conclusion is 
a bit muddied by other elements of the argument. Slomp 
suggests that Hobbes may have shifted his views on vainglory 
to better match the experiences of readers so that they might 
be educated by the Leviathan; she also recognizes that the 
non-glory-seeking individuals were always a part of Hobbes’s
world view (pp. 88-90, 145). Slomp further acknowledges 
that Hobbes does not require all individuals in the state of 
nature to be vainglorious. Rational actors know that some are 
so inclined, and this is enough to transform the state of 
nature into a state of war we would wish to 

This is a very wide-ranging book, not always unified by a 
larger argument or restricted to the topic suggested in its 
title. It often reads like disparate pieces cobbled together. 
Slomp begins by defining herself (overzealously) against 
Cambridge School interpretation, but she nevertheless sustains
a largely textualist and analytic treatment by drawing
heavily upon sources one might expect to find in a Cambridge 
School interpretation. Most notably she draws upon Hob- 
bes’s correspondence (particularly Leibniz’s observations) 
and his unpublished (until 1976) arguments against the 
scholar and author of De Mundo, Thomas White. The 
arguments against White are nearly the exclusive source for 
citations to Hobbes’s writings in Slomp’s sprawling last 
chapter and are used extensively throughout. 

The use of neglected sources should be welcome, but one 
might have hoped for some thoughts from the author con- 
cerning the standing of the work on White as a contribution 
to an interpretation of Hobbes’s contemporaneously pub- 
lished philosophy and a discussion of White’s book. Since one 
of the most provocative pieces on Hobbes’s views on honor 
and glory was written by the historian Keith Thomas (in 
Hobbes Studies, ed. K. Brown, 1965), who linked Hobbes with 
reform-minded aristocrats anxious to check the excesses of 
their contemporaries, the decision to forgo a historical
treatment has its costs. Thomas’s claims are not addressed, 
although some of the older omnibus targets (such as 
MacPherson) are singled out for attention. In spite of 
Slomp’s great emphasis on “political geometry,” no attempt 
is made to approach Hobbes’s extensive writings on geome- 
try. 


The book is divided into two parts. In Part 1, major efforts 
are devoted to illustrating the centrality of Hobbes’s concerns 
over glory. Slomp traces the connections between his view of 
its dangers and Thucydides’ treatment of the subject in 
History of the Peloponnesian War (a work Hobbes translated 
early in his publishing career) and illustrates the above- 
mentioned shift. On the way to Thucydides, comparisons are 
drawn between Hobbes’s views on glory and other well- 
known sources, including Aristotle’s “honor,” Biblical pride, 
and Bacon’s Essays. Slomp shows parallels between Thucy- 
dides’ vainglorious peoples and Hobbes’s vainglorious indi- 
viduals, but she finds an ultimate divergence: Hobbes is more 
optimistic about the prospects for using fear to reunite
societies torn apart by the children of pride. Part 1 also
includes a criticism of Carole Pateman’s reading of Hobbes. 

In Part 2, Slomp begins a more analytic or choice-theoretic 
treatment. Game theoretic approaches to Hobbes by Gau- 
thier, Hampton, and Kavka are criticized, but so is the 
criticism of such approaches by Patrick Neal. Combining 
what she takes to be the best from both camps, Slomp 
redescribes the situation of rational actors in the state of 
nature: They are not involved in a prisoner’s dilemma but are 
locked in an insolvable game of chicken (Who will veer off 
the road first?). Slomp’s “Chicken with Spices” leaves players 
paralyzed between the demands of vainglory and the “incom- 
mensurably negative value of death.” 

This is followed by a chapter entitled “Hobbes’s Impossi- 
bility Theorem.” The reference is to Arrow’s theorem by the 
same name, although Slomp’s proof, unlike Arrow’s, involves
the strategic interaction of game players. “In a state of 
unrestricted liberty (UL), for men who regard death as the 
greatest evil that might occur to them (S) and know that other 
people, too, are concerned about their survival but might be 
glory-seeking (G), it is rational (R) to decide to kill, which 
decision, because of the equal and vulnerability 
of men, is against reason (non R)” (p. 147, emphasis original). 
The only way out is to introduce, ex machina, the all-powerful 
sovereign (nullifying the condition of unrestricted liberty). 

Moreover, it is Slomp’s innovation to suggest that rational 
actors living in the state of nature are themselves incapable of 
resolving the problem. As such, one must stand back from the 
conflict, in the calm of an already peaceful society where one 
is able to realize the need for such a sovereign. With this
assertion, Slomp joins a growing crowd of Hobbes scholars 
who argue that the works are best seen as directed toward the 
minds of persons already within societies, rather than as 
advice for those living in the state of nature. 

Slomp’s game-theoretic treatments are constructed from at 
least one opportunistic reading of Hobbes’s work. The as- 
sumption of “equal dangerousness” is drawn, in part, from 
Hobbes’s claim that we are all equals primarily because we 
are capable of killing one another. In Slomp’s formulation, 
however, equality in physical conflict must always conclude 
with both parties destroying one another (p. 137). Life in the 
state of nature may be solitary, poor, nasty, brutish, and 
short, but is it so short that we must assume every physical 
conflict between every two individuals always results in the 
death of both? The very idea that some persons seek glory 
through the subordination of others suggests that Hobbes 
knows that some individuals walk away the victor from 
physical conflict, even when the other side resists. Slomp 
hypothesizes that the state of war in the state of nature may 
be nothing more than a “war of minds” (p. 146), but that does 
not solve the problem for her game participants, who must 
mull the possibility of something more immediately treach- 
erous. Such a fighter’s life may not last long in comparison 
with Hobbes’s own, but contra Slomp, it surely need not end 
as soon as a potentially equal opponent puts up a fight. 


The Comedy and Tragedy of Machiavelli: Essays on the 
Literary Works. Edited by Vickie B. Sullivan. New Haven, 
CT: Yale University Press, 2000. 246p. $40.00 cloth, $18.00 
paper. 

Markus Fischer, Georgetown University 

In addition to his well-known political tracts, Machiavelli
composed a variety of comedies, poems, and familiar letters.
Literary scholars have studied these works for some time and,
more recently, applied their craft to his political writings as 
well (e.g., Albert Russell Ascoli and Victoria Kahn, eds., 
Machiavelli and the Discourse of Literature, 1993); in their 
view, Machiavelli took politics to consist of rhetoric and 
wrote accordingly. The present volume constitutes an impor- 
tant rejoinder to this endeavor, insofar as its more significant 
essays assume that Machiavelli was a political philosopher 
and that his literary creations apply his political theory to the 
private sphere. 

More explicitly, the theme that holds this collection to- 
gether is the question of whether Machiavelli’s literary effort 
tends more to comedy or tragedy. According to Arlene 
Saxonhouse, comedy breaks down traditional boundaries in 
order to open the distinct and peculiar to the common and to 
show the fluidity of all forms (pp. 57-8). Thus, Machiavelli’s 
literary works are comic because they reveal the private life 
of respectable men and women to be a game for sexual 
gratification, wealth, and reputation, with success going to 
the clever—those who know how to assume the most effective 
speech and guise. 

Mandragola, for instance, is a play that celebrates the fraud 
by which Callimaco exploits the aged Messer Nicia’s desire 
for sons in order to bed his young wife Lucrezia, whose 
morals are corrupted by her ambitious mother and a church- 
man who takes the biblical story of Lot’s rape by his 
daughters to imply that good effects excuse evil means, in 
evident reflection of Machiavelli’s infamous advice to 
princes. Moreover, the fact that Lucrezia is so pleased with 
Callimaco’s embrace that she makes him her lover and gets 
her grateful husband to offer him a room in their house 
suggests that a new order, useful and satisfying to all parties, 
can be constructed by letting go of moral scruples, just as 
Machiavelli’s republic maintains internal stability by aban- 
doning the classical idea of a community of virtue and joining 
nobles and commoners in the mutually rewarding pursuit of 
glory and empire by external conquest.

According to Harvey Mansfield, Mandragola portrays the 
moral failing of Lucrezia as the failing of Christian morality 
in coming to terms with the wicked deeds that many good 
outcomes in practice require (p. 22). Intriguingly, Mansfield 
raises the further possibility that Messer Nicia allowed him- 
self to be cuckolded in order to gain respectability as the 
progenitor of a family, in imitation of Junius Brutus, who 
played crazy and used the rape of the Roman Lucretia to gain 
glory as the founder of the Roman republic (p. 28). In the 
words of Robert Faulkner, Machiavelli replaced classical 
comedy, which drew ethical lessons from the laughable, with 
a utilitarian rhetoric that constructs associations for the 
mutual satisfaction of desire—be they households or modern 
societies (pp. 53, 56). 

The absence of ethical seriousness from Machiavelli’s 
literary œuvre also disqualifies it as tragedy in the classical 
sense, as Faulkner further suggests (p. 35). Tragedy signified 
to the ancient Greeks that the hero’s unflinching pursuit of 
one good inevitably negated another, owing to the fundamen- 
tal incoherence of reality. In the Renaissance, however, the 
tragic merely meant that great success was characteristically 
followed by abject defeat, due to man’s subjection to the 
whims of Fortune, as Ronald Martinez avers (pp. 110-1). 
Accordingly, the ruin of once glorious Italy by the French and 
Spanish invasions constituted a tragedy, and recounting it in 
grave and poetic terms, as Machiavelli did, made the writer a 
tragedian (pp. 102-3, 116-9). But Martinez’s more significant 
contribution consists of interpreting Machiavelli’s comedies 
as parodies of tragic episodes from antiquity, which, in turn, 
suggests Italy’s calamitous decline. In particular, whereas the 
tale of the Roman Lucretia (who committed suicide after
being compelled by Sextus Tarquinius to yield her body,
which prompted the outraged Romans to overthrow the
Tarquins and establish a great republic) qualifies as tragedy,
the story of the Italian Lucrezia amounts merely to comedy 
(pp. 105-7). In other words, the tragedy of Italy consists of no 
longer being capable of the tragic catharsis that alone could
renew its body politic. 

According to Michael Harvey, a similar mingling of comic 
and tragic strains is evident from Machiavelli’s poem L’Asino,
a tale of a man’s descent into the forest of Circe. Usually, we
take Machiavelli’s view of sexuality as one in which men of
virtù subdue women by fraud and force, as expressed in his
famous metaphor of the prince who conquers Fortune by
beating and striking her down. In L’Asino, however, the
hero’s virtù—in particular that of his sexual organ—shrinks 
before the terrifying power of Circe, revealing the anguish 
felt by men who must forever prove their manliness in a 
solitary and agonistic world (p. 133). Even more uncharac- 
teristically, Machiavelli responds to this vulnerability with a 
tale in which the hero regains his virtù through the loving 
embrace of a kind and understanding woman, which offers a 
rare glimpse of a world of mutuality and friendship (p. 129). 
Here, Harvey adds an important nuance to our understand- 
ing of Machiavelli’s psychology. 

Susan Meld Shell rounds off this political interpretation of 
Machiavelli’s literary works by drawing out their propositions 
on language. Accordingly, linguistic boundaries are shaped 
by political forces, as conquerors impose their tongues on the 
provinces they settle (pp. 83, 98). Language itself operates by 
the principles of politics as understood by Machiavelli. Ac- 
tion words are more “powerful” than articles and nouns (p. 
83). Native tongues need to dominate imported words to 
remain beautiful (p. 87). Italy’s common language does not 
consist of the abstract universals imagined by Dante but of 
the impure mix of concrete particulars used by the Italians 
(pp. 86-7). And the value of an idiom depends not on its 
aptness for expressing the worthiest things but on its practical 
effectiveness—hence comedy’s use of coarse speech to in- 
struct the many in useful things (p. 92). 

The contributions by Mansfield, Faulkner, Saxonhouse, 
Shell, Martinez, and Harvey make this book highly com- 
mendable to those seeking to fathom the literary form of 
Machiavelli’s thought. This form consists of both comedy and
tragedy, shorn of their traditional moral lessons. As comedy, 
it reveals the noble to be a mere appearance of the vulgar and 
applauds the clever manipulation of appearances. As tragedy, 
it glorifies men who do battle with Fortune even though she 
will eventually ruin both them and their orders. In the final 
analysis, Machiavellian comedy and tragedy are but two 
expressions, light and grave, of a reality devoid of purposive,
ethical order. Moreover, the literary form may have
lured Machiavelli into giving voice to a dread that this
worshipper of manly action would otherwise not admit: that 
a world of endless strife and contingency is a cause for 
despair, rather than celebration. 


Democracy and Association. By Mark E. Warren. Princeton, 
NJ: Princeton University Press, 2001. 265p. $55.00 cloth. 
$17.95 paper. 


Nancy L. Rosenblum, Harvard University 


Mark Warren joins the lively discussion on voluntary associ- 
ation with a circumscribed purpose: to identify the specifi- 
cally democratic effects of associations in the United States 


and to create a typology of groups. His goal requires him to 
perform two preliminary tasks. First, he distinguishes the 
effects of association from a group’s formal purposes and 
members’ intentions; associations formed for a variety of 
goods may have democratic effects. Second, he sets his 
project apart from work concerned with the broader moral or 
socially integrative effects of association, ranging from diffuse 
social capital to individual virtues, as well as from theories of 
association that focus on political devolution and subsidiary 
forms of self-government. It should be said that, as in most of 
this literature, the political conditions that shape the ecology 
of associations are eclipsed, and the focus is on the one-way 
effects of association on political life. 

What counts as a “democratic effect,” how we value it, and 
what forms of association contribute to democracy all de- 
pend, of course, on underlying democratic theory. Warren 
adopts a moderate version of Habermas’s spheres, distin- 
guishing among government, market, and “life-world,” which 
includes the social structures that support “public spheres” of 
opinion formation. Consensual associational relations based 
on neither power nor money can be found in every sphere, 
but the organizational form, voluntary association, which is 
based on social attachments and normative resources and is 
intrinsically communicative, dominates civil society. This 
framework allows the author to set aside government and 
business. (And with them the workplace, which is a conten- 
tious point, Warren recognizes, since work is arguably the 
venue that affords most adults the experience of collective 
action and deliberation under conditions of ethnic and racial 
heterogeneity.) 

The Habermasian framework also emphasizes social dif- 
ferentiation, the “migration” of collective action beyond 
states and markets, and the multiplication of politically 
relevant arenas. This underscores Warren’s argument that 
democracy depends on a number of different and indepen- 
dent associational functions that need to be carefully disag- 
gregated; the key is associations’ “contributions” to democ- 
racy in the plural. The Habermasian framework puts a 
premium on associations that connect individual life-worlds 
to public spaces, encourage collective judgments, and create 
the networks of communication that comprise “public 
spheres.” That said, the usefulness of Warren’s typology does 
not depend on subscribing to Habermas’s conceptualization 
of spheres of collective action, and the author does not 
employ “public reasoning” or “deliberation” as philosophical 
terms of art. 

Very briefly, Warren identifies three categories of demo- 
cratic effects. One set constitutes “public spheres” of demo- 
cratic judgment. Another is personal developmental effects: a 
sense of efficacy, political skills such as negotiation and 
coalition-building, civic virtues, and capacities for deliberation,
among others. The third category is effects that underwrite
democratic institutions, including representation, legitimation,
or resistance. Against this background Warren
works out a typology of associations based on a number of 
identifiable factors that tend to produce one or more of these 
democratic effects. 

The principal point flowing from this typology is that 
trade-offs among democratic effects are inevitable. For ex- 
ample, associations that put a premium on the absence of 
internal conflict are unlikely to develop members’ political 
skills. Associations from which exit is costless are unlikely to 
experience pressures from members for “voice” and are 
“lethal to critical skills.” Warren challenges facile assump- 
tions about voluntary associations as sites of unrestricted 
dialogue. Associations that foster deliberative capacities are 
less likely to develop the strong consistent public positions 
necessary for advocacy and other vital contributions to 
dialogue in the public sphere. The intimate social context of 
associations such as universities makes it more difficult for 
them to address issues of race and gender as compared to
market-oriented firms, for whom resolving these issues is 
instrumental to impersonal goals. One last example: Associ- 
ations committed to providing the good of “identity” have a 
low capacity for cooperation and coordination.

The rewards of system are apparent here; Warren’s typology
illuminates the terrain of associations in the United
States and the limitations of much that is written about it. 
The costs emerge if we look for substantive insight into the 
democratic effects of specific associations. Political parties 
are arguably the most important intermediate democratic 
association, and Warren types them as vested groups ori- 
ented toward “the medium of coercive state power,” indeed, 
as “arms of state power.” This characterization holds for 
parties in government but does not point up the face of 
parties as voluntary associations. Their unique role in accom- 
modating interests, framing issues, altering the parameters of 
discussion in conjunction with other groups, setting agendas, 
and shaping public opinion (to say nothing of their distinctive 
part in political representation) is given short shrift in this 
and probably in any typology. Moreover, political parties 
function both as associations and as forums for other groups. 
So long as “public sphere” is defined as “institutionally
unbound” and without powers of collective action, our un- 
derstanding of parties will be truncated. 

Unquestionably, Warren’s democratic categories and associational
types will help structure and guide the work of
political theorists and social scientists. His categories are 
exhaustive but not so detailed as to overwhelm the bounds of 
useful typology. At the same time, the grain of generalization 
is not overly coarse, and the result is a nuanced map of the 
associational terrain. The author’s constructive spirit and 
attention to the real world of groups are apparent through- 
out. And Warren’s quiet insistence that against the onslaught 
of legalism, bureaucracy, and markets associations alone 
preserve voluntary, social forms of collective action gives this 
excellent work its moral grounding.


In the Web of Politics: Three Decades of the U.S. Federal
Executive. By Joel D. Aberbach and Bert A. Rockman. 
Washington, DC: Brookings Institution Press, 2000. 230p. 
$42.95 cloth, $17.95 paper. 


Karen M. Hult, Virginia Polytechnic Institute 
and State University 


The legitimacy and the capacity of an administrative state are 
not easy to demonstrate or justify in the United States, where
administration emerged from and is enmeshed in a political 
system animated by separated powers, checks and balances, 
individual rights, and skepticism about government. Aber- 
bach and Rockman focus on the “turmoil and controversy” 
that have swirled around the U.S. national executive branch 
since 1970, and they sketch the contours of and explore the 
reasons for the ongoing debate. At issue, they claim, is both 
a “quiet crisis,” which reflects the allegedly deteriorating 
quality and morale of federal career executives, and a “noisy 
crisis,” which involves the uncertain responsiveness of civil 
servants to the demands of elected officials. More recently, 
and amid ongoing criticism, the federal government has 
belatedly joined many states and localities, as well as other 
countries, in an effort to “reinvent” government. Aberbach 
and Rockman find scant empirical evidence of either “crisis” 
or of reinvention-induced declines in careerist morale. More- 
over, they contend, much of the persistent concern about the 
capacity, accountability, and responsiveness of federal ad- 
ministration is better understood as disagreement about the 
proper scope and activities of government. 

The empirical core of In the Web of Politics is the analysis 
of findings from structured interviews with senior careerists 
and subcabinet-level political appointees in domestic policy 
agencies during the Nixon (1970), Reagan (1986-87), and 
first Bush (1991-92) administrations. The three cross-sec- 
tions permit the authors to examine the extent and nature of 
change in the backgrounds, attitudes, work experience, and 
perspectives of these officials across three Republican presi- 
dencies as well as before and after the emergence of the 
Senior Executive Service, which was created by the 1978 Civil 
Service Reform Act. Aberbach and Rockman also use the
interview data as well as other documentary evidence and
material from Office of Personnel Management (OPM)
surveys to explore the empirical bases for claims about the
representativeness, responsiveness, quality, morale, and
adaptability of the federal executive.

The quiet crisis (a term the authors adopt from the 1990 
report of the National Commission on the Public Service, 
popularly called the Volcker Commission) revolves around 
the ability of the career service to attract and retain qualified 
individuals and the morale of those officials. Of ultimate 
concern is the effect of personnel quality and morale on 
government performance. Aberbach and Rockman find little 
evidence of a clear drop in competence. Despite the admitted 
limitations of the indicators of “quality” in their survey
(primarily, number and kind of advanced degrees, and the 
prestige of undergraduate and graduate degree-granting in- 
stitutions), many may find the favorable comparison between 
senior governmental and top for-profit sector officials surpris- 
ing and, perhaps, comforting. At the same time, there has 
been “a modest overall drop in morale and a more dramatic 
drop-off in the intensity of [job] satisfaction” among ca- 
reerists (p. 82). This decline, along with the persistent 
“guildlike features” (p. 75) of the career service (such as 
promotion from within, careers within single agencies), may 
contain warnings about the longer term adaptability of the 
federal executive to increasingly complex and volatile policy 
environments. 

Far noisier is the purported crisis of unresponsiveness. 
Both Nixon and Reagan entered office assuming that they 
confronted a mostly hostile executive branch. Twenty-five 
years ago, Aberbach and Rockman observed (“Clashing 
Beliefs within the Executive Branch,” American Political 
Science Review 70 [June 1976]: 456-78) that Nixon probably 
was correct: Most of the civil servants they interviewed in 
1970 reported being both Democrats and “liberal.” Nixon’s 
response, of course, soon came to be labeled the “adminis- 
trative presidency,” as he experimented with strategic place- 
ment of loyal appointees, reorganization, and impoundment 
to try to boost bureaucratic responsiveness. Reagan brought 
many of his predecessor’s objectives to fruition. The admin- 
istration placed Republicans in top career positions, espe- 
cially in “controversial” departments, such as Health and 
Human Services, Housing and Urban Development, and 
Education (pp. 107-8). Consistent with a more systematic 
and centralized personnel process, political appointees were 
more likely to be Republican and more conservative than 
they had been in the Nixon era. 

Despite George H.W. Bush’s less ideological posture, he 
also strove to shape congressional influence on the executive 
through, for example, the use of signing statements (p. 38). 
Meanwhile, senior careerists during the Reagan and Bush 
years reported having fewer contacts with Congress, interest 
groups, and the public as well as somewhat less perceived 
influence (p. 115). Whether the three presidents’ efforts to 
boost responsiveness, the growing conservative “Zeitgeist” 
(p. 169), or the decreasing attractiveness of federal govern- 
ment service to Democrats and liberals was most responsible 
for these shifts is impossible to tell from the Aberbach and 
Rockman data. Nonetheless, overall, the executive has been 
notably “responsive to a changing political environment 
and... instruments are in place to promote responsiveness”
(p. 127).

The relatively small number of executives Aberbach and 
Rockman were able to interview (never more than 228, in 
1986-87) militates against much fine-grained analysis of
similarities and differences among the agencies sampled. 
Even so, some further disaggregation by agency mission or 
primary unit task (e.g., regulation versus grant oversight)
would have been useful. More important, it is not fully clear 
how far the findings can be generalized. Robert Durant, for 
instance, calls into question the responsiveness of federal 
officials who work outside Washington (The Administrative 
Presidency Revisited, 1992). Similarly, those employed in 
national security or foreign policy departments (such as the 
Central Intelligence Agency or the departments of State and 
Defense) or units (e.g., the international trade division in the
Department of Commerce) may have quite different profiles 
from those in more “domestic” agencies. 

Aberbach and Rockman conclude by examining the some- 
what different issues of responsiveness that emerged from 
efforts in the Clinton era to “reinvent” government. This 
time, the demands of “customers” rather than elected officials
(or their appointed agents) were to be heeded. In the
Web of Politics pays special attention to the National Perfor- 
mance Review (NPR) and the Government Performance and 
Results Act, and it capably surveys a broad range of praise 
and criticism the initiatives have elicited. Most helpful perhaps
is the volume’s placement of reinvention efforts in the
context of the broader new public management movement, 
which has been influential in such countries as Australia, New 
Zealand, and Great Britain. Still, the tendency of many new 
public management (and reinvention) advocates to rely on an 
idealized “business model” rather than actual business prac- 
tice in their analyses might have been noted. Just as earlier 
efforts to “reform” U.S. national administration spotlighted 
value conflicts, so, too, Aberbach and Rockman contend, has 
reinvention. Not only is NPR ambiguous about accountability 
relationships, but also issues of political responsiveness (“to
which principals in a system of separated and divided powers
are bureaucratic agents to respond?”) remain “unresolved
and contentious” (p. 157). 

In the Web of Politics synthesizes important findings on the 
senior federal executive with other studies of the challenges 
of governing in the United States and other advanced 
industrialized countries. The work crystallizes the key nor- 
mative and analytical dimensions of efforts at administrative 

and oversight, and it underscores the profoundly 
political foundations of such initiatives. Throughout, the
authors’ reflections on continuities and changes in the U.S. 
(domestic) executive are nuanced, insightful, and firmly
anchored empirically. 

Aberbach and Rockman make a persuasive case that 
reformers’ persistent focus on “management improvement” 
is likely to have “at best marginal effects” on executive branch 
capacity (p. 176). “If the U.S. system produces complexity, 
contradiction, bloated or inefficient programs, and unusually 
high degrees of restriction on managerial latitude, that is 
primarily the product of politicians” (p. 188). Whether and 
how that system copes in an era of shrinking federal govern- 
ment employment, expanding shadow government, more 
complex and challenging tasks, and continuing public de- 
mands is a central concern of governance in the twenty-first 
century. 


Campaign Reform: Insights and Evidence. Edited by Larry
M. Bartels and Lynn Vavreck. Ann Arbor: University of 
Michigan Press, 2000. 259p. $69.50 cloth, $25.95 paper. 


Paul Gronke, Reed College 


Actors, evidence, and standards are the three watchwords of 
this volume on campaign reform edited by Bartels and 
Vavreck. Campaign reform is unlikely (and very possibly 
unnecessary) unless some shared understanding of each 
watchword is reached among politicians and policymakers, 
academic observers, and journalists. 

The book is an outgrowth of a Pew-funded task force on 
campaign reform and contains seven articles along with the 
task force report. Topics range from the content of cam- 
paigns (particularly the consequences of negative advertis- 
ing), to the nature of media coverage, to voter reactions 
(learning, turnout, declines in diffuse system support). All 
authors keep a primary eye on applying political science 
theories and evidence to practical questions of campaign 
reform. Does negative campaigning benefit candidates? 
Should media outlets provide “free time” to competing 
candidates and conduct “ad watches”? Can campaigns be 
conducted in such a way so as to reinvigorate, rather than 
depress, public interest, information, and enthusiasm about 
politics? Each article can be read as standing alone (perhaps 
too much so; see below), but each revisits the issues of actors, 
evidence, and standards. 

Public commentary on campaign reform typically focuses 
on candidates and their financial statements, but this volume 
identifies two other actors to consider: journalists and the 
citizenry. Marion Just and her coauthors, for example, show 
how the objective of candidates (win office) in many ways 
runs counter to those of the news media (gain viewers, curry
influence). Larry Bartels and Lynn Vavreck, in separate 
articles, note that the desire of voters for easily accessible 
information, interesting campaigns, and distinctive policy 
positions can run contrary to the aims of candidates, who may 
be interested in maximizing votes, blurring distinctions, and 
otherwise wooing the median voter. 

An even larger set of actors could be included. Vavreck 
speculates how campaigns in general affect the legitimacy of 
democratic leadership more generally, and Buchanan worries 
that campaigns may reduce civic engagement by the nonvot- 
ing public. Any campaign reform must address the complex 
interaction of multiple actors, all involved to various degrees 
in political campaigns. 

Another notable contribution in this volume is the careful 
consideration and presentation of high-quality social scien- 
tific evidence. Multiple authors (Bartels, Geer, Vavreck, and 
Shaw), using separate data sets and different techniques, 
show that negative advertising has anything but a negative 
effect. The verdict is far more mixed. Similarly, although we
tend to blame candidates for negative campaigns, the evi- 
dence in this volume is consistent and relatively overwhelm- 
ing: Journalists are far more blameworthy for highlighting 
attack ads, fostering cynicism, and generally accentuating the 
negatives (see especially the contribution by Just et al.). This 
theme runs through many of the essays. The careful use of 
evidence is what one would expect from these scholars and is 
a refreshing antidote to emotional commentary on this issue. 
Also, the presentations are accessible to a wide range of 
audiences. 

Finally and most important, many of the authors agree that 
the strong reformist movement of the past few decades lacks 
a clear set of standards. What is wrong with the current 
political climate? What would constitute successful campaign 
reform? These are deceptively simple questions, but the devil 
is in the details. 

For example, is it more important that voters become 
highly informed (see Bartels, Geer, and Vavreck), or is a 
greater concern the corrosive effect of campaigns on public 
attitudes about the political system (see Buchanan)? Nega- 
tive ads seem to help voters discriminate among competing 
candidates (Geer, Bartels) but also reduce faith in govern- 
ment (Buchanan). Journalists are well positioned to inform 
the public about misstatements by candidates, but in doing so 
they may publicize the very negative ads that the “ad watch” 
is meant to expose (Jamieson and Waldman). 

Bartels and Vavreck do a fine job of highlighting conflict- 
ing standards. By far the best and most thought-provoking 
treatment of the various and competing standards for cam- 
paign reform, however, is found in the task force recommen- 
dations. This chapter should be required reading for any 
course on campaigns and elections, and it is a useful blue- 
print for both reformers and scholars. 

What is missing in the volume? Most important, it lacks 
both a historical and political science context. Many of the 
media and voter trends noted in the book first appeared in 
the 1960s, accelerated into the 1970s, and flattened out 
thereafter. This was a time when many large-scale changes 
occurred in American politics, including the rise of candi- 
date-centered elections. Perhaps all we are observing is a 
maturation in candidate-centered elections. If so, reforms are
virtually impossible without completely changing the political 
system. 

With respect to political science, a few authors note the 
conflict between the desire for voters to discriminate among 
candidates and the pressure for candidates to blur differences 
and master the art of ambiguity. More attention could have 
been given to models of candidate behavior, especially 
Downsian convergence and how it relates to campaign re- 
form. 

An introductory chapter could have provided a larger 
context. A unified bibliography might have been desirable, 
since many authors cite the same works, although the current
format allows teachers to use a single article or a few in
class. The index could be more extensive. These weaknesses 
indicate a light hand by the editors but are not serious flaws. 


Polarized Politics: Congress and the President in a Partisan
Era. Edited by Jon R. Bond and Richard Fleisher. Wash- 
ington, DC: CQ Press, 2000. 226p. $36.95 cloth, $24.95 
paper. 

Michael L. Mezey, DePaul University 

In introductory American politics courses, when we get to the
section on political parties, we often contrast the American
parties with their more disciplined European counterparts.
Under a European model, copartisans unite behind a coher- 
ent set of party principles and policy proposals that they 
present to the electorate, and party leaders in both the 
executive and the legislature exercise significant authority. In 
the following class, we then explain why such a system does 
not obtain in the United States: A federal system produces 
decentralized party organizations rooted in states and local- 
ities; single-member districts; the electoral connection and 
the localism that comes with it; the committee and seniority 
systems in Congress; the power of individual U.S. senators; 
and the constitutional system of separate institutions sharing
power.

As often happens, the conventional wisdom that we teach 
our students is modified by events. In this instance, beginning 
in the mid-1970s and continuing apace through the end of the 
century, the political parties in Congress came to take on 
some of the traits of the unified political parties typical of 
European parliamentary systems. At least that is what is 
suggested by rules changes giving more power to congres- 
sional party leaders and by rising party unity scores on 
roll-call votes. 

The seven excellent essays in Polarized Politics are all 
directed at the central question of the causes and conse- 
quences of the increasing level of party unity in Congress and 
its implications for presidential-congressional relations, espe- 
cially in the context of divided government. 

Gary Jacobson shows that voters increasingly connect their 
party identification with their own ideology. As John Aldrich 
and David Rohde demonstrate, this is reflected in Congress, 
where a growing ideological consensus within both parties 
has created the conditions for institutional reforms that 
facilitate party government. Tim Groeling and Sam Kernell 
argue that, despite these changes, American political parties 
remain limited in their ability to communicate a consistent 
message through their rhetoric. This in turn limits their
ability to deliver on their policy commitments. They conclude
that increasing party coherence has been achieved
primarily through negative commentary about the opposition 
rather than through a positive enunciation of the party’s 
positions. This point is reinforced by Kathleen Hall Jamieson 
and Erika Falk, who document an increase in the level of
incivility among members of Congress that coincides with
increased partisanship. This incivility reached its peak in the 
104th Congress, when the atmosphere was somewhat remi- 
niscent of the more rough and tumble environment of the 
nineteenth century. 

The implications of all this for presidential-congressional 
relations are a bit ambiguous. George Edwards and Andrew 
Barrett demonstrate that presidents can get their proposals 
on the congressional agenda, but final passage is largely 
determined by whether their party controls Congress. This is 
nothing new; the size of the president’s majority in Congress 
has always been the best predictor of presidential success. 
What may be clearer is that in the current era of more intense 
partisanship, the majority itself, rather than its size, may be 
the key determinant. 

Turning to divided government, Barbara Sinclair shows 
that such an environment does not preclude presidential 
success. In the six years of divided government during the 
Clinton administration, however, the tactics used by the 
president and by a more empowered set of congressional 
leaders changed to reflect the higher level of partisanship. In 
particular, the level of hostility between the two camps, as 
demonstrated in the Jamieson-Falk chapter, suggests the 
hard-ball turn that American politics took during that period. 

In a concluding essay, the editors point out that the more
homogeneous parties one finds in Congress today are the 
result of fewer cross-pressured members. This in turn is the
result of more homogeneous congressional districts and the
party realignment in the South since the advent of voting 
rights for African Americans. Endorsing the theory of “con- 
ditional party government” developed by Aldrich and Rohde, 
Bond and Fleisher conclude that the current situation may be 
subject to change if the condition of more homogeneous 
preferences within the party majority changes. But the higher 
level of partisanship in Congress has not reduced the respon- 
siveness of legislators to their constituents, or eliminated 
party mavericks who can swing the outcome of a close vote, 
or produced gridlock, even in situations of divided govern- 
ment. Although the interchanges may be nastier and the 
policymaking process more difficult, policy decisions are 
nonetheless reached. 

As is the case with many studies of this sort, the Senate is 
not seriously considered in most of the pieces, and this is a 
serious omission as we seek to understand the implications of 
changes in partisanship for congressional-presidential rela- 
tions. Left unanswered, as well, is how much of the decline in 
civility and the intensified partisanship is attributable to the 
personalities involved. , the visceral hatred of Bill 
Clinton that the Republican leadership displayed had as 
much to do with his personal characteristics as with his 
essentially centrist ideology. 

Anthologies are always chancy, but this one is a must read. 
The chapters are of high quality, present original data and 
original arguments, and fit together nicely. As a whole, they 
demonstrate that changes in and party organi- 
zation do matter in Congress, at least in the House, and they 
provide an important counterpoint to theories that tend to 
minimize the role of party in favor of an exclusive focus on 
individual legislators and their preferences. 


Advancing Public Management: New Developments in The- 
ory, Methods, and Practice. Edited by Jeffrey L. Brudney, 
Laurence J. O’Toole, Jr., and Hal G. Rainey. Washington, 
DC: Georgetown University Press, 2000. 320p. $65.00.

William Eric Davis, University of California, Riverside 

What is the role of public managers, and do they matter?
How should we study public management? Which strategies
for reform and innovation hold potential for improving the
success of public agencies? Which frameworks hold the most
potential for advancing the field of public management?
These four questions represent the organization and themes
of the book reviewed here.

If addressed competently, each question can advance 
public management’s ability to deliver on promises, improve 
service, and better satisfy public demands. More important, 
we will know whether success is being achieved in those 
respects. The book’s greatest strength lies in its diversity of 
approaches, methods, and frameworks. The opening chapters 
are devoted to the role of managers and illustrate the 
difficulties of measuring the effect of management. The most 
appealing aspect of Part I is that it seeks a greatly expanded 
definition of management, which for many years was simply 
considered something that managers did. This created con- 
ceptual and empirical difficulties. Part I also supports the 
notion that there can be no single recipe for management 
success. Much depends upon the external and internal con- 
texts. 

Laurence O’Toole sees a need for a broader but workable 
definition of management. For a broad definition to be
workable, we must assume that a manager “does something” 
that gives an indication of the limits of organizational struc- 
ture and the limits of employees. Following up on that, Anne 
Khademian asserts in chapter 3 that cultures tend to be 
deeply ingrained and difficult to shape, which limits the effect 
of management. Her suggestion is to acknowledge even more 
directly that culture is part of an organization’s institution, 
which implies that neoinstitutionalist literature may hold the 
best promise for future advances. Her chapter reflects the 
somewhat pessimistic assertion that culture does not so much 
represent a management tool as offer a way to understand the 
institutional context. That is, the culture will suggest whether 
there is a possibility for organizational governance or change. 

Thomas Hammond and Jack Knott use spatial theory to 
specify the conditions under which public agency managers 
can move policy in their preferred direction. Spatial theory 
requires simplifying assumptions that open it to attack for 
being unrealistic, thus impractical. Nonetheless, the goal of 
Hammond and Knott is to show the conditions under which 
an independent regulatory agency head can have influence in 
one or two policy dimensions, given the ideal preference 
positions of, respectively, the House, Senate, and president.
Using their version of McKelvey’s “chaos theorem” (Richard
D. McKelvey, “Intransitivities in Multidimensional Voting 
Models and Some Implications for Agenda Control,” Journal 
of Economic Theory 12 [1976]: 472-82), they do a good job of 
illustrating how agency heads, under certain narrow condi- 
tions, can achieve policy change closer to their preferred 
ideal point. They outline the environmental conditions and 
the particular individual skills that allow such an effect. An 
agent must be smart enough to know the size, shape, and 
location of the unbeatable “core” of policy space as well as 
when it is advantageous to preempt the House, Senate, or 
president by taking initiative. S/he must be an effective 
persuader and know how to frame issues to maximize policy 
advantage. In sum (although the authors do not put it this 
way), the agency head must know how to manipulate. 

Hammond and Knott present a purely formal theory, so 
they do not indicate the proportion of managers who meet 
the specified assumptions or do the things they list as 
important to success. Thus, they do not (and do not seek to) 
offer a definitive answer regarding whether managers matter. 
They simply note when the potential exists. Furthermore, 
since the manager in their example heads an independent 
regulatory commission with rulemaking power, s/he can act 
almost unilaterally and force the House, Senate, and presi- 
dent to meet the specified assumptions and seek to persuade 
the agency head, rather than vice versa. Perhaps a more 
dynamic model is needed that incorporates the actions and 
reactions of all four major participants plus some component 
to represent the limitations on the agency head imposed by 
other endogenous and exogenous factors (culture, structure, 
and so on). 

Part II is devoted to methodological issues and addresses
how we should study public management. Often in statistical 
analysis we run the models, report the results, and walk away 
from the project without closer inspection of outliers. The 
chapter by Kenneth Meier, Jeff Gill, and George Waller 
asserts that traditional statistical techniques do not serve 
scholars of public management very well because they are 
interested in high and poor performers, rather than the 
“typical” case identified by traditional ordinary least-squares
regression. They advocate the use of substantively weighted
least squares (SWLS), which weights each case according to 
substantive performance. High-performing school districts 
were identified with a studentized residual selection criterion. 
For subsequent analysis they were weighted more heavily
than typical and poor performers. The authors similarly
identified “failures” and analyzed them separately after also 
weighting them according to performance. 
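
As a rough illustration of the weighting idea described above, the following Python sketch fits an ordinary least-squares line, computes studentized residuals, and then refits with the high performers weighted more heavily than the rest. It is not the authors' code. The single-predictor setup, the function names, the 0.7 residual cutoff, the 0.1 down-weight, and the sample numbers are all assumptions made for this example.

```python
import math


def wls_fit(x, y, w):
    """Weighted least-squares fit of y = a + b*x. Returns (a, b)."""
    sw = sum(w)
    xbar = sum(wi * xi for wi, xi in zip(w, x)) / sw
    ybar = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - xbar) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - xbar) * (yi - ybar) for wi, xi, yi in zip(w, x, y))
    b = sxy / sxx
    return ybar - b * xbar, b


def studentized_residuals(x, y):
    """Internally studentized residuals from an unweighted OLS fit."""
    n = len(x)
    a, b = wls_fit(x, y, [1.0] * n)
    resid = [yi - (a + b * xi) for xi, yi in zip(x, y)]
    # Residual standard error with n - 2 degrees of freedom.
    s = math.sqrt(sum(e * e for e in resid) / (n - 2))
    xbar = sum(x) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)
    out = []
    for xi, e in zip(x, resid):
        leverage = 1.0 / n + (xi - xbar) ** 2 / sxx
        out.append(e / (s * math.sqrt(1.0 - leverage)))
    return out


def swls_fit(x, y, threshold=0.7, low_weight=0.1):
    """Refit with cases above the residual cutoff weighted 1.0 and all
    other cases weighted low_weight (both values are assumptions)."""
    t = studentized_residuals(x, y)
    w = [1.0 if ti > threshold else low_weight for ti in t]
    return wls_fit(x, y, w), w


if __name__ == "__main__":
    # Hypothetical data: x is a resource level, y is a performance score.
    resources = [1, 2, 3, 4, 5, 6]
    performance = [2, 3, 7, 5, 6, 10]
    print("OLS :", wls_fit(resources, performance, [1.0] * 6))
    print("SWLS:", swls_fit(resources, performance))
```

Comparing the two fitted lines shows how the estimated payoff from a resource can differ once the analysis is tilted toward the cases that outperform expectations. Failures could be examined the same way by flipping the sign of the cutoff.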

The point of the Meier et al. project was to remove average 
performers from the analyses entirely. A comparison be- 
tween the two groups showed that high-performing districts 
in each category (high optimizers versus failures) were able 
to get better from various resources (increases
in state aid, higher teacher salaries and instruction funds) 
than other districts. This chapter is certainly worth the 
attention of public management scholars. It might be inter- 
esting, however, to run the models (after applying their 
selection criterion) and weight each case by its standardized 
residual, rather than the somewhat arbitrary levels they 
chose. Nonetheless, their contribution supports the book’s 
theme that there can be no single recipe for improving 
success. 

Chapter 7 represents an interesting qualitative strategy for
learning about “street-level” workers in public agencies. 
Steven Maynard-Moody and Suzanne Leland asked line 
bureaucrats to tell stories about fairness, working with cli- 
ents, and life in the agency. The chapter would have been 
improved if subjects had been asked to tell a story about 
when they exercised judgment and it led to success, to tell a
story about a personal exercise in judgment that led to 
failure, and to state their beliefs about why. This not only 
would examine more directly decisions and discretion but 
also would allow better theorizing about factors that contrib- 
ute to success. Nonetheless, this omission does not detract 
from their major premise concerning the utility of stories as 
a learning tool. 

Part III concerns strategies for reform and innovation that
hold potential for improving the success of public agencies. It 
rejects the conventional lay wisdom that public agencies are 
resistant to change but notes many obstacles to it. The major 
task for the authors is to make sure change is reasoned.
Eugene McGregor offers a new vocabulary and three heuris- 
tic devices. He asserts that a common language and concep- 
tualization will improve the odds of bringing about desired 
change. His chapter is written more for the practical manager 
than the scholar, but both will find useful items. The other 
chapters in Part III offer their own frameworks for identifying
obstacles to innovation and avenues for change. They mostly
agree that the obstacles are formidable. One theme that 
appears in various places is the importance of culture, which 
is ignored by managers at the risk/cost of failure. 

Part IV emphasizes general frameworks and is the stron- 
gest section theoretically. The focus is on frameworks that 
hold the most potential for advancing knowledge and prac- 
tice. The chapter by Patricia Ingraham and Amy Kneedler is 
partly a summary of the literature, but it also offers a “perfor- 
in public agency management. They take us inside the famous,
or perhaps infamous, “black box” of public management in an
attempt to understand what takes place within. 

How competently does the book address the four organiz- 
ing questions? The question about the role of a public 
manager and whether managers matter is never really an- 
swered. Instead, we learn that managers have the potential to 
have an effect, notwithstanding certain types of limitations.
The contributors do not directly attempt to define that effect, 
although they intuitively assert that managers matter, but the 
section is really geared toward explaining why stronger 
answers have not yet been provided. Since that question is 
vastly more difficult and complex than the other three, the 
authors are to be forgiven for emphasizing the “role” of 
managers over whether they matter. The other parts of the
book come closer to providing answers to their respective
questions. 




Veto Bargaining: Presidents and the Politics of Negative 
Power. By Charles M. Cameron. Cambridge: Cambridge 
University Press, 2000. 292p. $59.95 cloth, $18.95 paper. 


Charles Tien, Hunter College, CUNY 


Veto Bargaining is an important book and a pleasure to read. 
It is important because it takes us a long way in understand- 
ing presidential veto politics. The power of the veto is a 
foundation of the checks-and-balances system and the sepa- 
ration of powers, and it is surprising that so few data have 
been collected on it. This book is a pleasure to read because 
it provides a framework for understanding the many absorbing
veto cases it includes. For example, Cameron shows that
congressional uncertainty about the president’s acceptable 
policy preferences can help the president wrest policy con- 
cessions from Congress. This helps us understand why Tru- 
man vetoed Republican tax cuts three times during the 80th 
Congress. By the time legislators found the right package to 
attract an override majority, the Truman vetoes had forced 
Congress to concede 15% off the cut for the wealthiest 
Americans. 

Cameron frames veto bargaining within the context of 
separation of powers, which summons institutional battles 
over policy especially during divided government. The main 
tool at the disposal of presidents during these battles with 
Congress is the veto. Cameron points out that presidents 
have been ready and willing to use this tool when government 
is divided and when the legislative stakes are high. More 
important, he shows that the veto has been a very effective 
factor in bargaining with Congress. Its use or threat has 
allowed presidents to wrest policy concessions from Congress.

Theoretical work argues that veto power is limited and
asymmetric: It allows presidents to get less out of Congress 
than what Congress wants to give but no more than what 
Congress wants to give. Cameron builds on earlier models 
and introduces uncertainty (about the policy preferences of 
the president and about the pivotal member of Congress 
whose vote can override a veto) into the veto bargaining 
game. The result is the view that veto power is “much more 
consequential than is commonly believed” (p. 26). 

There is much to like about the book. The rational choice 
models are grounded in real-world politics. The research is
more than a series of mathematical exercises and increases 
our understanding of veto bargaining. The applied rational 
choice models provide new intuition about veto politics 
between Congress and the president. Furthermore, Cameron’s
book should quiet critics who argue that empirical
testing of the rational choice models is inadequate. The
models developed in chapter 4 are thoroughly tested empir- 
ically in chapter 6, with original data, and in chapters 8 and 9, 
with many succinct case studies. 

The amount of original data collected for this book is 
impressive and informative. Cameron compiled event histo- 
ries of 434 vetoes from 1945 to 1992. These reveal that most 
vetoes occur during divided government (71%); very few 
appropriations bills are vetoed (8%, or 34 total); and a 
significant number of vetoes occur in chains (41%), that is, 
either Congress passes another version of the bill, or the 
president vetoes the bill more than once. Cameron also 
classifies all bills passed between 1945 and 1994 into one of 
four categories according to their significance. From these 
classifications we learn that (1) the majority of vetoes occur
over minor bills (56%); (2) vetoes of landmark and important 
legislation usually occur in veto chains (65%); and (3) most 
vetoes of minor legislation are final (70%), that is, there are 
no other bills or vetoes regarding the matter. 

The book is quite comprehensive in its coverage, but there 
is no mention of the line-item veto. Every president since 
Jimmy Carter has asked for the authority to single out items 
for veto on spending bills after signing the other parts of the 
bill. Congress gave this power to the president in 1996, but in 
1998 the Supreme Court ruled it unconstitutional in Clinton 
v. City of New York. The issue is important, as 43 of the 50 
governors have line-item veto power. 

All students of the presidency and Congress should read
this book. The importance of the findings and the breadth of 
the original data collected and interpreted alone make it 
worth reading. This should be required reading in graduate 
courses on the presidency and on legislative behavior. It is 
also an excellent example of how to do rational choice 
research and do it well. The book is probably not appropriate 
for most undergraduates, especially those who have not been 
exposed to formal theory, although advanced undergraduates 
would benefit from the findings in chapter 2, based on data 
from the post-World War II era. 


Institutional Constraints and Policy Choice: An Exploration 
of Local Governance. By James C. Clingermayer and 
Richard C. Feiock. Albany: State University of New York 
Press, 2001. 151p. $17.95 paper. 


David R. Elkins, Cleveland State University 


For more than twenty years, urban scholars have debated 
whether economic determinism and its fiscal implications 
trump municipal political action in local government policy 
choices. In a very real sense, the debate is about whether 
local politics matters. Somewhat lost in this discussion, 
although not entirely, is an issue that was once at the 
forefront of urban scholarship: the role of institutional 
structures. With the maturing theoretical interest in the new 
institutionalism, the time is ripe to revisit this area and 
determine whether institutional structures matter. According 
to James Clingermayer and Richard Feiock, they do. 

The term “exploration” in the subtitle is an apt description 
of this work. Using seven separate data sets (three are 
derived from the authors’ surveys conducted during the 
1980s, three from 1980 Census data, and one from the 
International City Management Association for 1988), the 
authors explore issues as diverse as economic development
decision making, municipal practices associated with zoning, 
citizen-initiated contacts and casework activities of council 
members, contracting out for services, and debt financing 
decisions. In addition, they explore how external constraints 
associated with annexation policy, exclusionary zoning prac- 
tices, and the Tax Reform Act of 1986 affect municipal 
policymaking. The breadth of the empirical analysis makes a 
convincing case that urban scholars should seriously consider 
the role urban institutional structures have in shaping local 
policy decisions. Indeed, Clingermayer and Feiock hope that 
their work will reenergize research in this area. 

The authors define institutionalism as the “formal and 
informal rules operating within or across organizations” (p. 
2). In their analysis of municipal governments, they place 
great emphasis on the formal rules of institutions. It is argued 
that institutions are important because they provide stability,
shape choice sets, and structure decision making. Specifically, 
“electoral rules and constituency boundaries affect policy 
makers’ behaviors by determining what sorts of voters they 
must please if they are to remain in office or rise to higher 
office” (p. vii). Although the research is firmly anchored in 
the new institutionalism, there is also a strong linkage, which 
the authors readily acknowledge, to the classics in this field. 

The effects of institutional features on municipal economic 
development activities are examined from several angles. 
First, the authors explore how differences in structural fea- 
tures, particularly those of elected officials, are associated 
with five separate economic development policies and prac- 
tices. Second, they ask whether the adoption of comprehen- 
sive zoning is attributable to either market failure or distrib- 
utive politics. Third, they turn to the question of how 
institutional structures shape development-based constituen- 
cy-related experiences of council members. The conclusion is 
that institutional structures do matter with regard to these 
varied economic development activities, but institutional 
effects are subtle. 

Clingermayer and Feiock also examine the decision-mak- 
ing effects of turnover in leadership and council membership. 
They propose two general explanations for the influence of 
turnover on local policymaking. The first suggests that private 
organizations involved in negotiations with a city that has a 
high level of turnover may be apprehensive about the city’s 
ability to fulfill long-term obligations, and this political 
uncertainty drives up the transaction costs associated with 
negotiations. The second explanation hinges on the attempt 
by elected officials to avoid blame for potentially controver- 
sial decisions or to concentrate benefits to enhance political 
prospects. Clingermayer and Feiock find support, albeit 
weak, for both explanations but along an intriguing faultline. 
They find that administrative turnover is inversely associated 
with the level of services contracted out and that mayoral 
turnover is directly associated with contracting out. Also, in 
cities with council-manager structures, turnover in council 
membership is inversely associated with various measures of 
municipal debt. 

Finally, the authors explore how state and federal decisions 
affect municipal policymaking. Their analysis of annexation 
and exclusionary zoning suggests that state-level attempts to 
limit the discretionary authority of local officials have 
succeeded. They also find that the Tax Reform Act of 1986 
reduced the level of municipal revenue bond debt. 

The book is not without flaws. The use of multiple data sets 
and methods creates a trade-off between breadth of analysis 
and continuity of argument. The authors remain focused on 
the exploration of the institutional dynamics of municipal 
policymaking, but the book seems to lack the unified theme 
typical of research based on fewer data sources. In some 
instances it seems this is a collection of discrete articles, with 
little connection between one chapter or section and another. 
The book would have benefited from an appendix with 
details about the various data sets, the construction of 
variables, and some of the statistical problems associated 
with the analyses. For instance, regarding citizen-initiated 
contacts, Clingermayer and Feiock note that “usable responses 
were received from 234 council members” (p. 39), 
but the statistical analysis refers to an N of 177 (p. 41). Why 
were so many cases dropped? Although I am inclined to find 
brevity a virtue, further development of some ideas would 
have been helpful. For instance, the discussion of time 
horizons is intriguing and warrants elaboration. 

These criticisms should not dissuade urban scholars from 
reading this worthwhile contribution. It undoubtedly will be 
cited among the classics in the institutional-structural expla- 
nations of municipal policymaking. Indeed, drawing on this 
earlier generation of scholarship, Clingermayer and Feiock 
conclude that institutions matter but in ways that “mediate 
and interact with other factors” to influence outcomes (p. 
123). Whether Clingermayer and Feiock accomplish their 
goal of reinvigorating the institutional perspective in urban 
scholarship remains to be seen, but their exploration is a 
critical first step. 


Empire on the Hudson: Entrepreneurial Vision and Political 
Power at the Port of New York Authority. By Jameson W. 
Doig. New York: Columbia University Press, 2001. 620p. 
$49.50. 


Steven P. Erie, University of California, San Diego 


Empire on the Hudson is a dual biography of the Port of New 
York Authority, America’s first semiautonomous, self-financ- 
ing public development agency, which helped inspire the 
Tennessee Valley Authority, and of the individuals who 
shaped it. It is also much more. This case study of local 
government’s catalytic role in regional economic develop- 
ment is a major contribution to the twentieth-century history 
of the New York metropolitan region. In an era when 
bureaucracy has become synonymous with failed government 
programs, the Port Authority is a valuable reminder of the 
positive contributions that public agencies can make. It also 
serves as a useful prism through which the author views and 
critiques relevant theories of political leadership. 

Doig shows that the Port of New York Authority, char- 
tered in 1921 by the states of New York and New Jersey, was 
a latter-day child of the Progressive reform impulse of the 
early twentieth century, which sought rational planning by an 
activist government, purged of corruption and infused with 
“business-like” efficiency. The initial impetus was the fear 
that the region’s magnificent waterfront might lose its pre- 
eminent status to other East Coast ports because of the 
additional time and costs entailed by docking facilities lo- 
cated mostly in New York but rail terminals located mostly in 
New Jersey. 

The newly created authority did, indeed, have a life of its 
own. From the 1920s through the 1950s, it evolved from the 
relatively narrow mission of rail-freight planning to a much 
broader mandate to build and operate bridges and tunnels, 
assume the dominant role in regional air transport (at 
Newark, LaGuardia, and Kennedy), and construct a contain- 
erized marine terminal, a gigantic mid-Manhattan bus termi- 
nal, and a new arterial highway system. 

The financial underpinnings of this “imperial” bureaucracy 
were the bridge and toll revenues that allowed the authority 
to self-finance most of its capital projects by issuing revenue 
bonds rather than depend on local and state governments for 
funding. In classic survival-of-the-fittest fashion, it success- 
fully fought off challenges from both within and without the 
region. New York Mayor John F. Hylan in the 1920s mobilized 
Tammany Hall against the Port Authority, but he was 
forced into retirement by the authority’s good friend, Gov- 
ernor Al Smith. Robert Moses, the region’s well-connected 
transportation “power broker,” tried unsuccessfully to block 
the authority’s new airport management role as well as its 
Manhattan bus terminal, but he later cooperated with the 
authority on bridge and highway improvements. Earlier, 
there was a serious challenge from Washington when the 
New Deal attempted to end the tax-exempt status of municipal 
bonds. This was defeated by a nationwide, ostensibly 
“grassroots” campaign skillfully orchestrated from the offices 
of the Port Authority. 

Doig quotes Ralph Waldo Emerson’s aphorism that insti- 
tutions are “the lengthened shadow of one man” but credits 
three individuals. Julius Henry Cohen, the authority’s general 
counsel from 1921 until 1942, first emerged into prominence 
as a garment industry lawyer who promoted the Brandeisian 
cause of commercial arbitration of labor disputes. He carried 
his cooperative principles into the public sector and crystal- 
lized the idea of bi-state partnership embodied in the 1921 
Port Compact. Swiss-born Othmar H. Ammann, the author- 
ity’s chief engineer from 1928 to 1939, lobbied for, designed, 
and supervised the construction of the George Washington 
Bridge and the Lincoln Tunnel. He returned in the 1950s to 
design the second deck of that bridge as well as the new 
Throgs Neck Bridge and the Verrazano-Narrows spans. The 
third creative bureaucrat was Austin J. Tobin, the authority's 
assistant attorney and then its executive director. A graduate 
of Holy Cross College who lost his interest in theology, he 
committed himself to the authority as a crusade. He ran the 
campaign that defeated the aforementioned New Deal policy 
in the 1930s and then field marshaled the authority’s new 
initiatives in the 1940s and 1950s as airport manager, bus 
terminal operator, and highway engineer. 

The author impartially documents not only the authority's 
successes but also its failures and periods of drift and 
retrenchment. In the 1920s, the authority was fought to a 
standstill by the dozen railroads that served the metropolitan 
region and had no interest in centralizing their competitive 
operations. Declining revenues during the Depression and 
the early years of World War II coincided with the ascen- 
dancy of Frank C. Ferguson as Port Authority chairman. A 
conservative investment banker, he focused his tunnel-vision 
on balance-sheet considerations. Tobin’s career from the late 
1950s to his abrupt retirement in 1971 (after alienating 
governors of both states) was a mixed bag. His successes in 
constructing new bridges and highways were counterbalanced 
by a financially disastrous mass transit experiment, the bil- 
lion-dollar grandiosity of the World Trade Center project, 
and the defeat of a proposed jetport in northern New Jersey. 

Doig’s concluding chapter covers in less detail the embat- 
tled post-Tobin era. Since the 1970s, the authority has 
alternated between “malaise” or retrenchment and contro- 
versial new initiatives that the author views as disguised 
attempts to divert its revenues into nontransportation 
projects run by local and state politicians. Critics, including 
New York Mayor Rudolph Giuliani, want to scrap the 
authority, but Doig hopes for a rebirth of creative leadership 
in the twenty-first century. 

This gracefully written book is a significant contribution to 
the literature. Doig offers a useful corrective to Robert 
Caro’s (The Power Broker, 1974) understandable overempha- 
sis of the omnipotence of his protagonist, Robert Moses. 
Doig also provides a sensible critique of James Q. Wilson’s 
view that public bureaucracies invariably are passive respond- 
ers, of necessity driven by external forces, rather than proac- 
tive movers of events, motivated by creativity and innovation 
emanating from within. Finally, Doig gives deserved atten- 
tion to the ethical dimension of public leadership, specifically 
the need to balance organizational efficiency with democratic 
accountability. In so doing, Empire on the Hudson joins the 
front rank of scholarship on public enterprise and enterpris- 
ing public entrepreneurs. 


Racist America: Roots, Current Realities, and Future Reparations. 
By Joe R. Feagin. New York: Routledge, 2000. 
311p. $25.00. 


Robert C. Smith, San Francisco State University 


Joe Feagin is the leading social science authority on racism in 
the United States. A sociologist, Feagin wrote a series of 
seminal articles on institutional racism during the 1970s, 
followed by numerous essays and several books. Racist Amer- 
ica is his most ambitious work on the subject. Its purpose is 
to develop an antiracist theory and analysis that will explain 
the phenomenon but also indicate ways to change and 
eventually eradicate it. According to Feagin, there are fairly 
well-developed theories and shared concepts to study systems 
of class and gender oppression in America, but the social 
science community has not yet reached a consensus on how 
to define racial oppression and give it operational meaning, 
and there is no well-developed theory to account for its 
origins and persistence. 

To remedy these deficiencies Feagin develops a theoretical 
framework centered on the concept of “systemic racism.” 
Racism is manifested throughout the economy, polity, edu- 
cation, religion, and the family. This theoretical reality is 
reflected empirically in a complex array of antiblack prac- 
tices; in the unjustly gained economic resources and political 
power of whites; and in the ideologies and attitudes of white 
supremacy that developed in order to rationalize the system. 
Feagin avers that a pluralistic analysis of oppression in the 
United States is ultimately necessary because “class structured 
capitalism, sexism, bureaucratic authoritarianism and 
homophobia are all important parts of the webbed package 
of oppressions internal to U.S. society” (p. 6). Yet, such an 
analysis is, in Feagin’s judgment, premature. Therefore, 
Racist America focuses mainly on white-black oppression 
(with passing references to how other oppressions interact 
with it) because it is paradigmatic for the other forms of 
oppression in America. 

After sketching out the theoretical perspective, Feagin 
provides six chapters of historical analysis and empirical data 
to illustrate the theory’s utility. The empirical data include an 
extensive mining of the social science literature, government 
and private studies and reports, and results from numerous 
field research projects Feagin and his associates have con- 
ducted in the last decade. These projects include, among 
other things, hundreds of interviews with blacks from all 
walks of life about their experiences with racism and with 
whites about their views of blacks and of racism. 

Chapters 1 and 2 are historical narratives about the 
creation of the systems of racism, sexism, and economic 
exploitation that are at the foundations of America. Feagin 
notes that under pressure from the abolitionist and civil 
rights movements some progress was made in the 1860s and 
1960s in modifying some of the cruder manifestations of 
systemic racism, but the system has remained largely intact. 
For example, he describes the reforms of the 1960s as modest 
efforts designed to fit some blacks, women, and other racial 
minorities into the society without attempting to dismantle 
the white-male-dominated hierarchical system itself. Thus, he 
concludes, all the talk about equality since the 1960s is a 
“ruse and illusion” (p. 36). 

Chapter 3 examines the origins and evolution of the 
ideology of white supremacy; chapter 4 examines antiblack 
images in the media; chapter 5 analyzes the data on racism in 
the everyday practices of whites and experiences of blacks, 
including institutional manifestations in housing, education, 
employment, business, public accommodations, and criminal 
justice. The sixth chapter shows how blacks are damaged in 
material and psychological wellbeing by racism and how 
whites are privileged in psychic esteem and material benefits 
by it. The last chapter presents strategies and policies for the 
“uprooting and replacement of the existing systems of racial- 
ized power” (p. 270), focusing on the need for a multiracial 
movement of a large scale, with destabilizing protests that 
will alter the white elite’s calculus of the cost of maintaining 
the system. 

Feagin’s ambitious effort to place racism in a systems 
framework is an important contribution that should be widely 
read and used in research on racism in the United States. It 
is a refreshingly comprehensive departure from the seem- 
ingly omnipresent books since the 1970s that focus on the 
declining significance of or even the end of racism in Amer- 
ica. The work, however, is not without problems that may 
limit its use both in the research process and in the develop- 
ment of antiracist strategies and movements. 

The most basic limitation is the theory of systemic racism 
itself. As briefly sketched out by Feagin, the theory is 
ambiguous. It is not clear, for example, whether racism is 
systemic in the sense of being a core, defining value of 
America equivalent to capitalism, democracy, and constitu- 
tionalism, or whether it is simply systemic in the sense of a 
complex, interdependent, interactive series of components. A 
related but distinct ambiguity is whether the theory suggests 
that racism may not be a core or defining American value but 
is nevertheless so integral to the operation of the economy 
and polity that its elimination requires the elimination or at 
least radical modifications of capitalism and the existing 
constitutional basis of the democracy. Clarifying work on 
both of these aspects is necessary before the theory can be 
used in research and in antiracist activism. 

Feagin refers to the concept of oppression throughout the 
book without ever defining it. This will not do. As he himself 
writes, “concepts delineating and probing racism need to be 
clear and honed by everyday experience” (p. 4). This is 
especially important because the term is used to encapsulate 
the experiences of women, homosexuals, and recent immi- 
grants as well as African Americans, and, as Feagin knows, 
the experiences of these groups are not isomorphic with 
those of blacks. It is useful, therefore, to distinguish between 
oppression and such related but distinct phenomena as 
racism, discrimination, prejudice, xenophobia, and stereotyp- 
ical thinking and behavior. Finally, how does one measure 
the decline of oppression? If the changes brought about by 
the civil rights movement of the 1960s were modest and a 
“ruse and illusion,” what would major and real changes look 
like? Indeed, short of absolute equality in psychic and 
material wellbeing among all groups, how would we know 
when systems of oppression have been uprooted and elimi- 
nated? 

Feagin’s work points the study of American racism in the 
right direction. As the author observes in the first chapter, 
the social scientific examination of U.S. racism is in the early 
to middle stages of theoretical and empirical analysis. La- 
mentably, this is true even after more than a hundred years of 
studies. Most of the scholarship on race from its beginnings 
until the 1930s reflected a racist and white supremacist 
character, and since then the social science community— 
political scientists especially—has been unwilling to devote 
the necessary interest and resources to racism in graduate 
studies, research, and academic journals. Feagin, whatever 
the limitations of this book, has made an important contri- 
bution that will advance the study of the realities of racism in 
America. 


Politicians Don’t Pander: Political Manipulation and the 
Loss of Democratic Responsiveness. By Lawrence R. Ja- 
cobs and Robert Y. Shapiro. Chicago: University of Chi- 
cago Press, 2000. 425p. $50.00 cloth, $17.00 paper. 


Jeffrey E. Cohen, Fordham University 


Many observers of contemporary American politics argue 
that political leaders are overly sensitive to public opinion, 
that they pander to the public. Lawrence Jacobs and Robert 
Shapiro suggest that this view is wrong on two counts. First, 
at least for the past two decades, politicians have not been 
very responsive to public opinion. Indeed, they are likely to 
try to manipulate it. Second, from a democratic theory 
perspective, Jacobs and Shapiro muse upon the oddity of 
viewing responsiveness in a democracy as disreputable (p. 
xiv). 

The authors pursue three major tasks. First, they attempt 
to explain why responsiveness to public opinion has declined 
over the past two decades. Second, they build a theory of the 
dynamics of contemporary American politics, which they test 
on two important cases, the Clinton health care initiative and 
the Republican Contract with America. Third, they redirect 
discussion about the role of leadership and responsiveness in 
democracy and democratic theory. This is a large agenda for 
any project. The authors succeed quite admirably and have 
produced one of the most important books on American 
politics that I have read in years. 

In chapter 1, Jacobs and Shapiro critique the idea that 
politicians pander and discuss the limitations of theory and 
research on the lack of responsiveness in current American 
politics. In chapter 2 they lay out their theoretical framework. 
The next five chapters provide a detailed case study of the 
Clinton health care initiative. Chapters 3 and 4 focus on the 
behavior of the protagonists and antagonists in the policy 
debate. Chapters 5 and 6 present a detailed look at media 
coverage of the health care debate, and chapter 7 focuses on 
the public response. Chapter 8 presents a less detailed case 
study of the Contract with America. The book concludes with 
two chapters that deal with issues of democratic theory. 
Chapter 9 discusses the dilemmas of modern democracy, 
especially the implications of crafted talk and leader nonre- 
sponsiveness. Chapter 10 offers suggestions about how to 
increase responsiveness to public opinion but at the same 
time provide leadership and improve the quality of demo- 
cratic deliberation. 

Jacobs and Shapiro begin by suggesting that responsive- 
ness has declined and marshal considerable supporting evi- 
dence from other scholars. They then address why this 
decline has occurred and point to five major factors: polar- 
ization of the political parties, the rise of individual indepen- 
dence in Congress, the effects of increased incumbency, the 
proliferation of interest groups, and increasingly divisive 
interbranch relations. Along the way, they also critique the 
two major theories that explain the behavior of politicians: 
electoral incentive models, which emphasize responsiveness 
to the median voter, and policy motivation models. They do 
not eschew either, but they forcefully argue that each is 
incomplete. 

Jacobs and Shapiro then move to their second task, 
building a theory of contemporary American politics. Their 
framework is extremely rich and subtle and cannot be easily 
summarized here. Their theory deals with the interconnec- 
tions among politicians, the mass media, and public opinion. 

Politicians, according to Jacobs and Shapiro, currently 
have strong incentives to pursue their policy preferences 
rather than respond to public opinion, but they also have 
strong incentives to appear responsive, which Jacobs and 
Shapiro term “simulated responsiveness.” This is done by 
“crafting talk,” which is a major conceptual contribution of 
this work. That is, politicians track public opinion to identify 
the words, arguments, and symbols that the public finds most 
appealing with regard to particular issues. These are then 
used to prime the public, not so much to persuade, but to 
raise the issue to the top of the agenda. Policy advocates tend 
to prime the public to the benefits of a policy and avoid 
discussing the costs; opponents prime the costs, encouraging 
the public to view proposed reforms as threatening. 

The media pick up on this policy debate, and Jacobs and 
Shapiro make an important argument in this regard: The 
heightened attention in the media to political conflict may 
reflect reality and is not merely the imposition of media 
practices on political discourse. Still, the media amplify the 
conflict. As a result of the behavior of competing politicians 
and media coverage, public uncertainty about the benefits 
and costs of policies is elevated. Opponents are often advan- 
taged, because managing uncertainty is easier than keeping 
the public focused on potential benefits. As a consequence, it 
is hard to implement reforms. Furthermore, the emphasis on 
crafted talk and the atmosphere of conflict breeds public 
disaffection with politics and politicians. This brief summary 
does not do justice to the richness and sweep of the theory, 
which deals essentially with the core operation of the current 
political system. 

The empirical aspects of this study are no less ambitious. 
To test the theory requires data on the behavior of politi- 
cians, media news coverage, and the reactions of the public. 
Jacobs and Shapiro collect a wide variety of data, including 
content analyses of politicians’ public statements, interviews 
with politicians and other participants, content analyses of 
the news media, and public opinion polls. 

Jacobs and Shapiro also address important themes in 
democratic theory. They try to find a balance between 
leadership and responsiveness, which they term “responsive 
leadership.” Their discussion blends normative political the- 
ory with empiricism and bridges the divide between the two 
approaches, which is rarely done. 

There is little to criticize in this important book, although 
more cases to test the theory would have been desirable. The 
Clinton health care initiative and the Contract with America 
were not fully enacted. The theory predicts that, outside 
election periods, it will be difficult to enact major reforms. 
Yet, some important policy changes were implemented in the 
1990s, such as NAFTA, and a major tax reform was passed in 
2001. What accounts for the success of these major policy 
efforts? How would the theory take account of these cases? 
Jacobs and Shapiro provide a fertile agenda for future 
research. 

We apply too readily the label “instant classic.” This book 
not only deserves that label but also will redirect thinking 
about American politics and will be read for years to come. 


Presidents, Parties, and the State: A Party System Perspec- 
tive on Democratic Regulatory Choice, 1884-1936. By 
Scott C. James. Cambridge: Cambridge University Press, 
2000. 318p. $59.95. 


Marc Allen Eisner, Wesleyan University 


Following a narrow victory, a president seeking to strengthen 
his party’s coalition begins to embrace regulatory initiatives 
that appear to be at odds with the commitments of his party 
and the demands of core coalition members. Yet, if these 
efforts secure the support of pivotal voting blocs and ensure 
future victories in the Electoral College, many loyalists would 
be satisfied. After all, control of the presidency is necessary if 
one wants to control the policy agenda, and that control is 
impossible without reelection. 

This drama, played out in the months following the 2000 
presidential election, is as old as the regulatory state itself, 
according to Presidents, Parties, and the State. Although Scott 
James focuses on three earlier episodes in American political 
development, he offers important insights into the evolution 
of the regulatory state more generally. At first glance, one 
would not expect the current occupant of the White House to 
embrace new initiatives in environmental protection. But 
who would have expected an antistatist Democratic Party to 
promote the Interstate Commerce Act? Who would have 
anticipated that a party strongly opposed to the rule of reason 
would advocate a Federal Trade Commission along the lines 
suggested by Teddy Roosevelt? 

The core argument is that pressure-group and district-level 
models of legislative choice do not provide adequate ac- 
counts of regulatory choice during 1884-1936. A more 
satisfactory explanation is the party system perspective, which 
places national party competition for the presidency at the 
heart of regulatory choice. “In confronting the problems of 
industrial capitalism, Democrats were forced to choose be- 
tween the short-term desire to enact policies consistent with 
the long-standing party commitments, or jettison these com- 
mitments in deference to the preferences of electoral groups 
whose political support was pivotal to the consolidation of 
governing power. It was a Hobson’s choice because in the 
world of democratic politics, the stabilization of a winning 
electoral coalition was a precondition for anything else the 
Democratic party might reasonably hope to accomplish in 
office” (p. 268). 

After framing his party system perspective, James presents 
three case studies: the Interstate Commerce Act of 1887, the 
Federal Trade Commission Act of 1914, and the Public 
Utility Holding Company Act of 1935. Each explores the incentives 
created by the Electoral College and party system, the 
regulatory preferences of the voting blocs deemed essential 
to victory, and the historical and political contexts that 
shaped policy alternatives and the intervention of party 
leaders. James provides a lively but theoretically informed 
account of legislative action and coalition building that 
integrates careful historiography with quantitative analyses of 
congressional voting behavior. Each case study is situated 
within the larger dynamic of evolving relationships among the 
parties, the president, and Congress; the decline in congres- 
sional party government; and the emergence of a president- 
centered polity. 

In the case of the Interstate Commerce Act, deeply seated 
antimonopoly and antistatist impulses within the Democratic 
Party generated strong opposition to the creation of a railroad 
commission staffed with experts and charged with the task of 
administering the industry. Yet, business Mugwumps had assumed 
the status of a swing group in the presidential election of 
1884. Grover Cleveland and other party leaders needed to 
obtain their allegiance if they wanted to increase the chances of 
future electoral success. This required the passage of the 
Interstate Commerce Act over the objections of agrarian Dem- 
ocrats. The demands of coalition maintenance and the incen- 
tives created by the Electoral College trumped the historic 
commitments of the party. 

The Federal Trade Commission Act involved many of the 
same issues. The Democratic Party’s antitrust program was 
strongly opposed to the Supreme Court’s rule of reason, 
corporate consolidation, and any efforts to manage this 
consolidation through the creation of expert commissions. 
The party demanded instead that new laws be passed to force 
deconcentration and unrestricted market competition. By 
1914, an impending depression led many to counsel Wood- 
row Wilson to suspend the pursuit of new antitrust legisla- 
tion. Teddy Roosevelt and the Progressive Party threatened 
to dominate the reform debates and link Democratic policy 
failures to the growing economic malaise. Understanding the 
potentially disastrous electoral consequences for the midterm 
elections and his own reelection, Wilson embraced the 
Federal Trade Commission Act. Its passage would allow the 
party to appeal to Progressives and organized business, 
thereby strengthening the Democratic coalition. As in the 
earlier episode, coalition building would carry a price: the 
alienation of agrarian Democrats. 

The case of the Public Utility Holding Company Act 
(PUHCA) of 1935 returns to the theme of coalition mainte- 
nance. Franklin Roosevelt’s Democratic Party had yet to 
consolidate its hold on power. This may have been impossible 
without the support of midwestern Republicans, who had 
become disenchanted with the collectivism of the New Deal 
and the lack of structural reform. Roosevelt met the demands 
for reform with PUHCA. It was a death sentence for utility 
holding companies and struck at the heart of the so-called 
Power Trust, thereby exhibiting the Democratic Party’s com- 
mitment to reform. Ironically, the demands of coalition 
maintenance created incentives for Roosevelt to move to the 
left of congressional Democrats and take up the antimonop- 
oly impulses of earlier generations of agrarian Democrats, 
who had been sacrificed previously in the name of electoral 
victory. 

The final chapter of Presidents, Parties, and the State 
explores a number of provocative questions. For example, 
why was the Democratic Party—but not the GOP—forced 
repeatedly to choose between well-established party commit- 
ments and the demands of electoral blocs whose political 
support was pivotal to coalition building? Given the compro- 
mises involved in building and preserving coalitions, what was 
the regulatory legacy of Democratic agrarians? These are 
important questions for anyone interested in American po- 
litical development and policy history. Although one might 
wish for some sustained discussion of the implications of the 
party system perspective for regulatory choice in the contem- 
porary period, James has accomplished much with this 
volume. It makes a significant contribution and should have 
an effect on future discussions of the role of parties, elections, 
and presidents in the construction of the regulatory state. 


Race, Neighborhoods, and Community Power: Buffalo Poli- 
tics, 1934-1997. By Neil Kraus. Albany: State University of 
New York Press, 2000. 294p. $19.95 paper. 


David L. Imbroscio, University of Louisville 


This book grapples with what are perhaps the two central 
questions preoccupying the academic study of cities in the 
postwar era, questions that are quite familiar to political (and 
other social) scientists. The first asks who (or what) governs 
cities, that is, the community power question. The second 
examines why (or how) cities became marked by—to use 
Jonathan Kozol’s powerful phrase—savage inequalities, es- 
pecially along racial lines. The first concerns process: how 
political issues are resolved and the relative power and 
influence of actors and structural forces in their resolution. 
The second involves outcomes, specifically, the pervasiveness 
of deeply entrenched poverty spatially concentrated in the 
older neighborhoods of the urban core. 
Through an examination of the political processes and
resulting outcomes in one American city, Buffalo, New York, 
Neil Kraus admirably aspires to address both questions 
concurrently. Based on his study of housing, redevelopment, 
and education during more than a half-century, Kraus offers 
provocative contentions regarding both questions, as he 
challenges to some degree the conventional wisdom sur- 
rounding each. 

First, Kraus postulates that the strong focus on the influ- 
ential role of private sector elites who act largely behind the 
scenes—whether expressed through the current dominant
paradigm in community power studies, regime theory, or
early elitist theories—is “somewhat misplaced” (p. 11), at 
least in regard to issues that affect residential patterns and 
neighborhoods (as opposed to downtown redevelopment). 
Instead, he revives and embraces some of the key tenets of 
pluralism; that is, the nature of community power can be 
understood by examining overt, clearly visible, and publicly 
debated decisions. The core of Kraus’s story is about race, 
location, and racism, as whites in Buffalo used their numerical
majority to limit residential choice and economic opportunity
for African Americans via the traditional and straightforward
processes of elections and political representation.

As for the second question, Kraus contends that concen- 
trated urban poverty can be strongly linked causally to 
politics, as opposed to economic or demographic trends, 
because “political structures have played a critical role in 
ghetto formation” (p. 11). This stands in opposition to the 
standard explanation, which focuses largely on the timing of 
southern black migration to northern cities and the contemporaneous
white flight to suburbia, coupled with the effects of
postwar economic restructuring and deindustrialization. 

Most interesting is the way Kraus brings both questions 
together. He not only argues for the significance of politics in 
explaining the formation and durability of the American 
ghetto but also emphasizes “the decisive influence of local 
politics,” that is, the struggle over community power (p. 7, 
emphasis added). Rather than focus solely on macrolevel 

Kraus attempts to demonstrate that concrete local 
public policy decisions must be included in any explanation of 
ghetto development in Buffalo and the plight of its African 
American poor. In this sense the book can be read as yet 
another work in the literature stream that emphasizes “local 
politics matters” in shaping urban outcomes. 

In making this claim, Kraus astutely draws attention to a
causal relationship often overlooked in studies of urban 
poverty. For this reason alone, his book makes a positive 
contribution, although his analysis could have been more 
penetrating. For example, Kraus makes the “local politics 
matters” argument by attempting to show that deindustrial- 
ization was not a significant factor in creating concentrated 
poverty in Buffalo because African Americans never had 
access to good industrial jobs in the first place (pp. 6-7, 
38-41, 221). But Kraus fails to take his localism argument far 
enough. Work on cities such as Baltimore and Chicago 
reveals that deindustrialization was not simply the result of 
macroeconomic change; it also was driven in part by local 
policy decisions to pursue economic strategies built around 
downtown real estate development and tourism rather than 
preservation of the manufacturing base. Therefore, one can 
argue that local decisions mattered deeply in worsening the 
problem of concentrated poverty and acknowledge that
deindustrialization was a significant cause of concentrated
poverty. But doing so requires an appreciation of how urban
economic processes are far from beyond the influence and
control of local politics. 

Chapters 4 through 8 provide a highly detailed account of 
Buffalo politics from the mid-1930s to the late 1990s. It is
presented in a chronological, event-by-event fashion, and 
serves as the empirical data for the study. As a telling of local 
history, this section of the book (roughly two-thirds of the 
total pages) works reasonably well. Those with little interest 
in Buffalo politics may lack the patience for the level of detail 
presented, however. In addition, the careful description of 
historical events could have been placed in a much tighter 
analytical framework and tied much more closely and rigor- 
ously to Kraus’s theoretical propositions regarding commu- 
nity power and the development of the ghetto. 

In the final chapter, Kraus discusses the evils of segrega- 
tion and concentrated poverty and makes some gestures 
toward policy solutions. He also reiterates many of his claims 
regarding the nature of community power, such as the 
continuing importance of pluralist theory for understanding 
city politics. In both discussions Kraus nicely draws on the 
Buffalo case to illustrate his main analytic points, but these 
concluding sections would have benefited considerably from 
a more thorough and systematic treatment. Especially 
needed is a fuller demonstration of how the empirical case 
supports the validity of his rather provocative theoretical 
propositions. This chapter is a mere ten pages and could have 
been extended and deepened to strengthen the book’s overall 
significance and contribution. 

Despite many flaws, this book is well written and addresses 
meaningful urban issues. Moreover, unlike numerous other 
American cities of similar size, such as Cleveland, Atlanta, 
Pittsburgh, St. Louis, and New Haven, the Buffalo political 
story has “not really been told” (p. xi). This book is a 
welcome addition to the academic literature on the politics of 
American cities. 


Crafting Law on the Supreme Court. By Forrest Maltzman, 
James F. Spriggs II, and Paul J. Wahlbeck. Cambridge:
Cambridge University Press, 2000. 206p. $49.95 cloth, 
$17.95 paper. 


Cornell W. Clayton, Washington State University 


Important contributions in the social sciences are usually 
made in one of two ways. Either they tell us something we
previously did not know, or they verify, in a systematic way, 
something that we did. Crafting Law on the Supreme Court 
falls into the latter category. The premise of this book is 
remarkably simple. The authors argue that decision making 
on the Supreme Court is interactive; justices consider the 
views of their colleagues when writing their opinions. Justices 
engage in the “collegial game” of bargaining and compromise 
because they are constrained from unilaterally writing their 
individual policy preferences into law by a set of institutional 
rules, such as the need to attract five votes for a majority, 
which make the Court a collegial institution. 

The collegial character of the Court is well known. The 
authors quote Chief Justice Rehnquist, who candidly admits 
that, in the process of producing opinions, “give and take is 
inevitable, and doctrinal purity may be muddied in the 
process” (p. 152). Indeed, even among political scientists, 
who usually are slow to recognize the obvious, the interactive 
nature of opinion writing was thoroughly discussed nearly
forty years ago in Walter Murphy’s classic, Elements of 
Judicial Strategy (1964). What makes this book a crucial 
contribution to the literature is not its premise but its use of 
a sophisticated method and innovative database to test a 
multivariate model of interactive decision making. Specifically,
the authors seek to identify systematically “what factors
affect whether, when, and to what extent justices decide to
bargain, negotiate, or compromise in the process of writing
opinions” (p. 25).

In the first chapter the authors present a model of decision
making based on two central postulates: (1) Justices prefer
legal rules that comport with their policy goals, and (2) given 
the Court’s institutional rules, justices often act strategically 
to secure their policy preferences (p. 17). The first postulate 
draws on the work of attitudinal scholarship, especially that
of Jeffrey Segal and Harold Spaeth (The Supreme Court and 
the Attitudinal Model, 1993); the second builds on scholarship
in the positive theory of institutions, or the “strategic choice” 
tradition, especially the work of Walter Murphy as well as 
more recent scholars, such as Lee Epstein and Jack Knight 
(The Choices Justices Make, 1998). To test their postulates, 
the authors created a database of all cases that received full 
disposition opinions during the Burger Court (1969-86).
Using the files of Justice William Brennan, the authors culled 
data from assignment sheets, docket sheets, and circulation 
records (such as draft opinions and memoranda distributed 
to the justices generally) about how the justices interact when 
drafting and circulating opinions.

In the four chapters at the heart of the book the authors 
test hypotheses drawn from the two central postulates and 
related to four distinct stages of the process: the initial assign- 
ment of the majority opinion, the response to draft opinions, the 
author’s subsequent reaction to those responses, and the decision
whether to join an opinion (i.e., the politics of coalition
formation). Their analysis finds strong support for the strategic, 
interactive model of decision making during each stage. Justices 
constantly engage in bargaining and compromise over the final 
content of the Court’s opinions. In a short concluding chapter 
the authors summarize their findings and comment briefly on 
the scientific study of courts. 

Few of the substantive findings in this study are intended to 
challenge conventional wisdom. For example, the authors 
find that justices are less willing to accommodate the views of 
potential joiners to an opinion once they have secured a 
majority. Likewise, the Chief Justice, who has institutional 
responsibilities for the Court as a whole, appears to be more 
sensitive to questions of equitable work load and Court 
efficiency when assigning opinions than are assigning associ- 
ate justices. There are no major surprises here. The signifi- 
cance of this book lies in its “scientific” or quantitative testing 
of the interactive model of decision making. The method- 
ological work is first rate. The hypothesized relationships are 
clearly specified and conceptualized. The development and 
use of the data are ingenious. The analysis is thorough and
rigorous. The empirical findings are robust and conclusive 
with respect to what they measure. 

One minor quibble has to do with the type of research
design compromises that data constraints impose on this type 
of study. Specifically, in order to understand how institutional 
structures shape judicial behavior, it would be useful to have 
more variation of those structures across time. For example, 
how do we know that Burger's assigning patterns stem from 
his institutional role as Chief Justice, as the authors assume, 
rather than his own idiosyncratic or attitudinal concerns 
about Court efficiency? It is possible that Burger would have 
demonstrated similar assigning patterns even if he were not 
Chief Justice. Of course, the authors’ assumption is perfectly 
plausible and in fact quite likely. The point is that we cannot 
know with an N of 1. Still, this criticism, which relates to
broader problems of how quantitative political science has 
come to conceptualize the relationship between research 
methods and problems, should not distract from what the 
authors have accomplished with the data available. They have
told us more about the systematic dynamics of opinion
writing on the Court than any previous study. 

The book stands either as a formidable challenge to or a 
major bridge between two separate strands of research on 
Supreme Court decision making. Attitudinal scholars have 
long claimed that the process is driven by the justices’ 
individual policy preferences or “attitudes” alone. Most in 
this camp, however, only claim that individual preferences 
determine the final vote on a case’s disposition, and that 
nonattitudinal variables may operate in other areas of Su- 
preme Court decision making (see Harold Spaeth, “The 
Attitudinal Model,” in Lee Epstein, ed., Contemplating 
Courts, 1995, p. 314). The authors view their research as a 
bridge to attitudinal scholarship because they find abundant 
evidence that individual “preferences shape behavior beyond 
the final vote” (p. 151). Nevertheless, their powerful empir- 
ical evidence that institutional variables condition Supreme 
Court behavior generally is a challenge to many of the
theoretical assumptions that underlie attitudinal research. 

The authors also argue that their approach is a bridge to 
historical-institutional or “traditional legal scholarship” on 
Supreme Court decision making (p. 151). This literature 
tends to focus on the historically evolving normative institu- 
tions and symbols that structure judicial decision making. 
Legal doctrines, judicial role conceptions, and other norma- 
tive institutions that define “good judging” are thought to 
have a constitutive influence on judges, shaping their values 
and attitudes (Howard Gillman and Cornell Clayton, eds., 
The Supreme Court in American Politics: New Institutionalist 
Interpretations, 1999). The authors of Crafting Law present
strong empirical evidence that justices are concerned about 
the content of legal opinions, not just the disposition or 
outcome of the case, as generally suggested by attitudinalists. 
This is an important finding for historical-institutionalists and 
supports the proposition that legal doctrines matter very 
much to the justices. Yet, the authors argue that justices care 
about the content of opinions for strategic reasons only, 
which rejects the core historical-institutionalist position that 
judges are often normatively driven actors and favors the 
position that judges are simple policy maximizers self-consciously
using legal doctrines instrumentally. Whether this
assumption can be tested using positivist methods is
the subject of intense debate within the field (see “Sympo- 
sium: The Supreme Court and the Attitudinal Model,” Law 
and Courts Newsletter 4 [Spring 1994]: 3-12). As in Segal and 
Spaeth’s landmark study on Court decision making, many of 
the findings here with regard to the ideological motivation of 
the justices, not to mention the central conclusion that 
justices care about the content of Court opinions, may be 
perfectly consistent with a “legal” or normatively driven 
institutional explanation of their behavior. The authors ap- 
pear to reject this possibility in order to adopt the more 
narrow attitudinalist explanation for judicial motivation.
Consequently, this study has the potential to be a valuable 
link to broader institutionalist work, but its key theoretical 
and methodological assumptions probably serve as a surer 
bridge to traditional behavioralist research on the Court. 

This book is a major contribution to our understanding of 
Supreme Court politics. It is a significant expansion of 
empirical-quantitative studies of the Court, moving beyond 
the typically narrow focus on judicial voting patterns. Simul- 
taneously, it is a major step toward systematically verifying 
the important role that institutional structures play in shaping 
judicial politics generally. There is enough empirical fodder 
here to provoke and console scholars of all theoretical
persuasions. The authors do a superb job of drawing insights
from disparate theoretical traditions and melding them into a
conceptually sound, parsimonious, and rigorous model. Their 
book is an important milestone in research on judicial 
decision making and commands the attention of all serious 
students of Supreme Court politics. 


Affective Intelligence and Political Judgment. By George E. 
Marcus, W. Russell Neuman, and Michael MacKuen. 
Chicago: University of Chicago Press, 2000. 199p. $42.00 
cloth, $15.00 paper. 

Mark R. Joslyn, University of Kansas 

Perhaps it is ironic that a book about rational judgment is
entitled Affective Intelligence. This is a rousing and provoking
effort, with an unmistakable double agenda. One objective is
to dispute conventional wisdom regarding the relationship
between affect and reason, especially the rational choice
vision of political judgment. A second is to establish a more
prominent role for emotions in models of political behavior:
“The scientific community might fruitfully devote more attention
to the idea that emotions are centrally important to
political behavior” (p. 129). The two objectives are linked. By
demonstrating that emotions influence how and about what
we think, and what we do, the authors effectively challenge a
prevailing disposition of the research community to set aside
the complexities of emotional dynamics or to conceive of
emotions as distorting agents in judgmental processes.
Rational choice is the proverbial straw man in this regard,
because it generally defines human affect as endogenous.
Marcus, Neuman, and MacKuen argue persuasively that
emotional engagement motivates individuals to think more
deeply about political affairs and thus enhances rational
choice.

The first half of the book is devoted to the first objective, 
which details the traditional understanding of emotions: We 
think first and then feel. Moreover, emotion and cognition 
are thought to work in opposition. A passionate citizen is 
unlikely to be a deliberate one. Emotions appeal to the heart, 
not the mind. Against this backdrop of scholarly consensus, 
the authors posit that emotions activate reasoned consider- 
ation. Emotions draw greater attention to environmental 
circumstances, which allows people to process information 
required for a rational choice. The authors invoke Herbert 
Simon: “The human organism living in a demanding environ- 
ment requires an interrupt mechanism to redirect human 
attention to higher priority real-time needs” (p. 7). This
“interrupt mechanism” is key to Affective Intelligence. 

Drawing heavily from recent literature in the neuro- 
sciences, the authors advance a surprisingly straightforward 
picture of affect-driven judgments. We possess two emotional
subsystems that enable us to rely on habit and to activate
reasoned consideration, each in the appropriate circum- 
stances. These subsystems are labeled disposition and sur- 
veillance; the former manages our reliance on habits, and the 
latter identifies circumstances that require a shift from habit 
to reasoned consideration. The disposition system monitors 
the relationship between our behavior and its expected 
execution. If behavior matches expectations, emotions of 
satisfaction and enthusiasm are forthcoming, and the behav- 
ior is reinforced and ultimately becomes habitual. By con- 
trast, the surveillance system monitors the environment for 
novel or threatening stimuli. When such a stimulus appears, 
emotions of anxiety and unease arise, reliance upon
habitual routines declines, and individuals are more likely to 
attend to the relevant stimulus. Anxiety, like Simon’s inter- 
rupt mechanism, triggers human attention to threatening 
elements in the environment. Both subsystems are subconscious
emotional responses, and surveillance has clear importance
for evolutionary survival.

Various political events and symbols trigger the surveil- 
lance system, but an obvious threat comes from the economy. 
The authors demonstrate that during hard economic times it 
is supportive partisans who are likely to feel anxiety about 
their president’s poor economic record. A poor record 
threatens their habitual partisan views, and their surveillance 
system responds with anxiety. By contrast, opponents are
unmoved, as they are neither threatened nor necessarily 
surprised by events. Anxious citizens are more likely to show 
interest in campaigns, acquire knowledge about politics, and 
participate in politics beyond voting. More profoundly, the 
authors demonstrate that anxiety about candidates leads 
individuals to deviate from their partisan habits and rely 
more heavily on policy proposals and personal qualities of 
candidates. This effect of anxiety is neither trivial nor exclusive
to the domain of voting. The authors offer confirming
evidence with judgments about the Gulf War and NAFTA. In 
each case a perceived threat in the political environment 
alters the nature of people’s attention to that environment;
consequently, habitual judgments are subject to change. 
Emotions, especially anxiety, “enhance rationality because 
they allow citizens to condition their political judgment to fit 
the circumstances” (p. 124). 

Affective Intelligence is a compelling and challenging work, 
but there are several points with which one may quibble. 
First, the interpretation of what is rational may be called into 
question. Although rationality may assume various forms, 
often at the behest of the researcher, reliance on partisan 
cues can in fact be considered a rational strategy. Starting 
from Anthony Downs’s classic work, a large and significant 
literature has demonstrated the “rationality” of cue-taking, 
information short-cuts, or the use of heuristics, in which 
partisan identification plays a key role (see Arthur Lupia and 
Mathew McCubbins, The Democratic Dilemma, 1998). By 
contrast, Affective Intelligence defines rationality as a depar- 
ture from partisan tendencies. Is it necessarily more rational 
to use issue stances and personal qualities rather than party 
identification as criteria for candidate choice? Among anx- 
ious voters, Affective Intelligence says “yes.” 

The reader also may be struck by the causal connections 
crucial to the affective intelligence thesis. Notwithstanding 
the forceful arguments, it is plausible that cognitive aware- 
ness of campaign dynamics, perceived differences with one’s 
partisan choice, or simple acquisition of knowledge may 
trigger the surveillance system. As in the case of a threatening 
economy (p. 79), anxiety may be a consequence rather than 
an antecedent of cognitive awareness. We thus may first 
consider the economy and then respond with an appropriate 
emotion. The authors’ reliance on pooled cross-sectional 
ANES data does little to clarify such important issues of 
causal inference. Furthermore, when dynamics are explored 
using the 1980 ANES panel, the authors’ findings are less
persuasive.

I ponder the authors’ assumption of linearity between 
anxiety and the various dependent variables used throughout 
the book: campaign interest, political knowledge, and the 
like. Without a defense of this assumption, a nonlinear 
hypothesis appears equally reasonable. As anxiety increases, 
the capacity to attend and learn may diminish. Similarly, 
moderate levels of anxiety may produce participatory activity, 
but high anxiety may well demobilize. One only needs to 
observe undergraduates at semester’s end to find supporting 
anecdotal evidence. Yet, considerable detail on measurement 
strategy is found in the two appendices, which will undoubtedly
be important for replication and future research.

I am impressed by the magnitude of the argument and the 
force with which it is presented. Affective Intelligence speaks 
to a broad cross-section of political scientists; it has implica- 
tions not only for campaigns, voting, and political learning 
but also for the myriad political topics that potentially 
address human emotions. But the implications for political 
learning are perhaps the most compelling. If, in fact, danger, 
threat, and novelty move people to learn and think more 
critically about political choices that confront them, then the 
character and intensity of present-day political communica- 
tions, and indeed campaigns, is perhaps not surprising. 
Inflammatory media, negative ads, and alarmist appeals 
made by various interests will continue to draw the ire of 
reform-minded citizens and elites alike, but Affective Intelli- 
gence offers a clear motive for their frequency and occasional 
success. In spite of what we may initially believe, the evidence 
in this book suggests that a more threatening and dangerous 
political information climate will produce an electorate much 
closer to the ideals of citizenship—in terms of interest, 
knowledge, and participation— embodied in classic demo- 
cratic theory. 

Although confident in their thesis, the authors are never- 
theless cautious and strike an appropriate balance. The fine 
distinction between conviction and prudence makes this book 
a pleasure to read and a must for those seeking the cutting 
edge in political behavior. The mechanistic models of human 
cognition have clearly influenced the field of public opinion, 
but it now appears that motivated cognition must be recog- 
nized in order to grasp fully the complexities of political 
choice. When the rational choice school responds to this 
book, I suspect it will be because Affective Intelligence awakened
an emotion within.


Governing Race: Policy, Process, and the Politics of Race. By 
Nina M. Moore. Westport, CT: Praeger, 2000. 248p. 
$65.00. 


Francine Sanders Romero, University of Texas 
at San Antonio 


Scholars who study civil rights policies face an inherent 
disadvantage. This chapter of American politics is very 
complex, involving elements of legislative and judicial behav- 
ior, federalism, public opinion, social movements, and so on, 
and works on this topic are often expected to cover too much 
ground. The most feasible way to approach the subject is one 
piece at a time, and Nina Moore ably explores one crucial 
component of this complex story. Governing Race examines 
the divisive effects of race on U.S. Senate consideration of 
civil rights bills that
ultimately passed. 


Moore’s premise is that issues of race are essentially
ungovernable. Specifically, normal parliamentary procedures 
have not prevailed in the consideration of civil rights legisla- 
tion by the modern Senate. The process is marked by 
abnormal procedures: obstructionism, Byzantine parliamentary
maneuvers, and eventual concessions. All this is due to
deep regional and partisan divisions over the need to create 
and preserve civil rights protections. 

The book is clearly written, nicely organized, and easy to 
follow. At its core are four similarly structured chapters that 
trace three continuing threads—politics, procedures, and 
policy outcomes—from the late 1950s through the early 
1990s. The concluding chapter offers additional empirical
support for the assertions regarding each theme. The overall 
story is that the politics of race has been and continues to be 
divisive, on both a regional (strong at the beginning and 
weaker over time) and partisan (weaker at the beginning and 
stronger over time) basis. As a result, Senate deliberations 
have been uniquely abnormal. Civil rights advocates were 
fought at every turn by opponents and had to compromise in 
order to gain the support of uncommitted senators (usually 
moderate Republicans). These concessions were of necessity 
significant and resulted in policy outcomes far weaker than 
originally intended. 

The strength of Governing Race is its cogent portrayal of 
civil rights legislative battles not as historical anecdotes but as 
a continuing process marked by notable consistencies. The 
material may not be new, but it is presented in a manner that 
makes the broad picture more apparent than previously. For
example, the crucial role played by Republican moderates, 
the importance of the cloture rule, and the frequent bargain- 
ing down of enforcement mechanisms from the administra- 
tive to the judicial level (what Moore terms “constitutional 
ted emerge as constant and meaningful character- 
istics of this issue and venue over time. Moore provides an 
accurate and useful narrative framework for the study of civil 
rights deliberation in the U.S. Senate. On this level, the book
is quite effective and will be a boon to professors interested in 
teaching a civil rights policy class from an actual political 
science perspective, as opposed to the largely historical 
orientation of most works on this topic. 

Moore makes broader claims for the analysis, and on this 
plane she is less successful. In particular, the assertion that 
race is an ungovernable issue is not fully sustained. First, the 
argument that this analysis of thirteen civil rights bills in the 
U.S. Senate is generalizable to the overall governance of 
racial matters is unconvincing. Despite her declarations to 
the contrary, the Senate is too distinct for this claim to be
made. Other institutions do manage nonconsensual issues in 
a more efficient way. The author’s fallback position, that
what happens in the Senate is meaningful simply because that 
body must approve all potential laws, is much more reason- 
able, and the entire book would have benefited from relying 
on that more moderate justification for the importance of this 
setting. 

A second concern is a lack of convincing evidence that the 
procedures associated with civil rights bills in the Senate are 
in fact “abnormal.” Not enough attention is paid to such 
works as Barbara Sinclair’s Unorthodox Lawmaking (1997),
which suggests that putatively unusual procedures have be- 
come much more commonplace. In addition, the data mus- 
tered in support of the central argument are weak. For 
example, the final chapter presents figures to demonstrate 
that typical civil rights bills are more likely than average bills 
to be marked by numerous quorum and roll calls, longer 
debates, and so on. It is not clear what is counted as an 
average bill, but given the argument that civil rights deliber- 
ation is unique, a better comparison would have been to 
other contentious issues. That is, “average” bills may include 
a large number of fairly inconsequential measures and thus 
may be skewed toward a nonrepresentative level of consen- 
sus. Further confusing this argument is the imprecise defini- 
tion of abnormal procedures. At one point, for instance, 
weakening amendments are referred to as an example of 
traditional, normal deliberation. Later, the large number of 
substantive amendments associated with civil rights bills is
cited as evidence of abnormal processes. 

The purported divisiveness of the race issue, although 
plausible, is not fully explored. For example, there is no 
discussion of meaningful variation within the civil rights
policy realm. Certainly, race is a divisive issue, but some 
questions are far more controversial than others, and Moore 
ignores the literature that suggests this. In addition, Moore 
never comes to terms with an ever-present foil to this 
argument. Despite her insistence on some endemic feature of 
race as the key to dissent in the Senate, she cites several 
examples in which uncommitted senators insisted on substan- 
tial concessions in return for their support, not because they 
objected to civil rights gains per se, but because of the 
increased federal role the original bills represented. Thus, the 
asserted ubiquitous lack of consensus on racial issues never 
becomes fully apparent. 

Overall, Race is a useful approach, with a 
political science orientation, to one significant aspect of civil 
rights policymaking in the United States. It does not 
completely achieve its stated goals, but scholars and 
students alike will learn from it and will gain a new understanding 
of civil rights policy creation as a systematically 
difficult process. 


Before Roe: Abortion Policy in the States. By Rosemary 
Nossiff. Philadelphia, PA: Temple University Press, 2001. 
195p. $69.50 cloth, $21.95 paper. 


Raymond Tatalovich, Loyola University Chicago 
Twenty years ago, when I coauthored a case study of the 
abortion controversy (Raymond Tatalovich and Byron W. 
Daynes, The Politics of Abortion, 1981), few books tackled 
that subject, but in the years since there has been an 
overabundance of books on abortion. What immediately 
comes to mind is: Why yet another one? Detailed case studies 
of pre-Roe abortion politics have been published on five 
states (California, Georgia, Hawaii, North Carolina, and 
Texas), and adding New York and Pennsylvania hardly seems 
enough justification for this book. The more important 
rationale for Nossiff is that these neighboring states are 
similar demographically (except for the Jewish presence in 
New York), but their abortion policies before the 1973 
landmark ruling diverged sharply. New York enacted the 
most permissive abortion-on-demand statute in the period 
before 1973, and Pennsylvania tightened a restrictive but 
somewhat ambiguous statute. 

Nossiff's objective is to analyze the interactions among the 
Catholic conferences, political parties, and interest groups at 
the mass and elite levels, based on the framing strategies by 
ideological adversaries, group mobilization, the antagonists’ 
political resources, and most important the permeability of 
the party system. To veteran observers of abortion politics, 
her chronicle of events will be less interesting than her 
theoretical packaging of this political story. Although 
prochoice won in New York and prolife prevailed in Penn- 
sylvania, Nossiff finds parallels insofar as the victors in both 
states fully exploited their political advantages. 

Discourse looms large as an explanatory factor. For exam- 
ple, New York advocates shifted the framing of abortion 
from a medical problem to one of women’s rights and, in the 
process, became one with the newly emerging women’s 
movement. The Catholic opposition remained rigidly antiabortion 
in its framing of the issue, with the result that in 
New York it could not embrace therapeutic reforms to head 
off demands for repeal. One troubling aspect of viewing 
discourse in purely instrumental terms is that the reader may 
forget that, for single-issue activists and most assuredly 
people who are religious, ideas have worth, and they bring an 
intensity of commitment to moral conflicts that sets issues 
like abortion apart from the rest. 

Why did Catholic intransigence pay off in Pennsylvania but 
falter in New York? The answer is the party system— 
Nossiff's critical variable—or more precisely the Democratic 
Party. Abortion pitted reform against machine Democrats in 
both states. But the Pennsylvania machine Democrats, in 
league with the Catholic Church, regained their political 
hegemony against reformers who operated in the elitist 
tradition of old Philadelphia, whereas in New York City a 
grassroots slugfest between factions gave rise to a strong 
reformist contingent within the statewide Democratic Party. 
Borrowing from social movement theorists, Nossiff argues 
that the changed political opportunity structure, manifested 
as intraparty rivalry, allowed the proabortionists to gain 
policy leverage and ultimately prevail within the New York 
legislature. 

Nossiff states that recent cross-sectional analysis of pre-Roe 
abortion policies in the fifty states neglects intraparty com- 
petition (that empirical analysis indicates party competition 
was negatively related to state abortion reform). Yet, I 
wonder whether New York and Pennsylvania were not both 
outliers in the politics of abortion at the time. Most of the 
early abortion reform states were southern, and conservative 
Democrats still reigned unchallenged, although they saw fit 
to enact therapeutic reforms at a time when most northern 
two-party states did not. Furthermore, just how many states, 
North or South, were characterized by machine versus reform 
cleavages within one of the major political parties? In other 
words, Nossiff gives detailed accounts of what transpired in 
these two jurisdictions, but her key insight has limited 
relevance to the dynamics of abortion politics among the 
fourteen reformed states (ignored in her count is Mississippi, 
perhaps because its reform only extended to abortions for 
rape) and the four more states, including New York, that 
repealed antiabortion statutes. 

The final chapter on the aftermath of Roe is a disappointment. 
We all know that Pennsylvania is a leader among the 
antiabortion states, so presumably Nossiff could have applied 
the same four theoretical prisms to explaining why each of 
the several post-Roe restrictive abortion laws by Pennsylvania 
found its way to the Supreme Court. One instance would be 
especially telling, namely, the fact that prochoice activists 
pressed their appeal in Casey vs. Planned Parenthood of 
Southeastern Pennsylvania (1992) for reasons (I believe) more 
related to their own mobilization needs than to any dispas- 
sionate assessment of their odds of judicial victory. Had 
Nossiff pursued this line of inquiry, she would have explored 
an important caveat to this literature: There are trade-offs 
between the choices defined by the political opportunity 
structure and the organizational needs of interests that 
pursue essentially altruistic ends. Absent that perspective, we 
are left with basically two more case studies in our inventory 
of pre-Roe abortion policies. 

Nonetheless, this nicely crafted and extremely readable 
book should be considered for courses on morality policy, 
women’s issues, and even (state) party politics. 


Filled with Spirit and Power: Protestant Clergy in Politics. 
By Laura R. Olson. Albany: State University of New York 
Press, 2000. 174p. $57.50 cloth, $18.95 paper. 


Paul J. Weber, University of Louisville 


Laura Olson is one of a small but energetic and influential 
group of Christian political scientists determined to bring 
rigorous quantitative and qualitative methods to their study 
of religion and politics. This book fits well within that genre. 
It is based on interviews with forty-six Protestant pastors in 
the Milwaukee area between June 10 and August 16, 1994. 
An equal number of mainline, evangelical, and African 
American clergy were interviewed, and Olson provides an 
intimate study of how each views his or her social or political 
ministry. 

In one sense this is a very simple, straightforward book. It 
explores who among these pastors gets involved in politics, 
how they get involved, and why. But the simplicity is decep- 
tive. Olson writes clearly and often allows the pastors to 
speak for themselves, faithfully recording their insights and 
opinions. She then ties their views into more traditional 
theories of political participation and the broader religion 
and politics literature. The result is a readable, insightful 
analysis that fills a gap in that literature. Some common sense 
beliefs about clergy in politics are confirmed; others are 
contradicted. 

Olson divides the respondents into three categories based 
on their attitudes toward the involvement of clergy in politics, 
that is, how they see political activism as part of their 
institutional role. The disengaged are deliberately apolitical; 
agenda-setters set the tone for their congregation and make 
suggestions about pertinent topics without getting personally 
involved; and political leaders are politically active in one 
form or another and see that engagement as an integral part 
of their religious role. 

The comparison of the three types provides the bulk of the 
insights. The central finding is that “in an urban setting the 
choices pastors make about political involvement are shaped 
in a profound way by the socioeconomic characteristics of the 
neighborhood in which they serve” (p. 3). Those in 
areas with high crime, unemployment, and need for social 
services are far more likely to be politically active on behalf 
of their parishioners and neighbors than are those in more 
secure and stable neighborhoods. 

Some of the findings are predictable. For example, “per- 
sonal resources” count somewhat; that is, pastors whose 
theology is influenced by the social gospel strain in American 
Protestantism and who feel a sense of personal efficacy in the 
political arena are more likely to be active. Those whose 
theology is more focused on personal salvation or who feel 
less efficacious are less active. I was surprised by two findings. 
Despite widespread media attention to political activism of 
the religious Right, evangelical clergy are less likely to be 
politically active than their mainline and African American 
colleagues. Also, denominational affiliation (and presumably 
related theological stances) is far less important as a predictor 
than the socioeconomic need of the communities served 
by the pastors. 

Abortion is one area in which denominational affiliation is 
very significant. Mainline pastors are overwhelmingly 
prochoice, whereas evangelical and African American clergy 
are clearly prolife. Still, despite its enduring legacy as a 
politically divisive issue, a majority of respondents did not 
rank abortion among their most salient political concerns. On 
the issues of crime and violence there is a clear split among 
types. Pastors who are political leaders consider this a highly 
salient issue; the disengaged and agenda-setters do not. The 
disengaged are likely to point to societal causes of crime, and 
agenda-setters tend to attribute crime and violence to per- 
sonal failure. 

When asked whether the “family values” debate is an 
important political issue or simply a rhetorical wedge, the 
clergy expressed some sharp views. Those who do not consider 
the debate politically legitimate called it either racist or 
sexist. Yet, somewhat surprisingly, African American pastors 
held the most consistently conservative views on family 
values, although they also saw the connections among crime, 
violence, and the deterioration of the family. 

Within the author’s intentionally limited scope, this is an 
excellent study, but one should be cautious about generaliz- 
ing. Forty-six clergy do not create a critical mass, and the 
author’s liberal use of percentages, gammas, and chi-squares 
can give a false impression of statistical significance. Also, 
Milwaukee is a heavily Catholic city with a significant Jewish 
presence as well, and the exclusion of these groups makes this 
a less than comprehensive study of clerical activism even in 
Milwaukee. But Olson makes no such pretension, choosing 
in-depth interviews rather than broad coverage. Taken on its 
own terms, this is a valuable contribution to religion and 
politics literature. 


The Politics of Gay Rights. Edited by Craig A. Rimmerman, 
Kenneth D. Wald, and Clyde Wilcox. Chicago: University 
of Chicago Press, 2000. 460p. $68.00 cloth, $19.00 paper. 

Charles W. Gossett, Georgia Southern University 


It is a rare book that educates even when unopened. The 
Politics of Gay Rights is one of them. For a small lesson in 
attitudes toward lesbians and gay men in America, I recom- 
mend reading this book on an airplane, in a coffee shop, or in 
some other public setting. The reader may not encounter any 
direct comments from others, but there will be the occasional 
raised eyebrow or knowing smile; these and the reader's 
internalized debate about whether to turn the cover up or 
down when setting the book aside make one acutely aware of 
the controversial nature of the subject. 

Kenneth Wald’s introductory chapter helps place the study 
of lesbian and gay politics in the broader context of political 
science as a whole, and the subsequent chapters use a wide 
variety of analytic tools and approaches that reinforce the 
introductory chapter. The editors separate the essays into 
four sections. 

Section I, “The Gay Movement,” provides an historical 
overview of the development of gay politics (by historian 
John D’Emilio) and political organizations (Craig Rimmer- 
man), primarily since World War I, as well as an interpre- 
tation of the political movement from the viewpoint of 
African Americans (Keith Boykin, formerly executive direc- 
tor of the National Black Gay and Lesbian Leadership 
Forum) and women (Jean Reith Schroedel and Pamela 
Fiber). 

Section II, “The Opposition,” looks at the religious Right’s 
opposition to “the gay agenda” in a nuanced analysis of the 
political motivations and strategies used (John Green) and in 
a polemical assault on the religious foundations of gay rights 
opponents (Didi Hermann). A third chapter in this section 
examines the role played by citizen initiatives and popular 
referenda and concludes that, for the most part, such activi- 
ties serve to preserve the status quo rather than cause 
dramatic change in either a pro- or antigay direction (Todd 
Donovan, Jim Wenzel, and Shaun Bowler). 

Section III, “The Issues,” looks at three contemporary 
controversies in which the role of gay men and lesbians is 
central. The debates over same-sex marriages and employ- 
ment nondiscrimination are analyzed in terms of congres- 
sional responsiveness to public opinion (Gregory B. Lewis 
and Jonathon L. Edelson); the effect of AIDS on political 
participation, political institutions, and public policymaking 
as well as on the pattern of gay politics is presented (Mark 
Carl Rom); and the struggle over the issue of lesbians and gay 
men serving in the U.S. military is carefully explicated 
(Francine D’Amico). 

Section IV, “The Arenas,” examines politics at the differ- 
ent levels and in the different branches of American govern- 
ment. There are chapters on local government (James W. 
Button, Barbara A. Rienzo, and Kenneth D. Wald), state 
government (Donald Haider-Markel), Congress (Colton C. 
Campbell and Roger H. Davidson), the Supreme Court 
(Sarah Brewer, David Kaib, and Karen O’Connor), and on 
public opinion more broadly (Clyde Wilcox and Robin 
Wolpert). 

The editors have put together a useful collection that 
analyzes lesbian and gay rights in the United States from the 
angle of most political science subfields. The book probably 
will be most useful for an undergraduate class, because many 
of the articles are essentially summaries of longer works by 
the various contributors (updated to 1999), as opposed to 
original research. The contributors include both academics 
and activists, but all can be considered advocates for advanc- 
ing gay rights. Due to the quality of previous work of several 
authors and the relatively small number of scholars working 
in this area, earlier works by fellow contributors are often 
cited for ideas and support of the arguments. 

Several essays are updated summaries of works already on 
their way to being classics (e.g., the pieces by D’Emilio; 
Boykin; Hermann; Green; Donovan, Wenzel, and Bowler; 
D’Amico; and Button, Rienzo, and Wald). Others are exten- 
sions of work or part of a series of studies recently begun that 
may become books at some point in the future (e.g., the 
contributions by Rimmerman; Lewis and Edelson; Haider- 
Markel; and Wilcox and Wolpert). As a result, most of this 
work has been extensively reviewed and defended in other 
forums. 

In several instances, the authors could have made better 
use of the ideas expressed in their colleagues’ contributions 
to this volume. For example, John Green carefully distin- 
guishes among different strategies used by the Right, whereas 
Didi Hermann describes the Right as fairly monolithic. 
Likewise, Rimmerman’s article on strategies employed by 
lesbian and gay political organizations might have included 
some comparison to the Right’s strategies identified in other 
chapters. It would have been interesting to have Keith 
Boykin’s response to John D’Emilio’s questioning of the 
usefulness of “community building” as a strategy for advanc- 
ing gay rights in the twenty-first century. Could it be that the 
black lesbian and gay movement is on its own trajectory? 

A couple of the articles are frustrating because it seems the 
authors do not have a good understanding of some basic 
political facts of life. For example, Schroedel and Fiber take 
an interesting approach to surveying the policy priorities of 
lesbian and gay public officials in an effort to identify 
gender-based differences, but they neglect to control for the 
level of government or the nature of the political office 
occupied. Also, the collection would have benefited from at 
least two more chapters, one that places gay politics in the 
United States in a comparative international context, in 
Section I, and a chapter in Section IV that examines the effect 
of this issue on executive leadership and the bureaucracy. 

If you want students to gain a solid understanding of the 
current state of lesbian and gay politics, this book is a good 
choice. And they can learn almost as much just from carrying 
it around. 


Vol. 95, No. 4 


Theories of the Policy Process. Edited by Paul A. Sabatier. 
Boulder, CO: Westview, 1999. 289p. $70.00 cloth, $29.00 
paper. 

Stella Z. Theodoulou, California State University, Northridge 


Paul A. Sabatier contends that it is vital to find some way to 
simplify the policy process in order to understand fully the 
complexities of policymaking and the wide plethora of actors 
involved. He advocates a “multiple lens strategy” (p. 6). Such 
an approach requires researchers to apply several theoretical 
frameworks rather than only one. This allows a situation to 
be looked at from several different perspectives. Seven of 
what Sabatier terms the most “promising” theoretical frame- 
works (p. 7) developed over the last fifteen years are pre- 
sented in this volume by some of their leading proponents. 
Sabatier charged these authors with the task of synthesizing 
and evaluating a particular theoretical perspective that they 
believe best explains how public policy is made. Overall, the 
intention is to provide readers with better theories to under- 
stand the policy process in contemporary societies. 

The seven frameworks are organized in four sections 
around certain themes. The first section comprises an intro- 
ductory chapter by Sabatier and a second chapter by Peter 
deLeon that reviews the literature on the stages heuristic. 
Such a perspective explores the policy process by dividing it 
into consecutive stages, such as agenda setting, formulation, 
and implementation. The second section explores the role 
and effect of rationality in the policy process. Specifically, the 
institutional rational choice framework, as conceptualized by 
Elinor Ostrom, focuses on how institutional rules affect the 
behavior of assumed rational individuals motivated by self- 
interest. In one chapter Nikolaos Zachariadis examines poli- 
cymaking through a multiple streams and garbage can ap- 
proach, that is, three streams of actors and processes and the 
garbage can model of organizational behavior. 

The third section looks at policy change over time. In their 
discussion of punctuated equilibrium theory, James L. True, 
Bryan D. Jones, and Frank R. Baumgartner argue that 
policymaking is characterized by long periods of incremental 
change interspersed with brief periods of comprehensive and 
dramatic change. In the following chapter Sabatier and Hank 
Jenkins-Smith argue that policy change is best explained 
through an advocacy coalition model. This approach focuses 
on advocacy coalitions who share common beliefs and un- 
dertake some level of coordinated activity over time. 

In the fourth section a comparative perspective is provided 
by Frances Stokes Berry and William D. Berry and by 
William Blomquist. Berry and Berry discuss the policy inno- 
vation and diffusion framework, which explains policy varia- 
tions across political entities through analysis of political 
system characteristics and different diffusion processes. 
Blomquist examines the funnel of causality and large N 
comparative studies. This approach explains policy variances 
across states through focusing on such factors as public 
opinion, socioeconomic variables, and political institutions. 

The concluding section has two chapters, by Edella 
Schlager and Sabatier, that draw the preceding discussions 
together. Schlager does a good job of comparing all the 
frameworks and highlights their essential differences. Saba- 
tier discusses several strategies for advancing the state of 
policy theory. 

This collection is a challenging read and is for the more 
advanced student of the policy process. The discussion is too 
abstract and theoretical for readers without a reasonable 
level of knowledge of the field, but each chapter is well 
written and clearly structured. All the authors do a good job 
of presenting the evolution of each framework and discussing 
its underlying principles and propositions. Recent empirical 
evidence and reasoning are also covered. The collection 
provides a serious examination of the strengths and weak- 
nesses of each framework and offers suggestions for future 
developments. 

The greatest contribution of this volume is that it compiles 
in one place some of the most interesting theoretical frameworks 
in the policy process literature. Some might disagree 
with the choice of frameworks, but Sabatier summarizes 
several others that were judged less promising because they 
are not as complete as the seven selected. The strengths and 
limitations of each framework are specified, so readers can 
decide which “repertoire” of two or three frameworks that 
they want to employ in their work (p. 13). The volume gives 
a broadly based perspective of the policy process that allows 
for consideration of a multitude of ideas and issues. 

The volume’s shortcoming is its bias toward North Amer- 
ican literature and examples. Some noteworthy European 
research has been done over the same period. This criticism 
is a minor quibble, however, and without doubt this is an 
indispensable guide for those interested in the policy process 
and current theoretical developments. The volume is well 
organized and could be used to supplement a general public 
policy text in upper division and graduate courses. The essays 
are stimulating and useful and should be widely read by 
policy scientists. There is much in this volume to debate, 
which is clearly the editor’s mission in producing it. 


The Missing Middle: Working Families and the Future of 
American Social Policy. By Theda Skocpol. New York: 
W.W. Norton, 2000. 207p. $25.95. 


Benjamin I. Page, Northwestern University 


Theda Skocpol’s work on post-Civil War pensions, programs 
for mothers and children, the New Deal, the G.I. Bill, and the 
Clinton health plan has convinced her that it is possible to 
enact broad, generous social policies in the United States, but 
not on a European model. Her formula for success involves 
social alliances across class, race, and generation; the linking 
of benefits with contributions, especially work; central roles 
for voluntary associations; and the availability of reliable and 
expanding public revenues. 

In The Missing Middle, Skocpol articulately addresses con- 
temporary policy issues concerning the plight of low- to 
middle-income working parents and their children. She de- 
scribes the new world of highly stressed families with a single 
parent or two working parents, and she argues that U.S. 
government policies do reasonably well for the elderly but are 
quite incomplete or out of date with respect to parents and 
children. Skocpol offers some thoughts about why this is so 
and suggests policy changes to remedy the situation. 

Chapter 2 outlines cases in which broad social alliances 
and voluntary associations have achieved generous, inclusive 
social policies. It declares that we have veered off course by 
trying to help only the elderly and the very poor, which has 
alienated the middle of society and permits antigovernment 
forces to promote generational warfare, demonize “welfare,” 
and slash taxes so as to squeeze social spending. Chapter 3 
lauds Social Security and Medicare as inclusive and highly 
effective programs, notes the still-precarious situation of 
many oldsters, and maintains that nationally shared values 
and interests, rather than sheer “gray power,” explain the 
potency of these programs. 

Particularly useful is chapter 4, which offers a concise but 
well-documented introduction to the stark social realities of 
many U.S. children and their overstretched parents, who earn 
low wages, have inadequate day care or health coverage, and 
can find scant time for parenting. It argues that U.S. tax, 
labor, and social policies not only offer little help but also are 
designed as if every family still can and should include a 
spouse staying at home. 

Skocpol offers a series of policy suggestions for a “family- 
friendly” America, including G.I. Bill type education and 
training loans from Social Security surpluses; assured child 
support (paid by absent parents plus general tax revenues); 
extension of Medicare to younger and younger groups; and 
expansions in earned income tax credits, the minimum wage, 
family leave, and day care. To fund this she would make the 
income tax more progressive, contrary to the recent regres- 
sive cuts, and remove the “cap” that exempts high incomes 
from payroll taxes. She maintains that Americans will support 
and enact such a program if a family-oriented moral vision is 
forwarded in national debates, if the Democratic Party is 
revitalized from the grassroots, and if broadened member- 
ship groups of working parents pressure politicians. 

Skocpol’s diagnoses of problems are quite compelling, as 
are her policy proposals. Some progressives may dislike the 
critiques of liberal advocacy groups and the nods to conser- 
vatives, especially the assessment that single-parenthood 
tends to be bad for children (tempered, to be sure, by 
vigorous refutation of right-wing notions concerning just why 
U.S. families are fragile and what should be done about it). 
But her centrist, inclusive rhetoric is likely to be more 
effective in the United States than, say, overtly class-based 
appeals would be. 

The sketchiest sections of the book are those dealing with 
the political causes of present U.S. social policies and with 
political strategies for the future. The crucial role of racism as 
a tool for dividing working Americans is barely mentioned 
(e.g., pp. 51, 109). Oddly absent is the part played by sexism 
in policies unfriendly to single parents. More than in her 
previous work, Skocpol acknowledges the powerful role in 
American politics of big money, a “shrinking, class-tilted 
electorate,” and an “increasingly class-biased political uni- 
verse” (pp. 7-8, 54). But this analysis is not connected with 
specific policies or carried through into the discussion of 
political strategy. 

How, exactly, can grassroots movements (pp. 167-9) be 
formed, and how can they counter the major investors who 
fund, and often steer, the Democratic Party? Perhaps cam- 
paign finance reform deserves discussion. How, exactly, can 
“actual groups” of busy parents involved in day care centers, 
church groups, and after-school sports (p. 170) be brought 
together into a politically potent national civic association? 
And how can low-income citizens be mobilized to vote? 

Of course, we should not expect everything from this slim 
volume, which does accomplish a great deal. Its moral vision 
and policy ideas deserve a broad audience among the general 
public as well as political scientists and policy experts. 


Contemporary Controversies and the American Racial Di- 
vide. By Robert C. Smith and Richard Seltzer. Lanham, 
MD: Rowman & Littlefield, 2000. 157p. $65.00 cloth, 
$23.95 paper. 


Theodore J. Davis, University of Delaware 


Race continues to be a source for major cleavages in Amer- 
ican politics. Robert Smith and Richard Seltzer, two of the 
prolific scholars in the study of African Americans’ public 
opinion, have continually worked to enhance understanding 
of race, class, and culture as factors influencing that public 
opinion. In Contemporary Controversies and the American 
Racial Divide they continue that pursuit by examining race-based 
differences in public opinion. 

Smith and Seltzer make use of forty-seven surveys and 
polls in addition to the 1996 General Social Survey to analyze 
such topics as attitudes toward the Persian Gulf War, con- 
spiracy theories, several high-profile court cases involving 
prominent blacks, and individual perceptions of several lead- 
ing African Americans. To ascertain the magnitude of differ- 
ence in the opinions of blacks and whites, they describe 
variations in terms of gap (10-20%), gulf (20-40%), and 
chasm (plus 40%). 

The authors suggest that a full understanding of differ- 
ences in the public opinion of blacks and whites requires 
recognition of different histories, cultures, and material cir- 
cumstances between the two groups. They point out that the 
historical considerations essentially revolve around unequal 
treatment by the Constitution and by statutory law at all 
levels of government. Cultural considerations are based on 
partially distinctive attributes that constitute a black culture. 
They identify specific attributes that influence black public 
opinion: shared values among blacks, religiosity, cynicism, 
suspiciousness, and alienation. They note that one must keep 
in mind that culture influences black-on-black as well as 
black-white interaction. They also identify several social 
factors that define black culture, such as education, employ- 
ment, income, poverty, wealth, and residence. 

Chapter 2 focuses on politics, ideology, and partisanship. 
Smith and Seltzer indicate that race matters on racial issues, 
but they emphasize that significant differences among blacks 
on these issues should not be ignored. They found that 
differences between blacks and whites on nonpolitical issues 
are modest and sometimes nonexistent. The greatest differ- 
ence concerns ideology and partisanship. 

Chapter 3 examines the racial divide in public opinion on 
foreign policy matters. Smith and Seltzer note that little is 
known about African American attitudes on foreign policy, 
which they attribute to the neglect of blacks’ role in interna- 
tional affairs. They establish that significant differences be- 
tween the races on foreign policy issues are not as large as the 
differences on domestic issues. Overall, they conclude that 
structural and historical factors provide African Americans 
with different perspectives from which to frame attitudes on 
foreign policy and military matters. 

Chapter 4 probes views of blacks and whites toward three 
very different black leaders (Louis Farrakhan, Colin Powell, 
and Clarence Thomas). The aim is to examine how blacks 
and whites evaluate these men, who represent divergent 
ideological and institutional roles in society and in the black 
community. The objective is to go beyond the traditional 
examination of issues and ideology to explore perspectives on 
leadership. This is a particularly interesting approach in light 
of what Seltzer and Smith describe as the conundrum of 
post-civil rights era black leadership. 

Chapter 5 deals with racial differences in attitudes and 
opinions related to conspiracies and rumors of conspiracies. 
Smith and Seltzer justify this examination by noting that 
blacks tend to attribute much of their subordinate position in 
society to a white conspiracy. They conclude that blacks are 
more likely to believe in rumors and conspiracies than are 
whites. 

In chapter 6, Smith and Seltzer look at opinions of several 
high-profile cases and conclude that blacks and whites differ 
in their views of the justice system depending on their 
opinion about continuing racism within the system. 

Smith and Seltzer give those who are interested in studying 
the public opinion of African Americans a place to begin. 
There are no new revelations about black public opinion and 
the differences between the races, but the authors effectively 
bring together various aspects of the literature and provide a 
common explanation. In doing so they conclude that the 
formation of black political opinion must be understood in 
terms of the historical roots of slavery and its psychological, 
cultural, economic, and political manifestations in late-twen- 
tieth-century America. Furthermore, they note considerable 
consensus among blacks and whites on many issues, whatever 
the differences between the two groups as to the causes and 
solutions to racial inequality. 

Smith and Seltzer assert that their methods and data nicely 
address the framing problem because their study does not 
involve a single survey at a single point in time. Yet, the 
weakness of the study is the use of more than forty different 
surveys to analyze race opinion. Furthermore, the vast ma- 
jority were not designed to tap differences in the public 
opinion of blacks and whites. These studies enable the 
authors to ascertain differences in opinion but do little to 
advance Smith and Seltzer’s explanation of why such differ- 
ences exist. The samples were small in many instances, and 
the issues had greater local than national significance. This 
study illustrates the need for more comprehensive surveys 
specifically designed to probe the extent to which factors such 
as historical events, religion and culture, and perceptions of 
racism explain the American racial divide. Despite these 
shortfalls, the book is an excellent introduction to the study 
of the racial divide in public opinion. 
Renovating Politics in Contemporary Vietnam. By Zachary 
Abuza. Boulder, CO: Lynne Rienner, 2001. 271p. $52.00. 


David Elliott, Pomona College 


Zachary Abuza has written a valuable and timely study of 
Vietnamese politics. The book is rather narrowly centered 
around an examination of dissent, but that has important 
implications for the larger political sphere. Although the 
focus is on contemporary Vietnam, the author maintains that 
today’s dissidents cannot be properly understood without 
considering their historical antecedents: the protest of intel- 
lectuals in the mid- and late 1950s against party domination 
of literature and the arts, as well as the mid-1960s split within 
the party about the relationship between socialism and the 
war, the so-called Hoang Minh Chinh affair. The first, 
according to the author, not only provided a model and 
precedent for later dissidents but also established the subser- 
vience of all intellectual life to the dictates of the communist 
party. The second ended “democratic centralism” and toler- 
ance for debate even within the party, although the question 
of inner-party democracy resurfaced in the 1990s. 

Abuza details the various forms of contemporary dissi- 
dence in separate chapters on democratization and legaliza- 
tion, intellectual freedom and freedom of the press, the 
internal dissent of former southern guerrillas in the late 
1980s (the Club of Former Resistance Fighters) against the 
policies of a northern-dominated party central leadership, 
religious freedom, and internal dissent within the party. He 
concludes that the short-term effect of these various dissident 
voices is limited. 

Furthermore, most dissidents do not challenge the funda- 
mental structure of the regime but instead want more plu- 
ralism and debate within the party itself. The problem, Abuza 
argues, is that the party is unable to “renovate,” a term that 
became the watchword of the economic reform process 
started in 1986 but was never really applied to the political 
sphere. The author ascribes this immobilism, in part, to the 
fact that the party is structured by clientalism but operates on 
the principle of consensus. Moreover, the party’s continued 
monopoly on information has prevented accountability (p. 
238). This fosters a vicious cycle: Without being checked by a 
relatively independent press, corruption grows, which under- 
mines the legitimacy the party built up at such cost during its 
long independence struggle. 

Abuza sees some response from the Vietnamese authori- 
ties to external pressure on human rights, but “despite 
foreign pressure for political reform and a loosening of the 
VCP’s [Vietnamese Communist Party] hold on power, the 
pressure for democratization and political reform has come 
from within. Exogenous forces will continue to be more of an 
annoyance to the regime than a force for change. Pressure for 
political liberalization has to come from within the party 
itself” (p. 231). His conclusion is that “the dissident commu- 
nity will remain small and besieged by a hostile regime until 
civil society becomes more developed. Until then, the VCP 
will not face any credible threat to its power” (p. 239). 

These are informed and acute observations, and I generally 
agree with the analysis. I also am impressed by the compila- 
tion of supporting material. At the same time, this important 
book should be an occasion for those who study the Viet- 
namese political system to reflect on both the inherent 
and self-imposed limitations of this work. It is extremely 
difficult to do extensive first-hand research on most sensitive 
areas of Vietnamese politics, so Abuza, like most others in 
this rather small field, relies almost exclusively on secondary 
material, typically newspaper accounts, and dissident publi- 
cations outside Vietnam. He also employs revealing primary 
sources to good effect, but these are mainly produced by a 
handful of the dissidents who are the main focus of this book. 
And the volume of dissident writings that reach the outside 
world is not large. The author counts a total of about thirty 
dissidents, of whom fifteen were party members at one time. 
Only eleven of the secular dissidents were sentenced and 
imprisoned (p. 215). 

Primarily a factual narrative, the book is organized around 
different segments of the dissident community. (It would not 
be accurate to call them groups; with the significant excep- 
tions of religious dissidents and the defunct Club of Former 
Resistance Fighters, they are mostly isolated individuals.) 
Underlying concepts and assumptions could be more fully 
developed. For example: “Much of the problem is the fault of 
the Vietnamese political system. Vietnamese politics are highly 
factional, yet still operate on the basis of consensual decision 
making” (p. 242). This important insight helps explain both the 
frustration of the dissidents with the system and the limits of 
their ability to influence it. It is so central to the book that one 
wishes it had been examined at greater length. Another subject 
that might have been explored further is the religious dissidents, 
who are quite different in character and effect from the intel- 
lectual dissidents—so much so that it would be helpful to know 
more about what these differences imply. The analysis tends to 
lump all who challenge the status quo into one large group, 
consisting of separate but related subspecies. 

What sets the dissidents apart from more mainstream 
critics is that they extend their challenge to the power holders 
beyond disagreements about strategies of economic renova- 
tion into the political sphere, and it would be helpful to have 
more discussion of how the dissidents view the connection 
between the two. There is some coverage of this (e.g., pp. 
90-2), but the argument is not pursued in depth. Further- 
more, an important issue raised by the author is why the 
dissidents have emerged now. His explanation somewhat 
undercuts a basic premise of the book, that there is a linear 
connection between dissent in the 1950s and 1960s and in the 
1990s. He implies that there is now a “critical mass” of 
dissidents (p. 29), although the numbers cited above do not 
seem to bear this out. 

More fundamentally, Abuza notes the importance of the 
collapse of communism, the emergence of the “Asian tigers” 
as alternative models of development, the slowdown of 
Vietnam’s economic progress in the late 1990s, abuses of 
power dramatically revealed by the Thai Binh peasant pro- 
tests, and the widespread reaction against corruption in 
official circles. None of these developments would have been 
conceivable in the 1950s and 1960s, and the political agenda 
is very different now, which seems to bring into question the 
idea of tracing the roots of today’s dissidents to the earlier 
periods. The case of Tran Do, expelled from the party for his 
blunt criticism of its power monopoly, is one illustration of 
the pitfalls of the continuity argument. During the war he was 
a committed military leader and intolerant of any deviation 
from strategy devised by the top levels of the party. In the 
early 1980s he wrote books and articles on “decadent cul- 
ture” during his stint as minister of culture. Still, Abuza 
makes a good case for the argument that nothing is ever 
forgotten in Vietnamese political life, and old issues do have 
a way of reemerging in slightly different guise. 

The comprehensiveness suggested by the sweeping title of 
the book is not achieved, but this is an important contribution 
on its own terms. Yet, a better understanding of the main- 
stream of Vietnamese politics would make the position of 
those on the periphery even clearer. Perhaps a sharper 
distinction should have been drawn between sanctioned 
dissidents and those “beyond the pale,” along with a more 
sustained explanation of how and why this line is drawn. 
Nonetheless, Abuza has produced a very significant study, 
and it adds to our understanding of the murky terrain of 
Vietnamese politics. Although the approach is not compar- 
ative (it would be helpful to examine the similarities between 
the clientalist-consensus model of Vietnam and similar sys- 
tems in other Asian states), the book is well suited for courses 
on Asian politics. We should be thankful to Lynne Rienner 
Publishers for making this work available, although the 
quantity of minor typographical errors is irritating and, on 
some occasions, sows confusion about Vietnamese names. 


To Vote or Not to Vote: The Merits and Limits of Rational 
Choice Theory. By André Blais. Pittsburgh: University of 
Pittsburgh Press, 2000. 242p. $45.00 cloth, $19.95 paper. 


Carole Jean Uhlaner, University of California, Irvine 


André Blais takes aim at the disjunction between the predic- 
tion from rational choice theory that few people will vote and 
the fact that many do. He makes a major contribution to the 
debate over the appropriate use of rational choice theory in 
the study of political behavior by carefully applying evidence, 
including some from very clever original studies, to this 
voting paradox. The book is organized around the common 
expected utility form of the rational choice model: A rational 
person will vote if the benefit from having the preferred 
candidate win (B) times the probability that the person’s vote 
changes the outcome (P) is greater than the cost (C) of 
voting. Blais considers alternative formulations, notably mini- 
max regret, and dismisses them on theoretical and empirical 
grounds. He pays considerable attention to sense of duty, D, 
which often appears as a fourth variable. Blais argues, 
however, that this and other consumption benefits “fall 
outside the rational choice model” (p. 5); “that some people 
vote out of a sense of duty does not support or invalidate the 
model” (p. 5). He does not address other forms of collective 
action, about which rational choice may tell us more. As the 
author acknowledges, “voting is a tough test for rational 
choice,” since “voting is a low-stake decision” (p. 140). 
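
The decision rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Blais's own specification: the function name `votes` and all numeric values are hypothetical, and treating D as an additive fourth term is the familiar textbook extension, assumed here rather than taken from the book.

```python
def votes(p: float, b: float, c: float, d: float = 0.0) -> bool:
    """Return True if the expected-utility rule predicts turnout.

    p: probability that the person's vote changes the outcome (P)
    b: benefit from having the preferred candidate win (B)
    c: cost of voting (C)
    d: sense of duty (D), added as a fourth term (an assumption here)
    """
    return p * b + d > c


# Hypothetical values: a tiny P swamps even a large B.
print(votes(p=1e-6, b=10_000.0, c=1.0))         # False, since 0.01 < 1
print(votes(p=1e-6, b=10_000.0, c=1.0, d=2.0))  # True, since 2.01 > 1
```

With any realistic P the product of P and B is negligible, which is why the rule predicts abstention unless something like D is brought in.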

Blais concludes that “rational choice can contribute to 
understanding why people vote or abstain but its contribution 
is quite limited...in three ways” (p. 137), which reflect 
Blais’s empirical findings. First, the model does not apply to 
persons (half of the electorate, he estimates) who have a 
strong sense of duty; these people ignore benefits and costs 
and vote out of moral obligation. Second, among the rest, he 
argues that although P, B, and C have the predicted effects, 
the magnitudes are small. Third, people use P in ways 
inconsistent with the calculation of expected benefits. Yet, 
the model has theoretical utility, especially since it receives 
partial empirical confirmation. “The bottom line is that I 
cannot personally make sense of why people vote without 
taking into account the rational choice model” (p. 140). In 
particular, the model pushes us to examine why people vote 
by showing why voting is problematic. Blais places duty 
foremost among these reasons. “The major motivation that 
leads most people to vote is the feeling that if one truly 
believes in democracy one has a moral obligation to vote. 
Oddly enough, rational choice may have facilitated the due 
recognition of this quintessentially irrational factor” (p. 140). 

Blais builds his argument by examining “whether each of B, 
P, and C has independent effect on the decision to vote” (p. 
15). His first steps are interesting detours from this effort. He 
begins with an explanation of cross-national variations in 
turnout that identifies the usual socioeconomic, institutional, 
and party factors, but he adds to the literature by doing so 
across a set of countries much broader than that examined by 
others. His comparison of turnout fluctuations by type of 
election (local, national, presidential, legislative, referenda) 
enters the main argument to provide weak support for a B 
effect. This section of the book is the least convincing in light 
of the large number of uncontrolled factors. 

The next brief chapter uses cross-national survey data to 
recap typical conclusions on the demographic characteristics 
that distinguish voters from nonvoters. Blais then moves into 
the heart of his analysis, with one chapter each devoted to the 
evidence that P or C (or subjective perceptions of P or C) 
affects the decision to vote. Another chapter assesses the 
extent and effects of D and its interaction with perceived costs 
and benefits. In the penultimate chapter, Blais extends the 
generalizability of his findings by placing them in the context 
of the literature on the logic and incidence of free-riding. (In 
the context of voting, an abstainer is a free-rider.) 

The middle chapters are built around two nonrandom 
studies and three representative sample surveys conducted by 
Blais and his associates. Blais and Robert Young adminis- 
tered questionnaires to 1,459 social science students at the 
Université de Montréal and the University of Western On- 
tario after (and, for half the sample, also during) the 1993 
Canadian federal election. In a second study, 125 Université 
de Montréal students were interviewed in early 1995 with 
special attention to their estimates of P and its importance. 
The two major random surveys, with about 1,000 respondents 
each, sampled the Quebec and British Columbia electorates, 
the former during and after the October 1995 sovereignty 
referendum and the latter in the last week of the May 1996 
provincial election campaign. Finally, in spring 1996 a small 
(108) sample of regular voters in the Montreal area were 
probed on their reasons for voting. 

These studies provide richer measures than previously 
available of the perceived cost of voting (C) and of the 
perceived probability of changing the outcome and related 
probability variables (P). Blais convincingly shows that cost 
does affect turnout, but the small effect is further reduced 
because people perceive very low costs. Turning to probabil- 
ity, he concludes that direct estimates of P are only weakly 
related to respondents’ perceptions of the election’s close- 
ness. Moreover, only perceived closeness relates to turnout. 
“People ... rely on the vague notion that their vote might 
count more in a close election rather than on an estimate of 
the probability that their vote could be decisive” (p. 78). 

As Blais acknowledges, however, the circumstances of the 
1995 Quebec sovereignty referendum may have misleadingly 
depressed the apparent effect of P. First, the outcome was 
expected to be and was in fact extraordinarily close (50.6% 
“no” to 49.4% “yes”). Second, the stakes were extraordinarily 
high (high B); had the results been “yes,” Premier Jacques 
Parizeau would have immediately moved the province toward 
separation. Of course, these are circumstances under which 
the rational choice model predicts high turnout. In fact, the 
turnout was extremely high (93.5%), so there was little 
variance to explain. 

The circumstances in Quebec in 1995 also affect Blais’s 
later findings on the importance of duty. The stakes and the 
closeness of the referendum guaranteed that everyone from 
political leaders down to friends and neighbors would invoke 
a heightened sense of duty to vote. The conclusion that this 
overwhelmed other components of a rational model for the 
1995 referendum might not generalize well elsewhere. And in 
Blais’s data, the vote decisions of even those 1996 British 
Columbians with a high sense of duty responded to B and C, 
as the model predicts (although not to P, which mattered only 
for those with a low level of duty). In the 1993 student data, 
B, P, and C matter for those with a low level of duty and not 
for those with a high level, but also the Quebec students were 
more likely in either group to vote than were those from 
Ontario. That geographical difference is left unexplained by 
Blais. It would be useful to see the entire model estimated 
separately for the Quebec and the Ontario students, as duty, 
reinforced by exhortation, might matter more for the former 
than for the latter. 

The rational choice model does explain turnout decisions 
at the margin, especially among those with a weak sense of 
duty. As Blais notes, the model then points us to fruitful areas 
for further inquiry. He urges deeper investigation of the 
sense of civic duty, including qualitative research and panel 
studies, and replication of his surveys in other political 
contexts and institutional settings. For example, does P work 
similarly in proportional representation electoral systems? 
Perhaps most intriguing, the analysis implies a need to 
investigate which factors cause the level of duty and its effects 
to rise and fall across time and space. For example, Blais 
accounts for the low turnout in municipal elections by 
arguing that his data suggest “the moral obligation to vote is 
believed to apply to all elections in which the stakes are 
deemed to be relatively important” (p. 110). But circum- 
stances, including deliberate actions by leaders, can affect the 
perception of importance and the salience of duty itself. 

As Blais writes, further research would be useful. In 
contrast to many such exhortations, his call comes attached to 
a blueprint. That specificity is especially important in this 
field, where even scholars of an empirical bent have found 
theorizing far easier than testing. Blais has made an impor- 
tant contribution in demonstrating how to do this research 
and in clarifying which questions are worth addressing. 


Engendering Citizenship in Egypt. By Selma Botman. New 
York: Columbia University Press, 1999. 141p. $47.50 cloth, 
$18.50 paper. 


Diane Singerman, American University 

In the last decade scholarship on gender and politics in the 
Middle East has mushroomed and includes many good 
monographs on these questions for particular eras and in 
particular places. Botman paints a broad picture of social, 
historical, cultural, and political changes that have influenced 
the struggle of Egyptian women for voice and representation 
throughout the twentieth century. In particular, she adds 
more detail about the participation of leftist and communist 
women in Egypt’s political life than is available elsewhere. 

The author adopts a constructivist position, seeking to 
understand the experience of citizenship for women in the 
context of Egypt’s socioeconomic and political system and 
the patriarchal nature of family and public life. Herein lies 
part of the tension within this book. It tries to bring together 
a constructivist approach and a political culture orientation 
that is shaped by modernization theory. The constructivist 
voice examines the ways in which three different political 
eras—“postharem” nationalist and semiliberal constitutional 
politics, the state-run socialist Nasserist period, and the years 
of Sadat’s and Mubarak’s open door economic policy— 
affected women’s experience of citizenship. Botman presents 
important information about how changing national priori- 
ties and statist ideologies, as well as the larger regional 
political context, shaped the direction of the women’s move- 
ment and gender relations in Egypt. 

Yet, the modernization approach entails a rigidly dichot- 
omous interpretation of public and private realms. Botman 
places women firmly in the private realm and argues that this 
explains their subordination and lack of full citizenship rights. 
Egyptian men, she asserts, are the undisputed household 
head, and women’s “natural” primary identities are as wives 
and mothers. At times she argues this is due to the Egyptian 
state’s adoption of liberal politics and refers to Carole 
Pateman’s arguments about women’s exclusion from the 
liberal notion of citizen. At other times, however, Botman 
adopts a third political culture approach and identifies Islam 
as a source for women’s subordination through its legal 
statutes and religiously defined gender norms. She suggests 
that only by democratizing the family and jettisoning conven- 
tional social and cultural (read: Islamic) beliefs will gender 
relations become more egalitarian and will women become 
full citizens. Eclectic explanations for complex problems are 
warranted at times, but here they confuse the analysis. 

Botman insists throughout her analysis on a wide gulf 
between public and private spheres in Egypt. Yet, it is quite 
commonly accepted that Islam does not replicate the liberal 
dichotomy between public and private. As the author herself 
points out, Islam is concerned with the maintenance of social 
order and gender relations and has many laws that delineate 
proper relations of the most intimate nature between men 
and women. The writ of the state in an Islamic society does 
not stop at the family as it does in a liberal order. Indeed, the 
stability of the family and gender relations are deeply impor- 
tant to the Islamic state. Although nation-states such as 
Egypt have largely secularized their legal systems, which 
borrow heavily from colonial powers, statutes that regulate 
marriage, divorce, child custody, and inheritance are still 
largely based on Islamic law. 

Certainly, it is possible and essential to recognize the 
importance of Islamic law and gendered norms in the strug- 
gle for women’s equality, but not because of the vast gulf 
between public and private realms. Botman even mentions in 
passing some of the many Muslim women and various 
organizations that have used new legal and historical schol- 
arship to create the grounds to contest Islamic law from 
within its epistemological foundations. Scholars such as 
Amina Wadud, Fatima Mernissi, and Azizah al-Hibri and 
activists with transnational networks from diverse ideological, 
disciplinary, and religious perspectives have been reinterpret- 
ing religious and legal texts to defend and promote women’s 
rights. 

Botman misses a critical point that surrounds this debate. 
Yes, men have been activists and politicians in the Muslim 
Brothers and the Communist Party and have served as 
officials of the Egyptian monarchy as well as the Nasserist and 
post-Nasserist states. Men continue to rule Egypt, but they 
are also quite far from being citizens with full rights to 
political representation, voice, and access to civil society. Is 
the male exclusion from full citizenship throughout different 
forms of rule in Egypt also due to culture, Islam, and liberal 
ideology? Only in the last few years have Egyptians won 
small steps in their courageous struggle to achieve political 
voice and representation. The women’s movement, as Bot- 
man points out, has met with only marginal success, but one 
can make the same argument about the Muslim Brothers, the 
communists, and even the wealthy entrepreneurs of the 
Mubarak years. The power of the state to constrain political 
life remains formidable. As political opportunities arise, 
which they do periodically in contested parliamentary elec- 
tions and elsewhere, female candidates and elite women 
make gains in promoting equality and legal reforms. Cer- 
tainly, Egyptian women are constrained by patriarchy, and 
men are in positions of power across the political, economic, 
and religious spectrum, but the agency of women and the 
strategies they have used and continue to use to promote 
their interests, whether grounded in Islamic, liberal, or 
developmentalist discourse, deserve more space in this book. 


Elections in Australia, Ireland, and Malta under the Single 
Transferable Vote: Reflections on an Embedded Institu- 
tion. Edited by Shaun Bowler and Bernard Grofman. Ann 
Arbor: University of Michigan Press, 2000. 293p. $59.50. 


Richard L. Engstrom, University of New Orleans 
Shaun Bowler and Bernard Grofman remind us that electoral 
systems are “just one piece of a wider democratic puzzle” (p. 
270). The theme of their edited volume is a well-known 
message: Election systems are embedded in much larger 
social and political contexts, and their effects on election 
outcomes are “conditional and contingent” (p. 3). This is a 
message too often forgotten by political scientists. It is still 
common to see features of electoral systems inserted into 
analytic models as if they are separate and distinct indepen- 
dent variables with the same direct, uniform effect across 
diverse sociopolitical settings. Contributors to this volume 
show that such models usually need respecification. 

The election system examined is the single transferable 
vote (STV), often called “preference voting” by American 
electoral reformers. The book itself is organized according to 
a “most similar systems” design. The topics are the use of 
STV, or some variant of it, to elect members of the Irish Dail 
(by three Michaels at Trinity College Dublin—Gallagher, 
Marsh, and Laver); the Maltese House of Representatives 
(by Wolfgang Hirczy de Mino and John C. Lane, as well as 
Gallagher); the Australian Senate and some state legislative 
chambers in Australia (by Colin A. Hughes, Ben Reilly, and 
Michael Maley); and in the past some municipal governing 
bodies in Canada (by J. Paul Johnson and Miriam Koene). In 
addition, contributions by David M. Farrell and Ian McAl- 
lister and by Neal G. Jesse are not country-specific but 
contain information about some of these settings. The idea 
behind the volume is to begin searching for “other causal 
factors” that might explain the differences in the experience 
with STV across these settings, particularly the way in which 
political parties have adapted to this system (Bowler and 
Grofman, p. 14). 

The chapters, however, are not a series of comparative 
case studies that examine, in the same systematic manner, 
how any of these “other causal factors” operate across 
settings. No structured research design was imposed on the 
contributors, who use different methods to focus on different 
dimensions of the experience with STV. The result is eclectic. 
Some chapters are very historical in focus and rich in detail 
(e.g., Hughes). Some provide empirical tests of theoretically 
derived hypotheses (e.g., Gallagher). Others demonstrate 
that, given some specific assumptions about the information 
voters have and their subsequent behavior, election results 
are consistent with expectations derived through formal 
theory (e.g., Laver). This diversity in approach seems well 
justified, given the state of our knowledge about these systems 
and settings, but the search for the other causal variables, from 
setting to setting, is less than directly comparative. The intro- 
ductory and concluding chapters by Bowler and Grofman, 
however, do provide excellent overviews. 

Many readers may be surprised by the variation in the way 
STV has been implemented. Farrell and McAllister provide 
an overview of these variations, and many of the other 
chapters show that the details make a difference. Across the 
settings examined, the STV system is itself, in application, a 
variable rather than a constant, and this has an important 
effect on experiences with the system. The STV arrangement 
in Ireland usually serves as the textbook example, but it is far 
from representative of the others. Different district magni- 
tudes (the number of seats to be filled) can have an effect, of 
course, on the translation of votes into seats, and the various 
rules governing how the voters may express their preferences 
also can have an important influence on outcomes. An 
intriguing case in this respect is Australia, which in 1983 
added “above the line” voting (also known as ticket voting) to 
the STV arrangement used to elect the federal Senate. This 
provides the option to vote above a heavy black line for a 
particular political party and thereby adopt the preference 
ordering determined by that party. The alternative is to rank 
order virtually all the candidates for all the parties, listed 
below the line, so it is not surprising that almost all voters 
vote above the line. STV in Australia has been converted, in 
effect, into a list system of proportional representation (see 
the contributions of Farrell and McAllister, Reilly and Maley, 
and Hughes, at pp. 25, 50-3, and 162-3, respectively). 

The contributions do reveal what the editors describe as a 
“sizable gap” between theory and practice (p. 10). This is 
certainly the case for the often-repeated expectation that 
STV will result in multiparty systems. The experience with 
STV has been starkly different. Neal Jesse describes STV, in 
the context of Duverger’s theoretical formulation about the 
relationship between election systems and party systems, as 
an “outlier” empirically (p. 59). Farrell and McAllister 
identify ten distinct STV systems in use, over time, in Ireland, 
Malta, Australia, and two Australian states. The average 
number of “effective” elective parties within these systems 
exceeded three only once, in Ireland (it was 3.1 from 1928 
through 1989). The average number of effective parliamen- 
tary parties was closer to two than three in eight of the ten 
cases. The exceptions occurred in that same time period in 
Ireland and in Malta from 1947-55, both of which were scored 
at 2.79. Malta, it must be noted, is now said to have “one of the 
purest [two-party systems] in the world” (Gallagher, p. 87; see 
also Hirczy de Mino and John C. Lane, p. 194). 

The outlier status of STV is attributed, primarily, to the 
adaptation of large parties to the system, by adopting nomi- 
nation and vote management procedures as well as formal 
systemic rules, such as ballot structures and campaign expen- 
diture regulations, that counter any tendency toward frac- 
tionalization (Bowler and Grofman, pp. 10-11). Jesse further 
suggests that sophisticated voting within the electorate also 
dampens any such tendency. 

STV, along with alternative systems such as cumulative voting 
and limited voting, has received increased attention in the 
United States since the Supreme Court, in Shaw v. Reno 
(1993) and its progeny, placed new constraints on the cre- 
ation of majority-minority districts (in terms of racial and 
language minorities protected by the Voting Rights Act). The 
debate over whether to adopt such alternatives could be 
greatly informed by this volume. The fact that the conse- 
quences of electoral system choices are “conditional and 
contingent rather than categorical and mechanical” (Bowler 
and Grofman, p. 3) is often ignored in this debate. 

A quintessential example occurred in the September 1999 
hearings held by the Subcommittee on the Constitution, 
Judiciary Committee, U.S. House of Representatives, on 
Rep. Mel Watt’s proposed States’ Choice of Voting Systems 
Act. This bill would allow alternative election systems, such 
as STV, to be used for elections to the U.S. House. A political 
scientist testifying against the bill confidently stated that 
proportional representation (PR), or even cumulative voting, 
would lead to the fractionalized politics seen in Israel. The 
United States “will end up with a David Duke Party, a Pat 
Buchanan Party, an Al Sharpton Party, you name it.” No 
mention was made of any other countries that employ a PR 
arrangement, and there was no acknowledgement that Israel 
is an extreme case, even among countries using a list system 
of PR, as does Israel. There were no references to Ireland, 
Malta, or Australia, despite the fact that STV is far more 
likely to be adopted in the United States than any variant of 
the party-centered list system, especially the version used in 
Israel. 

Election systems should serve as conditional or specifying 
variables that affect how other variables relate to various 
dimensions of election outcomes. This very useful volume 
reinforces, once again, the need for that conceptual specifi- 
cation. 


Between the State and Islam. Edited by Charles E. Butter- 
worth and J. William Zartman. Washington, DC, and Cam- 
bridge: Woodrow Wilson Center Press and Cambridge Uni- 
versity Press, 2001. 256p. $49.95 cloth, $18.95 paper. 


Rex Brynen, McGill University 


As the title suggests, this edited volume explores the socio- 
political terrain between the state and Islam, emphasizing 
“the forces of skepticism, liberty, and creativity that struggle 
against Islamic conformism and state hegemony.” As the 
editors note, “however influential religion appears in word 
and deed, however evident the trappings of state authority, 
people do come into being, thrive, marry, raise families, 
think, laugh, and cry without regard to—indeed, sometimes 
in utter defiance of—the strictures of religious or state 
authority” (p. 4). 

Set in this way, the volume positions itself to make a 
significant contribution to the literature on Middle East 
politics, which often appears to allocate excessive explanatory 
effect to either political culture or authoritarian state power. 

As with all edited works, there are variations in how well 
the authors adhere to this theme and to what degree they 
offer additional analytical insight. In the first section of the 
volume, four contributors address the nineteenth century. 
Butterworth reviews paradigms of the state and of Islam. This 
is a useful first chapter, but it is neither detailed nor deep 
enough to add substantially to debates over how best to 
comprehend “Islam.” Antoine Zahlan explores technological 
change in Egypt. His survey is insightful, clear, and very 
engaging but, ironically, is perhaps the chapter least well tied 
to the overall theme. Serif Mardin looks at Islam and identity 
among the Volga Tartars. The treatment, although empiri- 
cally rich, could do more to conceptualize and generalize the 
findings outside the case study. Finally, Said Bensaid Alaoui 
presents a solid overview of nineteenth-century Muslim 
opposition thinkers. 

In the second section, the remaining six contributors 
explore the twentieth century. As’ad Abukhalil offers a 
characteristically frank and very useful examination of anti- 
conformist tendencies in Arab/Islamic thought, pausing on 
thinkers who have gone beyond traditional reformism to offer 
radical reinterpretations of Quranic text. Diya Harik surveys 
democratic thought in the Arab world, identifying both 
thinkers and key (human rights and other) institutions. His 
critical assessment of these actors (and the potential effect of 
their ideas) is thought-provoking, even if one does not fully 
accept all his conclusions. Harik’s discussion also assumes 
particular importance in the wake of the arrest and convic- 
tion of Egyptian-American sociologist and democratic activ- 
ist Saad Eddin Ibrahim, a regressive move clearly intended by 
the Egyptian authorities to intimidate human rights organi- 
zations and civil society more generally. 

Ibrahim Karawan discusses the maneuvering of indepen- 
dent political parties in Egypt, Jordan, and Tunisia between 
the twin poles of an authoritarian state and an Islamist 
opposition, which is an important and largely unexplored 
process. Timothy Piro’s chapter on liberal professionals also 
examines an important understudied topic, although its value 
might have been enhanced if the scope of the treatment 
(which includes business associations, professional syndi- 
cates, labor unions, the press, and think tanks) had been 
narrowed. Piro’s discussion also could have been strength- 
ened by discussion of the varied ideological trends within the 
sectors of civil society, not all of which are very “liberal.” 

Jean Leca, Meriem Vergés, and Mouina Bennani-Chraibi 
revisit the work of Daniel Lerner to offer an impressionistic 
analysis of the effect of audiovisual media in North Africa. 
They offer some insights, but the methodological foundations 
of the piece could be stronger, and the treatment itself is only 
weakly tied to the main focus of the book. In the concluding 
chapter, Zartman wrestles with the compatibility of political 
religion and secular democracy, and he finds such a recon- 
ciliation “impossible” (p. 243). The dualism implicit in this 
formulation seems to run counter to much of the spirit of the 
volume, but his prediction—that the resulting tensions will only 
be assuaged through processes of conflict, interaction, and 
negotiation—seems unassailable. 

In view of its title and purpose, the book generally fails to 
engage a fairly substantial and growing literature by anthro- 
pologists and political sociologists that explores how everyday 
social action is shaped by folk and “formal” beliefs, institu- 
tions, and state power. This literature is particularly rich and 
highlights family and gender politics; Diane Singerman’s 
Avenues of Participation: Family, Politics, and Networks in 
Urban Quarters of Cairo (1995) is just one of many possible 
exemplars. Instead, despite the editors’ apparent willingness 
in the introduction to privilege a micropolitical focus on how 
people individually and collectively live their everyday lives, 
most of the chapters assume a macropolitical perspective that 
emphasizes major thinkers, institutions, and political move- 
ments. 

Between the State and Islam contains much of interest. It is 
recommended for regional specialists and others interested 
in Islam or religion and politics, and it has several chapters 
that will be of use in graduate seminars on these topics. 


Politics on the Fringe: The People, Policies, and Organiza- 
tions of the French National Front. By Edward G. DeClair. 
Durham, NC: Duke University Press, 1999. 261p. $49.95 
cloth, $18.95 paper. 


Andrew Appleton, Washington State University 
“The charm of history and its enigmatic lesson,” wrote 
Aldous Huxley, “consist in the fact that, from age to age, 
nothing changes and yet everything is completely different” 
(The Devils of Loudun, 1952). Surely, Huxley would have had 
much to say about the role played by the Front National (FN) 
in the recent history of France. As René Rémond has reminded 
us over the years, the FN is but the modern incarnation of a 
traditional strand of the French Right. Indeed, Rémond is one 
of the most powerful voices behind the cultural approach to 
analyzing the contemporary extreme Right, arguing that any 
organizational manifestation of it can only be understood with 
reference to the historical and philosophical antecedents. Much 
of the scholarship on the FN either implicitly or explicitly uses 
this approach as a point of departure. 

As the extreme Right presence has made itself felt in the 
politics of other European countries since the decline of the 
postwar consensus (such as Italy, Germany, Austria, and 
Belgium, to name but a few of the more obvious cases), other 
scholars have begun to make cross-national comparisons. In 
this approach, the FN is no longer simply a continuation of a 
peculiar French history but the logical outcome of a set of 
political and economic factors that affect all postindustrial 
nations to one degree or another. In the work of Hans-Georg 
Betz, for example, the FN is one extreme Right party among 
others; to be sure, some of the variance may be explained by 
system-specific phenomena, but the underlying logic flows 
across cases. 

Into this debate comes Politics on the Fringe, a book that 
has its roots in Edward DeClair’s dissertation research 
conducted in France. One strength of the book is that the 
author acknowledges the debate over “exceptionalism” ver- 
sus comparative analysis; thus, he promises to place what is 
essentially an interpretive case study into a comparative 
context by examining the FN alongside some of its European 
counterparts. Unfortunately, one weakness of the book is 
that he fails in this endeavor, despite some brief portraits of 
the extreme Right in Austria (two pages), Belgium (two 
pages), Germany (one page), and Italy (two pages). A modest 
attempt to draw some comparisons between the themes 
promulgated by these movements is too cursory (to state that 
the “national identity crisis,” immigration, and the decline of 
the welfare state are connected with the extreme Right 
phenomenon is not particularly illuminating). The main 
contribution of the book stands in its appeal to those 
interested in the specific case of France. 

Politics on the Fringe loosely adopts the “life-cycle” model 
of party development, and this leads the author to adopt two 
important research strategies. First, the book essentially 
pursues a historical-narrative thread, tracing the rise and 
consolidation of the FN as an electoral force. Second, the 
stress in the work is upon party organization as the depen- 
dent variable. Methodologically, DeClair writes in the histor- 
ical-interpretivist tradition, which again injects a set of both 
strengths and weaknesses. Through a series of elite interviews 
with the top party leadership, the author aims to let the 
organizers of the FN speak “in their own voice.” Researchers 
of extreme Right politics in France may find this the most 
valuable contribution, as it skillfully permits the reader to 
gain a greater insight into the motivations and strategies of 
those directing the extreme Right resurgence of the 1980s 
and 1990s. 

The drawbacks of this theoretical and methodological 
posture, however, are those that often blight single-case 
analysis. The theoretical model is never fully specified (I 
suspect that this was a casualty in the transition from 
dissertation to book), which leaves us with the bland assertion 
that “uncovering the variable influences of the different 
facilitators at each level of the development process provides 
important information about political party development. 
The National Front is a compelling test case for this model of 
party development” (p. 50). Yet, absent a set of logically 
derived, falsifiable, empirically testable propositions, the 
narrative does little except lead the author to conclude that 
the FN has indeed passed through the critical three stages of 
the life cycle (origins, success, and durability), and this was 
primarily due to Le Pen’s leadership. 

Most unfortunately for DeClair, the historical-interpretive 
narrative of Politics on the Fringe becomes a prisoner of 
history itself. Just as the book was published, the FN was torn 
apart by internal struggles generated by challenges both to 
the leadership of Le Pen and the ideological and tactical 
posture of the movement. Little in the book prepares the 
reader for this turn of events: “Ominously, the future does 
indeed belong to the National Front” (p. 224). In an after- 
word we are informed that the FN is now in a fratricidal and 
potentially suicidal struggle. Although the author intelli- 
gently explains this turn of events in a few pages, the point is 
that nothing in the preceding pages predicts them. One is 
tempted to ask whether the life-cycle theory is adequate. 
Should we consider stages beyond “durability”? Or perhaps 
the party never really achieved durability? Could other 
theoretical approaches provide superior leverage (e.g., a 
rational-choice institutional account such as that provided by 
John Aldrich in Why Parties? [1995])? Nothing in Politics on 
the Fringe enables the reader to find the answers. 

One may conjecture that the twenty-nine elite interviews 
that form the core of the original data in the book brought 
the author into such close proximity with his object of study 
that the comparative context was somewhat lost. The refer- 
ence, for example, to the attraction of a Thatcherian-style 
authoritarian neoliberalism for at least one of those inter- 
viewed (p. 156) is highly intriguing. It definitely echoes my 
own experience in interviewing party leaders in the mid- 
1980s, when I found a pronounced admiration for the then 
British prime minister; the need for a more systematic 
contextualization and comparison of the extreme Right 
across postindustrial societies is indicated by pointers such as 
this. Some might quibble with the assertion in Politics on the 
Fringe that the FN has never been a violent movement; 
although Le Pen himself (as well as other prominent party 
leaders) has avoided public incitations of violence and has 
been careful to portray the party as operating inside the 
confines of republican institutions, there has been an unmis- 
takable whiff of a culture of violence that permeates the 
ts. 

Ultimately, Politics on the Fringe has much to recommend 
it to those interested in a deeper understanding of the French 
version, perhaps the prototypical version, of the extreme 
Right, and particularly to those who wish to gain an insight 
into the leadership of this horribly compelling movement. It 
cannot be concluded, however, that the book adds much to a 
broader understanding of the extreme Right across the 
world, whether in terms of development, orga- 
nization, or ideology. DeClair is to be commended for his 
fortitude in enduring long interviews with leaders of the 
extreme Right in France, but one wishes that the results had 
yielded rather more theoretical insight into models of party 
development and extreme Right politics generally writ. 


The Politics of Pragmatism: Women, Representation, and 
Constitutionalism in Canada. By Alexandra Dobrowolsky. 
Don Mills, Ontario: Oxford University Press, 2000. 320p. 
$24.75 paper. 

Avigail Eisenberg, University of Victoria 
Can a group as diverse as women are along nearly every 
politically relevant dimension be mobilized or mobilize itself 
to alter the course of national politics? In this study, Alex- 
andra Dobrowolsky traces how women shaped the Canadian 
Constitution from 1980 to 1992 and explains why they had the 
effect they did. On the basis of her findings, she aims to 
reformulate our understanding of how political mobilization 
and representation work. 

Broadly, two approaches tend to dominate scholarly ac- 
counts about how groups affect national political agendas, 
and neither adequately explains the ways that women partic- 
ipate in and advance political change. First, interest group 
theory suggests that individuals mobilize on the basis of 
shared interests. Interest groups aggregate individual inter- 
ests and have a temporary and focused purpose to advance 
these interests in the political—that is, state—arena by 
lobbying public institutions, building coalitions with other 
groups, and attempting to get as close as possible to those 
who hold power. Second, social movement theory focuses on 
the political change advanced by groups that are organized 
on the basis of loosely defined but shared ascriptive charac- 
teristics, such as gender or ethnicity. Civil society, rather than 
the state or public institutions, is the domain of social 
movements, and these movements generally aim at securing 
political change by “destabilizing the status quo.” Rather 
than lobby for the purpose of changing defined public 
policies, social movements have a strategic repertoire that 
includes everything from consciousness raising to civil dis- 
obedience (p. 5). 

Dobrowolsky argues that Canadian women used a large 
variety of tactics to secure social movement ends. She traces 
the activities of Canadian women’s organizations, feminists, 
femocrats, and feminism as a social movement during and 
between three episodes of constitution building in Canada 
(approximately 1979-82, 1987-90, and 1990-92). She pre- 
sents information from an exhaustive list of sources, includ- 
ing publications, pamphlets, briefs and documents generated 
by more than 100 organizations, and numerous personal 
interviews with key players from each period. The result is 
one of the most thorough and interesting empirical accounts 
of national political activity in Canada over the last twenty 
years. The book brings this empirical account to bear on the 
question of how best to understand political mobilization in 
general. Unsurprisingly, neither an interest group approach 
nor social movement theory can account fully for the effects 
of the concerted efforts of women in Canadian constitutional 
politics. 

That women had a profound influence is undeniable, 
according to Dobrowolsky. First, they succeeded in disrupt- 
ing the conventional politics of federalism, which historically 
has ignored identities other than those organized on a 
territorial (i.e., federal-provincial) basis. They convinced 
Canadians that other forms of identity, including those of 
national minorities and those based on ethnicity and gender, 
need to be recognized and represented at the negotiating 
table. Second, directly through lobbying efforts and by dis- 
rupting conventional discourse, women succeeded in secur- 
ing an expansive notion of equality that reconciles equality 
rights with difference and diversity. As Dobrowolsky argues, 
this expansive notion fundamentally changed not only the 
constitution but also the Canadian political and social land- 
scape. As a result, it changed the context in which women 
mobilized. 

Ironically, one way in which this expansive sense of equal- 
ity changed politics and society may have been responsible 
for the demise in power of national women’s organizations. 
Key women’s groups weathered troubled times during Can- 
ada’s constitutional episodes partly because, due to their own 
success at arguing for an expansive notion of equality, they 
were confronted by their own failure to respond adequately 
to the identity-related differences among women. In particu- 
lar, women of color, Aboriginal women, and Quebec women 
compelled the movement to rethink the ways in which it 
represented women’s interests. Aboriginal and Quebec 
women withdrew their formal participation from national 
organizations and advanced their distinctive interests and 
perspectives, using their own increasingly powerful organiza- 
tions. 

Dobrowolsky resists the conclusion that the women’s 
movement in Canada operated by advancing vested interests 
of particular women, that is, white, wealthy professional 
women. Yet, her analysis shows that at one crucial juncture— 
during the 1980-82 constitutional negotiations—a network 
of such women were largely responsible for ensuring that 
equality rights were firmly entrenched in the Canadian 
Constitution. Since that time, the women’s movement in 
Canada has displayed what might be seen as a healthy respect 
for the politics of inclusion and a strong resolve to ensure that 
differences among women, especially those based on linguis- 
tic, ethnic, and national divisions, are not neglected in favor 
of presenting a façade of gender-based unity. The price of 
inclusion, however, is a politics that does not attempt to unite 
women around a set of shared values and does not presume 
or expect solidarity among women in relation to any single 
political agenda. 

For Dobrowolsky, the pragmatism of women’s political 
activity in Canada is evidence of success rather than failure. 
She strongly resists the conclusion that identity politics is 
“fragmenting, divisive and therefore detrimental” (p. 197). 
She argues that concerns over identity politics are born out of 
a preoccupation with unanimity and neglect of the benefits of 
discursive exchange (p. 197). In this sense, Dobrowolsky 
urges readers to look at political mobilization using a differ- 
ent set of standards, but she does not offer a more precise 
idea of what these standards ought to be. 

Unanimity may not be a realistic ideal, but at what point 
can one conclude that a movement suffers from fragmenta- 
tion? Discursive exchange is important to any democratic 
movement, but at some point should conversations move on; 
should decisions be made; and, as in the case examined in this 
book, should principles be entrenched in a constitution or be 
defeated? In Canada, the amendments in 1990 and 1992 were 
both defeated. Dobrowolsky provides convincing evidence 
that women as a social movement made a difference to each 
of these decisions, but her analysis raises the question of 
whether a more pluralistic movement wedded to a more 
discursive form of politics is not better suited to disrupting 
attempts to change the constitutional status quo rather than 
to advancing change. Dobrowolsky does not address this 
question, but her book provides a thoughtful analysis of the 
evidence and arguments one would have to consider in order 
to answer it. 


Citizens, Experts, and the Environment: The Politics of 
Local Knowledge. By Frank Fischer. Durham, NC: Duke 
University Press, 2000. 336p. $59.95 cloth, $18.95 paper. 


Sheldon Kamieniecki, University of Southern California 


Advances in science and technology account for much of 
today’s pollution, but science and technology are now being 
relied upon to reduce pollution, protect natural resources, 
and promote habitat conservation. As scientists, experts, and 
technical personnel become increasingly involved in these 
efforts, public values and preferences, especially at the local 
level, are being ignored. Citizen participation is declining, 
and professionals are playing a more influential role in 
environmental policymaking than before. Because a large 
majority of these professionals are not elected and are not 
directly accountable to the public, decision making is less 
democratic. With this as a backdrop, the major question this 
book attempts to address is how citizens can participate in an 
age dominated by complex technologies and expert decisions 
(p. 6). 

This book is divided into four sections. Part 1 provides a 
broad discussion of the role of technology and expertise in 
today’s society. The first three chapters examine citizen 
participation as both an ideology and an activity, criticize 
professional expertise, and call for alternative practices. The 
fourth chapter reviews the epistemological issues underlying 
the critique of empirical social science, natural science, and 
technology and offers a “postpositivist,” discursive theory of 
knowledge. “Postpositivism,” according to the author, “is 
grounded in the idea that reality exists but can never be 
understood or explained fully, given both the multiplicity of 
causes and effects and the problem of social mean- 
ing . . . Critical of empiricism, ‘postpositivism’ emphasizes the 
social construction of theory and concepts, and qualitative 
approaches to the discovery of knowledge” (p. 282, footnote 
1). 

Part 2 contains three chapters and addresses the role of 
expertise in environmental policymaking, the political re- 
sponse of environmentalists to technocratic decision prac- 
tices, and the resulting politics of “counterexpertise.” Part 3 
also contains three chapters and discusses the deliberative 
alternative, emphasizing the importance of local knowledge 
(frequently studied by anthropologists, such as Steve Lan- 
sing), citizens as experts, and community inquiry. 

The two chapters in the final section attempt to show how 
the participatory inquiry of lay citizens and experts can be 
employed to address complex issues and the ways that their 
collaborative assessments can be used to inform legislators. 
Determining environmental risk, both empirically and sub- 
jectively, is a central aim throughout the work. In conclusion, 
the author calls for a new understanding of the expert as 
“specialized citizen.” 

Fischer primarily views the problems of scientific expertise 
and the need for increased levels of informed citizen partic- 
ipation within the context of political theory. “What the 
positivists have failed to grasp . . . is that scientific discourse is 
itself a highly interpretive enterprise. Given this interpretive 
dimension, science loses its privileged claim as superior 
knowledge” (p. 44). Hence, citizen participation is critical not 
only for the functioning of democracy and as a value in and 
of itself, but also because science is open to subjective 
interpretation, and knowledge is socially constructed (i.e., it 
is not objective or value free, and facts do not exist). This is 
precisely why the author believes scientific expertise and 
citizen participation should be treated equally in the policy- 
making process. 

Average citizens, we are told, have the ability to under- 
stand the most complicated scientific and technical issues. 
Fischer recommends the establishment of “consensus confer- 
ences” (pp. 234-40), or about two dozen “ordinary” citizens 
who assess the science and technology related to a specific 
issue or problem. People who are chosen to participate in 
these conferences read and learn about the technical issues 
involved and, with the help of a nonexpert facilitator, develop 
a set of recommendations for policymakers and legislators. 
This approach has been employed in Denmark and elsewhere 
with some success. 

Overall, the book is well written. The author effectively 
develops important issues, arguments, and ideas before stak- 
ing out a position. Readers in all fields will appreciate the 
clarity of the discussions concerning the expanding role of 
science and technology in environmental policymaking and 
why citizen participation is vital, especially at the local level. 
The study integrates central concepts and ideas 
in political theory with analyses of the most serious problems 
related to citizen participation in environmental policymak- 
ing. 

A critical question Fischer skirts is whether more citizen 
participation leads to more effective environmental policies. 
Regardless of what mechanisms are used to involve the 
public, will the result be environmental policies that work? 
No study has shown that the level of public participation— 
regardless of how it is achieved—varies directly with the level 
of effectiveness of public policy. In fact, research shows that 
the relationship between these two variables is ambiguous at 
best. Although the author may feel that an increase in citizen 
participation is as important as (or even more important 
than) developing cost-effective, successful plans and regula- 
tory programs, it is highly doubtful that politicians and 
policymakers will agree. 

The failure of previous environmental policies has less to 
do with whether science is completely subjective or a decline 
in citizen participation and has more to do with poor policy 
design and execution. The extreme politicization of critical 
environmental problems (e.g., the protection of biodiversity 
and climate change) has resulted in no action or failed 
policies at the federal, state, and local level. In many if not 
most cases, the rise of interest group politics has prevented 
vital environmental legislation from being passed. Laws that 
have been adopted have not been strictly enforced. 

The book also does not adequately address how to per- 
suade citizens to become more actively involved in environ- 
mental policy at the local level. In general, the level and type 
of public participation that Fischer desires takes a great deal 
of time and effort, and most people do not want to become 
involved to this extent. Why would they want to participate in 
fairly demanding consensus conferences? (Interestingly, the 
study does not explore how advanced computer and commu- 
nications technology might be used creatively to include a 
larger number of citizens in environmental decision making 
at the local level in a meaningful way.) There must be an 
increase in feelings of civic duty and responsibility before 
such mechanisms as consensus conferences can be successful. 
The major problem with the book is that it fails to offer 
significant new insights into the dilemma of citizen participa- 
tion in modern society, particularly when highly technical 
issues are at stake. The same observations and possible 
solutions to this dilemma have been discussed by others as 
well as the author himself (Technocracy and the Politics of 
Expertise, 1990; “American Think Tanks,” Governance 4 [July 
1991]: 332-53). At the same time, the vast literature on 
interest groups, policy stakeholders, agenda setting, and 
public participation in political science is ignored. Regretta- 
bly, despite the importance of the topic, the book does not 
break new ground or add to our knowledge of citizen
participation in environmental policymaking.


The British Presidency: Tony Blair and the Politics of Public
Leadership. By Michael Foley. Manchester: Manchester
University Press, 2000. 374p. $74.95.


G. W. Jones, London School of Economics and Political Science

In this updated edition of his book, published in 1993 as The 
Rise of the British Presidency, Michael Foley develops his 
argument and rounds on his critics. He remains committed to 
his central proposition that the study of American presidents 
reveals underlying political pressures that have transformed 
the British prime minister into the British president. He 
asserts (p. 331): “The comparability that has come to exist 
does so at a level that transcends the constitutional differ- 
ences within the two systems.” He claims there has been a 
fundamental systemic change in British government brought 
about by irreversible dynamics in the British political system. 

The drivers of the transformation are the media. They 
regard politics as a clash between leaders, who personalize 
their parties, programs, and governments. Both prime min- 
isters and presidents deal directly with the people and power 
centers. In Britain prime ministers detach themselves from 
their parties, their cabinets, and Parliament in seeking direct 
links to the people both to attain and to keep office. 

Foley makes a plausible case, writes persuasively in an 
elegant style, deploys apt quotations from his extensive 
collection of press cuttings, and shifts easily between U.S. and 
British experiences. Had he waited for the aftermath of the 
general election of 2001, he would have been able to rein- 
force his thesis. The party campaigns focused overwhelmingly 
on the party leaders and copied U.S. techniques, and the 
media concentrated on the prime minister and the leaders of 
the Conservative and Liberal Democrat parties. Tony Blair, 
once reelected, reorganized the core executive of the Prime 
Minister’s Office and the Cabinet Office, increasing and 
reshaping his staff and fusing the two offices under his 
control. To many commentators Blair copied the organiza- 
tion of the White House. The general election of 2001 and 
the subsequent reorganization of government resonate with 
U.S. analogies.

In this edition Foley covers the Blair government’s first
term and examines how Blair was governing. In the first 
edition he looked only at the campaign to obtain office, 
politics, images, and the media, without exploring the struc- 
tures and processes of government, and especially the rela- 
tionships between prime ministers and ministers. He still 
downplays the latter, asserting that underlying political pres- 
sures have transformed cabinet and ministerial government 
into prime ministerial government. He fails to appreciate that 
much of the high personal profile achieved by the prime 
minister is desired by his ministerial colleagues because it 
helps them and their party win elections and gain consent for
their policies. The prime minister is only as dominant as these 
colleagues let him be. If he becomes a liability to winning 
elections and gaining consent, he will be constrained and 
ultimately jettisoned. 

Foley also adds a section on four objections to his thesis. 
The first is that of deceptive appearances, which argues that 
a flamboyant prime minister can distort news coverage to give 
an impression of personal hegemony. The second is that of 
the flash in the pan, which argues that a prime minister may 
achieve governmental prominence, but it will be only tempo- 
rary in a system that depends on ministers predisposed to 
collective working and that is produced from elections based 
on mandates for governments rather than for individual 
leaders. The third is that of the iron law of politics, which 
argues that a centralizing prime minister cannot be sustained 
against a political system rooted in conflict, division, and 
challenge that is expressed through Parliament and a cabinet 
dependent on always having parliamentary support. The 
fourth is that of the problem of precision, which argues that 
it is so hard to see through the complexity of data that it 
makes sense to stay with the generalization that most accords 
with traditional observation. The burden of proof lies with 
the innovator, and the presidential-government paradigm 
lacks a clinching body of evidence. 

Foley’s response is to emphasize that his analysis of spatial 
leadership, outsider politics, competitive populism, personal 
projection, media management, individuated party images, 
the permanent campaign, and new linkages between leaders 
and their public constituencies shows that new underlying 
political pressures have overturned the traditional institu- 
tions and processes of government. 

The fifth objection to the Foley thesis, which the author 
fails to address, is that it lacks an historical perspective. 
Foley’s analysis is limited to the period from Margaret 
Thatcher in 1979 and takes no account of how past prime 
ministers behaved. Disraeli and Gladstone personified their 
parties in the nineteenth century and were dominant in both 
electioneering and governing. Lloyd George exercised such 
personal leadership—breaking with his party, keeping aloof 
from Parliament, appealing directly to the people through 
most of the approaches noted by Foley, and increasing his
staff—that he was known by the 1920s as an imperial Caesar.
But the government of Bonar Law in 1922 dismantled Lloyd 
George’s expanded prime ministerial Secretariat in the gar- 
dens of 10 Downing Street and reasserted the processes of 
cabinet and ministerial government. The institutions that 
make up the British Constitution were not transcended and 
transformed by Lloyd George or Margaret Thatcher. Tony
Blair’s dominance is contingent, not structural, the result of a
unique set of circumstances that could be changed, above all
by a downturn in the economy, [...],
and growing unpopularity.

The image of the elastic band to explain the British system 
is still apt. It stretches to accommodate an assertive prime 
minister but contracts to suit one less activist. Blair may be
the prime minister who has stretched it the most, but he has
not yet gone as far as to break the elastic. He has not called 
his new arrangements in the core executive a prime minister’s 
department because he knows that would snap the band by 
alienating his ministerial colleagues. Blair prefers a presidential
system to a parliamentary system, as is shown also by his
advocacy of directly elected mayors to replace collective and
conciliar processes in local government, but he has not dared
make the move that would indicate that prime ministerial
government has replaced cabinet and ministerial government.


Factionalism in Chinese Communist Politics. By Jing Huang. 
New York: Cambridge University Press, 2000. 458p.
$59.95. 

Cadres and Corruption: The Organizational Involution of 
the Chinese Communist Party. By Xiaobo Lu. Stanford,
CA: Stanford University Press, 2000. 368p. $55.00. 


Bruce J. Dickson, George Washington University

These two books cover fundamental aspects of the Chinese
political system that are widely acknowledged but not well 
understood. Factionalism has been endemic in the Chinese 
Communist Party (CCP) from its creation in 1921. Scholars 
and journalists often refer to factional strife within the party, 
but the contours and dynamics of Chinese factionalism
remain obscure. Corruption, whether malfeasance or
rent-seeking behavior, also has bedeviled the CCP ever since it
came to power in 1949. The causes and consequences of 
corruption are better understood, but the CCP has been 
unable or unwilling to deal effectively with the problem, 
despite the damage to the regime’s reputation and coher- 
ence. Both books attempt to shed new light on these endur- 
ing elements of Chinese politics. 

Because there are few book-length studies on these issues
and because these volumes promote fresh analytical perspec- 
tives, they are likely to be widely read. Both are notable for 
incorporating approaches that are often missing in studies of 
Chinese politics: for Lu, comparative analysis, and for 
Huang, rational choice. Yet, both are undermined by ac- 
counting for so little temporal or regional variation in their 
inquiries. Both imply that the causes of factionalism and 
corruption have remained fairly constant, despite the many 
political changes in China in the period covered by each 
book. They are strongest in their factual descriptions of
factionalism and corruption, respectively, but the analytical
perspectives adopted do not add a great deal to the descriptions.

Huang intends to show that factionalism has been the
primary independent variable in Chinese politics. The book 
covers the period between the late 1930s, after the CCP had 
arrived in Yan’an and Mao had attained supremacy within 
the party, until the fall of Hu Yaobang in 1987. The roots of 
the clash between Mao and Liu Shaoqi that was finally
revealed in the Cultural Revolution are traced to the initial 
state-building efforts of the 1950s, which challenges the 
conventional wisdom and will likely engender debate. The 
1987 endpoint is unexplained and omits the most important
episode of factional strife in the post-Mao period, the 
Tiananmen demonstrations of 1989 and the emergence of 
Jiang Zemin as the top leader. 

The emphasis of Factionalism is on an interpretation of 
elite politics during approximately 50 years. Much of this 
interpretation covers well-known events, and little new
information is presented. Huang provides an extensive and able
review of the differing approaches to the study of elite politics
in China. He favors the cultural approach championed by
Lucian Pye and dismisses the structural approach developed 
by Andrew Nathan. As a result, his argument is
China-specific and based primarily on the importance of guanxi.

Huang’s interpretation reveals a large intellectual debt to 
his mentor, Roderick MacFarquhar, both in the numerous
references to his work and his criticism of scholars who offer 
opposing perspectives, especially Frederick Teiwes. The de- 
bates in the book are largely over interpretation of events and 
personal motives, debates that are difficult to resolve because 
we lack definitive evidence. Huang challenges the reliability 
of other scholars’ sources but is vulnerable to the same 
charge. Much of his analysis is based on speeches, memoirs, 
and biographies that often were published many years after 
the events, which leaves their veracity and objectivity in 
doubt. Nevertheless, he does a good job in showing how the 
behavior of Chinese leaders was constrained by insufficient 
information and awareness of options, and he brings to life 
some of the key personalities. 

Huang develops a modified prisoner’s dilemma game to 
describe specific episodes of factional strife, which is a 
potentially valuable contribution. He lays out the formal logic 
and potential payoffs of the players, but this seems to add 
little to the empirical analysis. For instance, he does not apply 
game theory to the account of the Cultural Revolution 
decade to highlight overlooked or underappreciated aspects. 
The ranked preferences of specific leaders during certain 
episodes are asserted but not demonstrated. Some examples 
of leaders who behaved “irrationally” would have been 
useful. Given the turmoil of Chinese politics, someone must 
have miscalculated at some point. 

If factionalism is the independent variable, then the de- 
pendent variable seems to be the ups and downs of central 
leaders, especially Politburo members. There is little atten- 
tion to how factions operate, whether they function as 
channels of communication, whether they recruit members at 
lower levels, split or merge, and compete for seats on the 
Politburo, Central Committee, or State Council. The most 
salient question—how factional activities affect the making 
of concrete policies and their implementation—is mentioned 
in the conceptual chapters but entirely absent from the 
empirical chapters. 

A key insight of Factionalism is that “the final outcome of 
an elite conflict during the Mao period was often determined 
by which side Mao was on; during the Deng period, however, 
it depended on who sided with Deng” (p. 422). This captures 
an important dynamic of Chinese politics and an essential 
difference between these two leaders. Despite this
acknowledgment of fundamental change, Huang argues that the
nature of factionalism was a constant during the many twists 
and turns of the Maoist and post-Mao periods: “All the basic 
features of the Yan’an Round Table before the [Cultural 
Revolution] . . . remained unchanged” into the post-Mao
period (p. 383). Huang does not incorporate postrevolutionary
dynamics or intergenerational transitions. The idea that
factional networks were created before 1949 and were policy 
blind may have been true for the Mao years (which is highly 
debatable), but that is clearly not the case for the post-Mao 
years and especially the Jiang Zemin era. This is the short- 
coming of the time frame chosen: The central role of policy 
agendas took on greater significance with the rise of Zhao 
Ziyang and later Jiang Zemin, but these developments are 
outside the scope of the book. Much of the conceptual and 
descriptive argument does not ring true in later periods. 

Huang concludes with a brief discussion of how factional- 
ism changed in the 1990s and will change in the future. This 
is a very interesting section, but it assumes that the reader
brings foreknowledge of issues and events of these years,
because no information is presented. The author claims one 
finding of his study is that factionalism within the party can 
become the basis for political pluralism, but again there is no 
evidence to support this provocative claim. In the end, this is 
a thoughtful contribution to the literature on elite politics in 
China, but it will not close the debate on the causes and 
consequences of factionalism. 

Xiaobo Lu’s study of cadre corruption takes as analytical 
inspiration Ken Jowitt’s concept of the “neotraditionalism” 
of Leninist parties. Based on Jowitt’s discussion of the 
organizational development of ruling communist parties and 
the eventual loss of their revolutionary ethos, Lu posits a 
model of “organizational involution,” by which he means that 
revolutionary parties tend to adopt some rational features (in 
the Weberian sense) but refuse to embrace or fully adapt to 
the routinization characteristic of modern bureaucracies. 
Instead, they replicate traditional practices, developed during 
the revolutionary years, even though these are no longer 
appropriate to postrevolutionary challenges. He emphasizes 
how the “deviant” behaviors of communist officials were the 
direct and logical, if unintended, consequence of official 
policies. The irony is that steps taken to prevent and punish 
corruption and other forms of deviant behavior only serve to 
perpetuate and deepen those behaviors. Beginning with this 
promising insight, he intends to show that the problem of 
cadre corruption in China is a generic problem of official 
deviance common to all communist parties after attaining
power.

Lu argues that organizational involution is the most appro- 
priate explanation for cadre corruption, even though the 
nature of that corruption varies across time. In the Maoist 
era, when markets were either very limited or entirely absent, 
cadre corruption was often noneconomic, even though offi- 
cial status was used for private purposes, such as getting 
children into good schools or assigned to better jobs. With 
the return of markets in the post-Mao years, and especially 
beginning in the 1990s, corruption took the form of blatant 
graft and rent-seeking. 

Although the model of organizational involution is not 
elaborated enough to account for both the changing context 
and changing nature of corruption, it is useful in explaining 
why the CCP has not adequately addressed this growing 
problem. Organizational theory also could help explain why 
vested interests, dominant ideology, bureaucratic resistance, 
and other features of complex organizations prevented the 
CCP from adopting more successful strategies. Much of the
empirical material concerns the 1980s and 1990s. Although 
cadre corruption seems to have grown more severe since 
then, it seems to follow the same trends described in the book 
and does not necessarily refute the logic of organizational 
involution. The chosen time frame is therefore less a problem 
for Lu than for Huang. 

Lu admits it is hard to quantify corruption, although the
temptation is irresistible. Examples are sprinkled throughout 
the text, drawn from local histories and other original 
sources. He makes good use of these new sources, but the 
rationale for selecting examples is not spelled out. Whether 
they are typical or particularly egregious is not clear. It would 
be useful, for example, to know whether there is regional 
variation in the past or present prevalence of corruption, and 
if so, what contextual factors explain that variation. 

In addressing existing scholarship on corruption and the 
problem of monitoring the behavior of local officials, Lu too 
readily dismisses alternative viewpoints and approaches
without definitively showing that his approach is superior.
Paradoxically, he also mentions continuities that suggest at least
part of the explanation for cadre corruption is specific to
China, not generic to communist systems. For example, he 
rejects the cultural determinism perspective but also shows 
how traditional patterns of authority survived after 1949 and 
how cadres “approximate pre-revolution local officials” (p. 
233). This tendency to dismiss rather than build upon
previous scholarship mars the otherwise thoughtful analysis.

The comparative examples in Lu’s concluding chapter help 
put the Chinese case into proper perspective, but they focus 
more on approaches than on cases. Thus, we read more 
about how neotraditionalism was a generic feature of com- 
munist regimes than about case material to demonstrate it. 
We read how patrimonialism and prebendalism are common 
explanations for corruption in developing countries, but only 
a passing reference to Nigeria is made to support the claim. 

Lu ends the book with the statement that “Chinese offi- 
cialdom will... likely remain neotraditional in the near 
future, nurtured by a ‘booty capitalism’—a fragmented but 
administratively managed market economy” (p. 257). This is 
an intriguing claim, but the last sentence of a book is no place 
to introduce such an important concept. Perhaps his next 
book will fill in the rest of the picture. 


The Politics of Democratization in Korea: The Role of Civil 
Society. By Sunhyuk Kim. Pittsburgh: University of Pitts- 
burgh Press, 2000. 196p. $19.95 paper. 

Aie-Rie Lee, Texas Tech University


This book is well written and enjoyable to read, in part 
because of the good use of primary sources and in part 
because of the deliberate and thoughtful way that Sunhyuk 
Kim develops the theoretical argument. Certainly, this is a 
welcome addition to the growing literature on democratiza- 
tion in general and in South Korea in particular. 

One of the most compelling streams in this literature is a 
rediscovery of civil society. Based largely on participant 
observation in small voluntary groups, scholars are charting 
the bases for collective political action, particularly how 
citizens choose to identify themselves and organize for 
political action in response to state repression. The civil 
society paradigm used in this book finds that groups orga- 
nized along lines that expressed new and old horizontal 
solidarities: grassroots religious organizations, trade unions, 
business associations, environmental organizations, student 
groups, women’s groups, neighborhood associations, indige- 
nous groups, and so forth. The theoretical framework looks 
primarily to the self-organization of political space through 
associational life as the answer to the puzzle of building and 
sustaining democracy. 

The purpose of Kim’s research is to explain how and why 
Korean democratization emerged in response to the de- 
mands and activities of civil society groups. The book focuses 
on the role of intermediary social and political organizations 
as the foundation of democracy. Employing a comparative- 
historical analysis of three “democratic junctures” in Korea, 
Kim maintains that Korean democratization has consistently 
been initiated and promoted by civil society groups. They 
have significantly precipitated—if not directly caused—au- 
thoritarian breakdowns, facilitated democratic transitions, 
and to a large extent determined the dynamics of politics in 
democratic consolidation. Crucial to understanding the ori- 
gins and consequences of Korean democratization for Kim 
are the scope, power, and concerns of the prodemocracy 
coalition.

The first chapter establishes the purpose and premises of
the book. In chapter 2, the author lays out the theoretical 
debate and argues for the applicability of the concept of civil 
society to Korea. This chapter also provides a list of analytical 
questions that define civil society in terms of actors, interac- 
tions, and issues. Chapters 3, 4, and 5 examine in depth the 
various civil society groups that initiated and directed democ- 
ratization during each democratic juncture. Throughout, the 
author exhibits a thorough understanding of both the ideo- 
logical differences among groups and the socioeconomic and 
political events that led to their creation. Chapter 6 focuses 
on democratic consolidation and how civil society groups 
have affected it. Finally, chapter 7 looks at the future of civil 
society and democracy in Korea. 

Kim analyzes democratization in Korea as a sequence of 
structural opportunities and constraints. The author is cer- 
tainly right in suggesting that the process is no longer 
considered linear or simple. For Kim, there are three stages: 
authoritarian breakdown, democratic transition, and demo- 
cratic consolidation (p. 11). 

Contrary to conventional wisdom, Kim asserts that it is 
misleading to argue that the elite contributed positively to 
democratic transition and consolidation in Korea. Instead, a 
series of massive, intense, and protracted prodemocracy 
popular movements directly caused or indirectly facilitated 
the change. Nevertheless, I believe the role of the elite is still 
an open question. Also, it is arguable that the visibility of civil 
society groups does not necessarily mean that they are causal. 

The chapter on the role of civil society groups during the 
third democratic juncture, between 1984 and 1987 (chap. 5), 
is one of the most interesting, in part because it espouses the 
view that the Korean state behaved in a more “pluralistic” 
manner in the late 1980s than in the early 1980s. The most 
intriguing question is why political democracy was possible in 
1987 but not in 1980, immediately after Park’s death. Kim 
contends that, as compared with the previous two democratic 
junctures, the scope and power of the prodemocracy coalition 
had considerably increased. The triple solidarity of student 
groups, labor unions, and religious organizations—and their 
prodemocracy coalition with the opposition party—contrib- 
uted to the authoritarian breakdown between 1984 and 1987. 
Furthermore, the prodemocracy coalition attracted many 
middle-class citizens, argues Kim (p. 103). 

At times Kim focuses too narrowly on evidence supporting 
his argument and dismisses alternative scenarios too quickly. 
For example, he claims that neither the precondition nor 
contingency paradigm explains democratic transition and 
consolidation in Korea. It is clear why the author challenges 
the precondition paradigm, in particular, but it can be argued 
that sustained economic growth weakened the power base of 
authoritarian forces because it expanded the density of civil 
society and, therefore, its capacity to check the monopolistic 
government. Although it is difficult to determine empirically 
the relative importance of such variables as economic devel- 
opment, elite strategy, international environment, and civil 
society, all of these seem to have contributed. The question 
still remains as to how and why democratization came about. 
Kim has definite views about why political actors chose 
certain courses of action, but to me his arguments are more 
accurately described as working assumptions rather than 
conclusions drawn from the Korean case. 

Kim also shows how civil society became gradually
marginalized following the June uprising in 1987 by “delegating”
powers to political society. Kim speculates that if the popular 
movement had retained hegemony over political society a 
little longer, the mode and character of the Korean transition
could have been different. Certainly, this is a questionable
“if.”

In sum, The Politics of Democratization in Korea is an 
accessible, authoritative, and affordable text. The author 
successfully tackles his ambitious project and presents a 
readable and manageable book for an undergraduate or 
graduate audience. It provides educators with one concise 
source for examining democratization, civil society, and the 
state in Korea. It is a valuable contribution to any compara- 
tive course on civil society, social movements, society-state 
relations, and democratization in Korea and/or the Third 
World. 


Legislative Institutions and Ideology in Chile. By John B.
Londregan. New York: Cambridge University Press, 2000. 
280p. $59.95. 


Peter M. Siavelis, Wake Forest University 


When the socialist Ricardo Lagos assumed the presidency of 
Chile in March 2000, it appeared that politics there had come 
full circle. Despite substantial ideological differences be- 
tween the Socialist Party of Salvador Allende and that of 
Lagos, few analysts failed to remark on the irony of a socialist 
presidency only eleven years after the departure of a military 
government bent on the brutal eradication of the Left. 
Despite this apparent sea change, however, the legacy of 
Chile’s military government is alive and well. Postauthoritar- 
ian presidents had relative success in the areas of economic 
management, civil-military relations, and social reform, but 
they failed to alter substantially the primary political legacy of 
Chile’s rightist government: the 1980 constitution. 

That is the point of departure for John Londregan’s 
persuasive and expertly executed analysis of legislative insti- 
tutions in Chile. Londregan seeks to answer two central 
questions; one is of concern to institutional analysts more 
generally, and the other to Chileanist scholars. First, Lon- 
dregan analyzes a paradox of constitutional design in author- 
itarian contexts: How can an outgoing authoritarian govern- 
ment guarantee its preferences but at the same time provide 
subsequent governments the necessary flexibility to govern 
effectively in the face of change? This fundamental tension 
dictates that constitution makers strike a balance between 
delegation and constraint. 

The second question emerges from the first. The 1980 
constitution has been described variously as authoritarian, as 
a straight-jacket, and as institutionalizing Pinochetismo with- 
out Pinochet. It provides strong checks on civilian authorities, 
is quite difficult to amend, and in many ways is a recipe for 
gridlock. It also provides for nine “institutional” senators (the 
first of whom were appointed by Pinochet and his support- 
ers). They deprived President Patricio Aylwin (1990-94) of a 
majority in the Senate and can act as veto players in defense 
of the Pinochet constitution. 

Londregan asks whether democratic leaders can effect 
change in a political system designed so explicitly to defend 
the status quo. In the process, he teaches us something about 
constitutional design in authoritarian contexts and about 
policymaking in situations of divided government more gen- 
erally. Londregan employs an innovative spatial model that 
explores the interplay between policy preferences and va- 
lence to show how democratically elected presidents can 
overcome the objections of veto players (in this case the 
Senate), even in a system in which reform seems unlikely. 
Drawing on Donald Stokes (“Spatial Models of Party
Competition,” American Political Science Review 57 [1963]:
368-77), Londregan argues that all policy proposals combine
some element of positional content (a policy preference,
often based on ideology) and valence content (widely agreed 
upon values, such as honesty, “good” education, and safety). 
He contends that Chile’s strong presidential system and the 
executive’s advantages in terms of access to information and 
control of the bureaucracy transform the president into the 
most important agenda setter. Legislators are forced to opt 
between presidential policies or the status quo. Presidents 
can use their advantage to propose legislation with very high 
valence content, raising the stakes for legislative noncooper- 
ation and swaying veto players’ positional preferences closer 
to those of the executive. 

Because data are lacking for floor votes in both legislative 
houses, and given the difficulty of determining policy differ- 
ences between parties, Londregan relies on roll-call votes 
from three of Chile’s most important Senate committees 
(Labor, Education, and Constitution). These three deal with 
conflicts central to the democratic transition. Senate commit- 
tees require recorded votes on all amendments offered at 
second readings of bills, which provides a high number of 
concrete indicators of policy preferences. Londregan uses 
these data to estimate the preferred policy outcomes of the 
president and the most important committee members. He 
then analyzes to what extent members vote with one another 
and with their party on a series of issues central to the 
concerns of each committee.

The results are interesting in a number of ways. Londregan 
uncovers multidimensionality in political divisions based on 
particular issue areas, which challenges the notion that the 
center-leftist governing Concertación coalition and the op- 
position simply divide along party lines on every issue. The 
analysis of the Constitution Committee shows that human 
rights legislation is highly polarizing, and members of the 
governing coalition divide along the expected lines. Given the 
depth of division on policy and ideology, the president is 
unable to make proposals with high enough valence to sway 
opposition legislators to agree to alter the status quo. In 
contrast, the opposition is divided on labor issues, and 
Concertación senators have an ally in an Institutional Party 
senator, William Thayer. Thayer's position to the Left of 
most of the opposition gives the president an advantage in 
labor policy, because high valence policies can attract support 
from at least some of the opposition. In terms of education 
policy, the outcome is more complex. Londregan finds a level 
of division similar to that in the human rights arena, but the 
high valence of improving education provides the president’s 
proposals an advantage. On social issues, such as divorce, 
abortion, and drug abuse, Concertación is deeply divided, 
and Christian Democratic senators often side with the Right. 

This work has some real strengths. Londregan successfully 
introduces methods from the U.S. literature in a way not 
often seen in non-U.S. institutional work. His analyses of veto 
players and the interplay of policy position and valence
enhance our understanding of executive-legislative relations 
in Latin America’s presidential systems. He also provides a 
more widely applicable model for statistical analysis of roll- 
call voting in small legislatures. Sophisticated methods are 
combined with a very textured analysis, replete with examples 
and interview data to support the conclusions advanced by his 
models. Londregan provides a valuable window on the 
policymaking process in Chile, and he demonstrates the 
potential for constitutional change, albeit by way of a slow, 
messy, and piecemeal process. 

Londregan could have better connected his argument to 
some of the literature on Chilean politics, especially that 
related to continuity and change in the party system. Refer-
ence to this work would have strengthened his conclusions by
providing additional supporting evidence regarding signifi-
cant continuity in cleavage structures. In methodological
terms, his choice of Senate committee votes makes sense, but 
it does not tell us enough about floor behavior in the
legislative chambers as a whole, where the balance between 
policy and valence may be different, or at the very least much 
more complex. A different balance on the floor would create 
much tougher terrain for the executive to navigate (and with 
different results in terms of success). 

Despite these considerations, Londregan makes a very 
useful contribution to our understanding of the dynamics of 
constitution making, the interplay of valence and policy 
position, and the potential for presidents to overcome the 
opposition of veto players in divided government. For ana- 
lysts of Chilean politics, Londregan provides valuable insights 
about the country’s legislative institutions and convincingly 
demonstrates the potential for reform of Chile’s seemingly 
inflexible constitutional structure. 


Oman: Political Development in a Changing World. By Carol 
J. Riphenburg. Westport, CT: Praeger, 1998. 248p. $67.00.


Mary-Jane Deeb, The Library of Congress 


This is an excellent general study of Oman in the late 
twentieth century. It gives a broad overview of the political
and social structures and institutions of that small country, as 
well as a fair and balanced assessment of its progress in the 
last thirty years and of the problems that lie ahead. As Carol 
Riphenburg describes it, Oman in 1970 “had only the most 
rudimentary social and economic infrastructure but has since 
been transformed under Sultan Qaboos into a modern
oil-producing state. More recently rich than other states in
the area, it serves as sort of a bellwether for the region ... as
it faces the prospect of diminishing oil revenues along with 
the challenges of an area shaken by the turbulence of the 
Iran-Iraq War and the invasion of Kuwait” (p. xi). 

The first chapter, “Terra Firma and People,” is a very 
informative description of a number of elements that have
made Oman the country it is today. Riphenburg argues that 
geography has kept it historically isolated from the rest of the 
Arabian peninsula, although its long coastline on the Arabian 
Sea made it a sea-faring nation. An important section
discusses the ethnic groups that comprise this immensely 
diverse nation of about two million people, one-quarter of 
whom are expatriates. Although the majority are Muslims, 
the diverse sects include Sunnis, Ibadhis, Isma’ils, and Shi’is. 
Ethnically, Arabs are in the minority. Other groups are the 
Baluchis, Zanzibaris, Liwatyah Indians, Hindu Banians, Ira-
nians, and smaller communities of unknown origin, such as
the Zatutis and the Shihuh. 

The chapter on history provides the framework within 
which to understand current political and social develop- 
ments. The survey starts in the eighth century, with the 
establishment of the Abbasid Caliphate in Baghdad, and 
Riphenburg notes that in the middle of that century an 
Omani, Abu ‘Ubaydah, made the first recorded trip from the
Gulf to Canton in China. She describes Oman’s medieval and
early modern history as marked by numerous foreign inva- 
sions by the Portuguese, Dutch, French, and British, among 
others. There were also domestic rivalries and wars, and a 
number of local dynasties ruled the country. The present 
dynasty of Al Bu Sa’id dates to 1749 and has had its share of 
problems. The current sultan took power in 1970 after 
overthrowing his father, a conservative and isolationist ruler, 
with the tacit endorsement of the British. 

The rest of the book focuses on the achievements of Sultan 
Qaboos ibn Sa’id (1940- ), who attended public school in
England and studied at Sandhurst. Upon taking power 
Qaboos was faced with an uprising (against his father) in the 
Dhofar region, his birthplace. He dealt with the crisis by 
issuing a general amnesty to all the enemies of his father in 
the region and then starting a major program of socioeco- 
nomic development and modernization of the country. He 
built roads, schools, and wells and spent one-quarter of the 
government’s development funds on Dhofar alone to quell 
the rebellion. 

The sultan spent the next few years creating a modern state 
within the geographical and political framework of the Ara- 
bian peninsula and the Arab world. A “benevolent autocrat,” 
in early 1975 he introduced two major laws: the Law for the 
Organization of State Administration, which for the first time 
defined the structures and responsibilities of the government 
of Oman, and the Development Law, which set up mecha- 
nisms for an annual budget. In 1981 he created the State 
Consultative Council, an advisory body whose members he 
appoints but who represent the seven geographic regions of 
the country. A new council was created in 1991, named Majlis 
al-Shura, which is more representative and more powerful. It 
is the equivalent of a miniparliament, and representatives are 
now chosen by the people, through a system of election by 
“nominating colleges” and approval by the sultan. Women 
can vote and can be elected to some council seats. 

Because of its religious and ethnic diversity, Oman is one 
of the most tolerant states in the Arabian peninsula. There
are Christian churches, and Hindus (who, unlike Christians 
and Jews, are not considered People of the Book in Islam) 
have been granted religious freedom and have several tem- 
ples and other religious institutions in Oman. Although 
respectful of Muslim tradition, the sultan has spoken against 
extremism and fanaticism. “The sultan’s aim is to demon- 
strate that Islam is consistent with a modern state, interacting 
independently with the modern world. It represents a will- 
ingness to make adjustments to changing conditions in a 
pragmatic manner” (p. 92). 

As for social and economic development since 1970, the 
author maintains that “the sultanate has moved from a poor, 
underdeveloped country toward a modern nation state” (p. 
147). She refers to a World Bank report that considers Oman 
no longer among the ranks of underdeveloped nations that 
need its assistance. Although the commercial production of 
oil, starting in 1967, facilitated growth and development, the 
real story is the management of those resources to ensure the 
education of the population, the distribution of benefits to all 
segments, including women, and the long-term investment in 
such nonoil sectors as agriculture and light industry. 

Riphenburg is fully cognizant of the problems that lie 
ahead for Oman. Diversification is limited, as is private 
investment in economic activities other than trade. Social 
services are costly and may need to be pared down when oil 
revenues decline. There are shortages of water, essential for 
agriculture and the development of a modern infrastructure, 
and foreign investments other than in the oil sector are also 
sparse. “Economic development has resulted in social trans- 
formations, engendering a sense of entitlement among the 
public. To fulfill its expectations, the government must 
ensure sustainable growth” (p. 147). Although significant 
steps have been made in the emancipation of women (who 
can vote, work, own property, and so on), many obstacles, 
mostly traditional, still stand in their way to equality with 
men. 

The author concludes with a look at Oman’s foreign policy.
She argues that “the unchangeable effects of geography and
demography—small populations, vulnerable borders in some
instances, and valuable natural resources—combine with
domestic factors ... to create a Gulf diplomatic style” (p.
184). Foreign policy is highly personal and remains the 
exclusive domain of a very small ruling elite. Decisions are 
made to ensure that dynastic interests are not undermined 
and that borders remain secure. Throughout the Iraq-Iran
war, for instance, Oman kept diplomatic ties with both 
regional powers. Sultan Qaboos has been consistently pro- 
Western, however, and takes a conciliatory, middle-of-the- 
road position on regional conflicts. 

This book would be very useful in a class on comparative 
politics as well as international relations of the Middle East 
or the Gulf region. It is well written but relies almost entirely 
on secondary sources in English. This is somewhat surprising,
since the author mentions in the preface that she studied in 
France and in Cairo and visited Oman on a number of 
occasions. There is little theoretical discussion or attempt to 
put the Omani experiment in a framework other than that of 
the small states of the Gulf. The lack of a conceptual model 
limits to some degree the scholarly influence of an otherwise 
engaging and solid piece of research. 


Politics, Parties, and Parliaments: Political Change in Nor- 
way. By William R. Shaffer. Columbus: Ohio State Univer- 
sity Press, 1998. 290p. $55.00 cloth, $21.95 paper. 

Christine Ingebritsen, University of Washington 

William Shaffer has effectively documented the importance
of the legislature as an arena for conflict resolution in an era
of intense domestic political change. This study is a nice
complement to two other prominent analyses published
recently (Kaare Strøm and Lars Svåsand, eds., Challenges to
Political Parties: The Case of Norway, 1997; and Donald R.
Matthews and Henry Valen, Parliamentary Representation:
The Case of the Norwegian Storting, 1999). Shaffer joins an
emerging group of Norwegian and American scholars who
systematically examine how political parties have shifted to
accommodate changes in electoral politics and why Norwe- 
gian politics (as in other advanced industrial democracies) 
has become less consensual. Shaffer’s contribution is to refute 
the argument made by some scholars regarding the role of 
the legislature and its declining influence in national politics. 

Shaffer develops a theory of democratic political change 
that explains why legislatures are institutionally well suited to 
cope with conflictual politics and, specifically, how this plays 
out in Norway. He analyzes a critical period in Norway’s
political development (1985-96), when the traditional divide 
between socialist and nonsocialist bloc politics unraveled, but 
he situates his study as part of a general history of the 
legislature’s role in Norway’s political development.

Once a prominent institution in Norwegian political his- 
tory, the parliament (Storting) lost influence to the executive 
and the bureaucracy during the postwar development of the 
welfare state. Because corporatist institutions could not 
manage the extent of changes in society and politics, there 
was a “reparliamentarization” of Norwegian politics. Here, it 
should be noted, Shaffer diverges from the classic analysis by 
Peter J. Katzenstein (Small States in World Markets, 1985), 
who maintains that the very institutions of democratic cor- 
poratism permit flexible adaptation to political (and eco- 
nomic) change. Whereas Katzenstein gives priority to the 
role of a tripartite relationship among government, business, 
and labor, Shaffer emphasizes the legislature as the most 
important arena for mediating change. As his four-part 
theory of democratic change (conflict management, consen-
sus building, representation, and lawmaking) seeks to ex-
plain, the legislature is the only institution capable of man-
aging the breakdown in consensus evident in Norwegian
politics from the 1970s to the 1990s. 

The strengths of the book lie in its detailed explanation of 
how the Norwegian electoral system functions; the specific 
issues that have become salient in Norwegian politics; and 
the different phases in the shift from a two-bloc party system 
to a realigned parliamentary coalition system. The coverage 
of how votes translate into mandates, the continuation of a 
national tradition (also outlined by Matthews and Valen) of 
weighted voting for rural areas, and changes in the percent- 
ages of votes for the major political parties are central 
contributions of this volume. Both domestic and interna- 
tional changes have led Norwegians to disagree over their 
relationship to the European Community, appropriate envi- 
ronmental policies, levels of taxation, abortion rights, and 
immigration policies. 

Shaffer’s empirical data are novel, and the presentation is 
rigorous. The multidimensional diagrams of roll call votes in 
the Storting model how Norwegian political parties evolved 
from a clearly defined division between socialist and nonso- 
cialist blocs to a minority party system that transcends the 
traditional division. 

The focus on the legislature necessarily omits the impor- 
tance of other sources of political change in Norway. Al- 
though Shaffer is not a political economist, he gives too little 
attention to the discovery of oil and gas on the Norwegian 
continental shelf and its effects on society and politics. Also, 
charismatic individuals (such as Carl I. Hagen, leader of the 
Progress Party, and Kjell Magne Bondevik, former prime 
minister and member of the Christian People’s Party) are not 
viewed as being as important as larger institutions in managing polit-
ical change and working toward a new policy consensus. The
media (as Shaffer acknowledges on page 219) has grown in 
importance as an independent actor, but this dimension does 
not figure prominently in the book. Furthermore, the effects 
of Europeanization and globalization seem to “stop at the 
border” for this domestic study of Norwegian institutional 
change. Nevertheless, the book exceeds expectations and is 
required reading for those interested in comparative demo- 
cratic institutions, Scandinavian politics, and changes in 
electoral preferences in advanced European welfare states. 


Courts and Transition in Russia: The Challenge of Judicial
Reform. By Peter H. Solomon, Jr., and Todd S. Foglesong. 
Boulder, CO: Westview, 2000. 222p. $25.00 paper. 


Kathryn Hendley, University of Wisconsin-Madison

Courts have been mostly neglected in analysis by political
scientists of the transition from state socialism. This book 
begins to fill this gap by focusing on the current state of 
Russian courts. The primary focus is on the courts of general 
jurisdiction, in which the vast majority of cases are heard, 
including all those in which physical persons (as opposed to 
legal entities) are involved. Although Russia also has a 
constitutional court and economic courts, these institutions 
are less thoroughly analyzed. 

Solomon and Foglesong are among the most knowledge- 
able Western observers of the Russian criminal justice sys- 
tem, and the book benefits from their years of experience. 
Their discussion of the pretrial phase of Russian criminal 
procedure is superb. It combines a concise description of the 
Soviet legacy with a detailed analysis of the recent reforms. 
They uncover some of the reasons these reforms have not 
had the intended results, such as the shortage of competent 
lawyers, which has undermined the expanded right to counsel 
for those accused, and the lack of legal training, which has
thwarted the hiring of additional investigators (in an effort to 
curtail delays). The authors provide empirical documentation 
of the misery and human rights abuses associated with 
pretrial detention, and they argue that the solution is not to 
build more jails but to rethink the policy that allows the 
accused to be routinely detained during lengthy investiga- 
tions. They acknowledge the shortcomings of the existing 
system but reject any wholesale abandonment of it as politi-
cally and culturally impractical. They prefer an approach that
explores how to “make the existing Continental system of 
criminal justice in Russia as fair and workable as possible” (p. 
145). 

The book brings together a great deal of previously 
unpublished information about the operation of the courts of 
general jurisdiction. The system of selection and retention of 
judges has undergone dramatic changes since the late 1980s. 
Solomon and Foglesong describe the new institutions, such as 
the Judicial Department and the Judicial Qualification Com- 
missions, aimed at allowing the Russian judiciary to police 
itself. Yet, they do not merely describe how these institutions 
are supposed to work; they delve into an analysis of how they 
actually operate. The tabular information is consistently 
complemented by detailed explanations of what it means in 
the Russian context. 

Much has been made of the adoption of lifetime tenure for 
the Russian judiciary, but the authors clarify that because 
these new rules came into effect only in 1993, most judges 
(approximately 76% as of January 1998) are still serving out 
fixed terms (p. 32). Solomon and Foglesong are appropriately 
skeptical about the long-term effect of this reform to the 
judicial tenure policy, recognizing that life tenure does not 
automatically translate into increased autonomy. Along sim- 
ilar lines, their analysis of the data on delays experienced by 
litigants provides a useful antidote to the impression created 
by the popular media (both Russian and Western) of a 
Dickensian legal system in which cases drag on for years and 
in which justice is an impossible dream. The authors point 
out that the unreasonably short statutory deadline for han- 
dling cases makes a high level of delays in court statistics 
inevitable. 

The authors’ use of information collected through a survey of
Russian judges is somewhat less successful.
Select questions (in both Russian and English) from the 
survey are replicated in an appendix, along with the number 
of respondents and the breakdown in responses (pp. 207-10). 
The difficulties of conducting a survey among the far-flung 
judiciary of Russia should not be minimized, but the distri- 
bution method—through the chairmen of the council of 
judges, whom the authors elsewhere report as having extraor- 
dinary influence over ordinary judges—seems fraught with 
problems that undermine the reliability and validity of the 
results, which the authors fail to acknowledge. Of the 2,000 
forms sent out, 321 were returned. The authors’ claim of 
“confidence in the representativeness of our findings” (p. 
206) is not convincing. At best, the survey results can be 
treated as suggestive and can serve as the basis for hypotheses 
to be tested more rigorously. 

In parts of the book, the authors carefully limit their 
claims. In reporting on the attitudes of judges toward the 
qualifications commissions, they make it clear that their data 
are limited to the judges who participated in the survey (p. 
36). Elsewhere the lines are more blurred. For example, 
when discussing the extent of financial contribution from 
local governments, the authors push their data too far, given 
that they do not have a representative regional cross-section 
among their respondents. “According to our survey of more 
than three hundred district court judges, 58 percent of courts
received some help from local governments in 1996 and
1997” (p. 39).

Solomon and Foglesong intend the book for both scholars 
and policymakers, although they are most interested in 
having an influence on the latter. Interspersed throughout 
are recommendations to Russian policymakers and Western 
donors engaged in legal reform. Their goal is to outline “what
can and should be done to make courts in Russia autono- 
mous, powerful, reliable, efficient, accessible, and fair” (p. ix). 
This effort to serve a dual audience of scholars and reformers 
yields mixed results. The recommendations are framed in 
terms likely to appeal to policymakers but without the 
theoretical resonance that would interest scholars. In some 
instances, they reflect a lack of awareness of the political 
constraints within which foreign donors operate (e.g., sug- 
gesting that foreign donors contribute equipment, when 
many foreign governments have a strong preference for 
hands-on assistance and a resistance to equipment dumps, p. 
42) and an unwarranted optimism about what can be accom-
plished with donor assistance (e.g., intimating that a confer-
ence sponsored by the U.S.-based National Center for State
Courts could teach Russian judges to become more indepen- 
dent of political and administrative pressure, p. 52). 

The book is unquestionably an important contribution to 
the burgeoning literature on the role of political institutions 
in the transition from state socialism and deserves to be read 
carefully by those interested in this ongoing process in Russia
and elsewhere. 


Japan’s Budget Politics: Balancing Domestic and Interna- 
tional Interests. By Takaaki Suzuki. Boulder, CO: Lynne 
Rienner, 2000. 284p. $59.95. 


Ronald J. Hrebenar, University of Utah 


The key to the real nature of this book lies not in the main 
title but in the subtitle. Takaaki Suzuki is not simply updating 
John C. Campbell’s classic Contemporary Japanese Budget 
Politics (1977). Campbell analyzed the Japanese budgetary 
process from a variety of different perspectives, but his focus 
was completely on the domestic political actors. Suzuki gives 
relatively little attention to the domestic actors and is largely 
concerned with how foreign political and economic pressures 
affect Japanese fiscal policies and related budget policies. 

Suzuki breaks the past 40 years of Japanese political 
history into four different patterns of budgetary behavior, 
each roughly associated with a decade. The 1960s were
characterized by balanced and relatively small budgets (less 
than 20% of annual national income). In the 1970s there 
were greater demands for governmental spending combined 
with growing public and corporate resistance to increased 
taxes, which resulted in greater deficits. By the end of the 
1980s, the government had eliminated the huge budgetary 
deficits. In the 1990s the pattern reversed, and there was 
large-scale use of deficit financing. 

Suzuki attempts to explain these policy reversals over the
past 40 years. He uses a two-level approach patterned after 
Robert Putnam to examine why Japan has experienced 
repeated international pressures to adopt budgetary policies 
to promote fiscal expansion. In addition, Suzuki wants to 
identify the interests and objectives of all the relevant 
Japanese domestic actors, and he explores the relationship 
between international and domestic political forces over time 
as they influence budgetary decision making. 

In general, Suzuki assigns more weight to the domestic side 
of the process than to the international. He argues that the 
twists and turns of Japanese budgetary policy over the past
four decades were largely driven by domestic political forces, 
reinforced occasionally by international pressures that accel- 
erated processes already in motion. Yet, he gives some quite 
plausible reasons for what he calls the international politici- 
zation of Japanese budgetary policy. It stems from the 
awareness that fiscal expansion can result from budgetary 
changes, and these can affect balance-of-payment adjust-
ments. A major contribution made by Suzuki is to detail the
circumstances in which Japan can say “no” to foreign de- 
mands. 

The book is weakest in trying to explain domestic budget- 
ary decision making. The process is never fully detailed, as 
the analysis is largely sited at the systemic level. In the 25 
years since Campbell’s book, many changes have occurred in 
Japanese politics, government, economy, and society. The 
country is well into its second decade of economic recession 
and apparently has not found the secret of how to break this 
pattern of stagnation. The political system has been signifi- 
cantly redesigned, not only in terms of the party and electoral 
system but also in terms of the central government bureau- 
cracy. The role of the Ministry of Finance (MOF) has 
changed in recent years, as politicians and other ministries 
have chipped away at its power. One can only speculate that 
all these changes have affected the budgetary process in 
important ways. Unfortunately, Suzuki does not fully address 
these. 

Suzuki challenges the conventional models regarding Jap- 
anese budgetary politics. He (correctly) identifies the MOF 
as an independent force and not merely a ratifier in the 
decision-making process. The MOF, Suzuki argues, is a 
governmental actor that seeks to fulfill its own interests. In 
his larger analysis, the author rejects rational choice theory as 
well as neorealist and neoliberal perspectives, relying more 
on historical methods. He tries to identify the ruling coalition 
in the various eras and tries to link them to 
outcomes. This produces useful case studies of the four eras.

Japan’s Budget Politics is a significant contribution to 
understanding the economic relationship between Japan and 
the United States over the past three decades. The conclu- 
sion that Japan did not change its policies because of foreign 
pressures but that these largely reinforced policy patterns 
already in place in Japanese politics is an interesting reinter- 
pretation of the way Japan deals with other nations. The 
book is also useful to students of international political 
economy who seek a better understanding of how domestic 
politics and foreign policies are intertwined. The relative lack
of attention to domestic budgetary decision making is a 
disappointment, but although the book is not an update of 
Campbell’s work, it is a quite useful analysis of how Japan’s 
domestic budgetary politics are related to other nations’ 
foreign policies. 

A couple of stylistic weaknesses should be mentioned. The 
book reads too much like a dissertation, and there is consid- 
erable repetition of the central points. These criticisms are 
relatively minor, however, compared to the real significance 
of the analysis. The author is quite persuasive regarding the 
flexibility of the Japanese government in adopting different 
fiscal policies to meet both domestic and international pres- 
sures. One of Suzuki’s major objectives is to build bridges 
between comparative and international politics, and he goes 
far in accomplishing it. Japan’s Budget Politics would be
appropriate for upper division and graduate courses in both 
comparative politics and international political economy.


Libya since Independence: Oil and State-Building. By Dirk
Vandewalle. Ithaca, NY: Cornell University Press, 1998. 
226p. $49.95 cloth, $17.95 paper. 

Ali Abdullatif Ahmida, University of New England 

In 1951, Libya became an independent state, a federal 
monarchy. In 1969, a military coup overthrew the Sanusi 
monarchy and brought Mu’ammar al-Qadhafi to power. 
Since that time Libya has become invisible, the least known 
of the North African states in comparison to Egypt, Tunisia, 
Algeria, and Morocco. Continuing conflict with the United 
States over terrorism and the question of Palestine has made 
Libya inaccessible to foreign scholars. In the absence of 
serious scholarship, competence in Arabic, or reliable jour- 
nalistic knowledge of Libyan society and politics, Western 
mass media and many academics have reduced Libyan poli- 
tics to the persona of Qadhafi, who frequently is character- 
ized as a mad dog who rules a rogue regime. This new book 
by Dirk Vandewalle is therefore an especially welcome 
contribution. 

Vandewalle is among the few scholars—such as John 
Davis, Francois Burgat, Taoufik Monastiri, Moncef Djaziri, 
and Lisa Anderson—who have actually visited Libya and 
done research there. His earlier in-depth analysis of the 
Libyan experimentation in state-building is marked by open- 
ness and avoids the usual oversimplification of focusing on 
the Libyan leader, although it presents a critical examination 
of the Qadhafi government. This book is an elaborate 
synthesis and assessment of Vandewalle’s major arguments 
regarding state-building in Libya from 1951 to 1996, with 
special attention to the period following discovery of oil in 
1959 and its marketing in 1961. Oil has become Libya’s major 
source of revenue. 

Libya since Independence addresses three central ques- 
tions. (1) What happens when rulers of a “late developer” are 
freed from the burden of taxation? (2) How does this 
ing luxury affect the process of building the local state?
(3) How do local rulers create the economic and political
processes that under more conventional historical circum-
stances emerge from compromises? The book contains a
theoretical review of the literature on state formation and the
“rentier/distributive state,” an examination of Libya since
independence under two regimes, and reflections on the
Libyan case and beyond. Vandewalle’s eclectic approach to
the rentier state, state autonomy, and late developers is
derived from the literature on new political economy, insti- 
tutional economics, comparative politics, and rational choice 
history of taxation and revenues. 

Specifically, Vandewalle focuses on four boom and bust 
periods in the world oil market: the boom of 1963-69 during 
the Sanusi monarchy, the boom under the military regime 
between 1969 and 1973, the boom of 1973-79, and the bust of 
the early 1980s. He also evaluates the failed attempt at 
political and economic reform in 1987. The central thesis is 
that Libya is not an aberration but a distinct case of a 
rentier/distributive state. Like other rentier states, such as
Saudi Arabia, Qatar, and Kuwait, Libya relies mostly on oil
revenues and less on taxation of its citizenry. Hence, the state
has more autonomy than in conventional cases of state 
formation. 

Two factors make Libya a distinct rentier state. First, the 
government relies exclusively on oil export revenues. Second, 
and more important, the historical timing of rentier status 
occurred when state institutions and national integration 
were in their infancy. The author faults both the Sanusi 
monarchy and the revolutionary regime of Qadhafi for 
suspending the process of building strong economic and 
political institutions fundamental for successful state forma- 
tion. These institutions would have enabled the state to 
adjust to shrinking oil revenues and diversify the economy. 


Vandewalle’s analysis demystifies the Libyan case and 
significantly contributes to a fresh theoretical comparative 
analysis of the rentier state in general. It challenges both the 
popular view of Libya as a rogue nation and such ahistorical 
cultural notions as Islamic ideology and tribalism, which 
dominate current Middle Eastern studies. Vandewalle pro- 
vides insight into the underlying processes and political 
economy of state-building. His belief about the central 
importance of strong state institutions allows him to explain 
why the 1987 reforms failed and why in the post-Qadhafi 
period Libya will face a tough future. Oil-dependent rentier 
states are inevitably weak because of weak institutions, 
argues Vandewalle, and are more vulnerable to economic 
crises. 

The main methodological problem in this work, which 
offers fresh and welcome insights, is the one-sided focus on 
the state at the expense of society. Absent are the social 
forces, voices, and agencies of the Libyan people, not as 
clients and tribesmen but as real participants in history. State 
and class formation under colonialism are overlooked. Some 
institutions of today’s Jamahiriyya “state of the masses” 
experiment, for example, are not just reactions to monarchy 
but rooted in the Tripolitanian republic and the Sanusiyya 
movement. The study provides no serious analysis of the 
opposition under the monarchy and current regime, such as 
student movements, the role of intellectuals, the army, un- 
derground political parties, exiled political activists, and 
especially today, Islamic opposition movements. 

These problems aside, Vandewalle has written a major 
scholarly work. His research provides a major corrective 
critique to cultural and idiosyncratic accounts of the Libyan 
state. The book treats Libya not as an anomaly but as a 
historical case of a distributive state. This alone is an 
important contribution to our knowledge of Libyan politics in 
particular and our understanding of distributive states in 
general. 


Parliamentary Democracy: Democratization, Destabiliza- 
tion, Reconsolidation, 1789-1999. By Klaus von Beyme. 
New York: St. Martin’s, 2000. 248p. $59.95. 

Kerstin Hamann, University of Central Florida 

Klaus von Beyme tackles an important issue, the develop- 
ment of parliamentary democracies in both historical and 
comparative perspective. He compares the stages of democ- 
ratization with the separately occurring sequence of parlia- 
mentarization and shows how the introduction and consoli- 
dation of parliamentary systems is tied to the issue of regime 
changes more generally. He also discusses changes in the role 
of parliament over time. Von Beyme provides a historical 
analysis of parliamentary democracies rich in descriptive 
detail. Yet, he goes beyond a narrative account of the 
development of parliaments: “Classification of the causes of 
government termination and patterns of coalition building 
nevertheless need quantification in order to escape mere 
description” (p. 3). True to this introductory statement, the 
author provides throughout the book a wealth of useful and 
detailed tables that present comparative data on parliaments. 
The book is clearly structured, beginning with an overview 
of different regime types and the role of parliament in these 
different systems. Von Beyme analyzes the position of par- 
liaments in monarchies, presidential systems, and semipresi- 
dential systems. This conceptual clarification is followed by a 
discussion of waves of parliamentarization and democratiza- 
tion, in which it is argued that these were distinct occur- 
rences. The author shows, for example, how “revolutions 
rarely led to representative government because parliamen- 
tary government was not an idea that could mobilize the 
dissatisfied masses” (p. 17). 

In addition, von Beyme discusses different aspects of 
parliamentary sovereignty, one of the prerequisites of parlia- 
mentary democracy that relates to the internal organization 
of parliaments. He illustrates considerable variation over 
time and across countries with respect to the internal work- 
ings of parliaments. 

Another area of comparative inquiry relates to the func- 
tions of parliament, which, von Beyme argues, in modern 
democracies basically lie in the relationship between the 
deputies and the citizens (representation and articulation of 
interests), the relationship between parliament and govern- 
ment (controlling function), the relationship to the needs of 
society (legislative function), and the relationship of parlia- 
ment and government (recruiting function) (p. 72). The 
relative importance of these functions has changed over time. 

Parliaments also have a relationship with the head of state. 
This is particularly complex in presidential and semipresiden- 
tial systems, although the head of state has a very reduced 
function in parliamentary systems. Von Beyme identifies the 
party system as the “most important variable in relations 
between the president and the parliamentary majority” (p. 
115). 
Finally, the author examines the relationship between the 
government and the parliamentary majority. He further 
provides a comparative overview of coalition formation and 
the role of the prime minister. Von Beyme posits that 
parliaments have lost much of their significance in bringing 
down governments. Instead, elections, cabinet collapses, or 
coalition collapses have become a more frequent reason for 
government turnover (p. 207). 

Von Beyme argues that the decline of parliamentary power 
is a logical extension of the democratization of parliamenta- 
rism. Parliamentary power is cut by the judicialization of 
politics, especially when parliamentarians anticipate the in- 
tervention of constitutional courts during the legislative 
process (p. 212). Other factors weakening the role of parlia- 
ment include the nature of foreign policy and the supremacy 
of the European Union for European democracies (p. 213). 
Processes of regionalization and simultaneous globalization 
further erode the autonomy of national parliaments (p. 215). 
Despite these developments, von Beyme contends, national 
parliaments are likely to retain an important regulatory 
function, especially with respect to the welfare state. 

An outstanding feature of the book is the wealth of 
comparative data in tabular form in conjunction with the 
narrative account of the historical development of parlia- 
ments. Von Beyme highlights the variation in parliamentary 
systems and parliaments themselves but also points to the 
common features of parliamentary systems. This alone makes 
the work a valuable asset for comparative research. The book 
not only speaks to scholars interested in comparative legis- 
latures but also ties in with the literature on regime change 
and political institutions. It also adds to the literatures on 
electoral laws, government formation, and the distinction 
between presidential and parliamentary systems. It is not an 
accident that von Beyme links these bodies of work, which 
often remain distinct from one another. Parliament, despite 
the ongoing erosion of its role, is still at the heart of the 
policymaking process in many democracies and, as such, ties 
together a large number of different political actors and 
institutions. 
The book combines various areas of interest in compara- 
tive politics in a meaningful way. It also provides a unique 
overview of different variants of parliamentary systems. Von 
Beyme argues cogently that, despite the weakening of par- 
liament, an immediate crisis of democracy does not loom on 
the horizon, and parliaments still retain important functions, 
although these are different—or at least have different 
emphases—from in the past. The major shortcoming of this 
otherwise outstanding book is the lack of an overarching 
argument or theoretical framework. The wealth of informa- 
tion it provides could have benefited from an analysis that is 
more explicitly guided by a theoretical argument. 

Also, the book does an excellent job in presenting com- 
parative data about parliaments, but the explanation of these 
data remains sketchy. Although this could easily be inter- 
preted as a weakness, it is also a major strength, as it raises a 
host of unanswered or only partially answered questions. 
Anyone interested in the comparative study of parliaments 
will find a host of new research questions, both comparative 
and historical. The stimulation of new areas of inquiry may 
well have a significant effect on the discipline. This is a highly 
useful addition to the literature on parliaments and should be 
read by anyone interested in any aspects of comparative 
parliaments or comparative democracies. It is also a welcome 
addition to the literature on democratization. 


The Mysteries of Development: Studies Using Political Elas- 
ticity Theory. By Herbert H. Werlin. Lanham, MD: Uni- 
versity Press of America, 1998. 409p. $68.00 cloth, $25.00 
paper. 

Paul Clements, Western Michigan University 

This book is the product of a lifetime. Inspired by Dwight 
Waldo and Sheldon Wolin, who were Werlin’s professors at 
Berkeley, and informed by decades as a consultant to the 
World Bank, Werlin seeks to unify the literatures of public 
administration and comparative politics, to explain the re- 
quirements for economic growth under any kind of govern- 
ment, and to provide an agenda for World Bank policy in 
low-income countries. Although I believe he is motivated by 
great confidence in good will, his simplistic approach to 
theory would wash out the analytic power that comparative 
politics achieves through institutional analysis. The book at 
points becomes a commentary on debates within the World 
Bank but also reflects many of the bank’s biases. The 
argument is constructed around a set of dualistic definitions 
that constitute a frame of reference in which the virtue, 
knowledge, and practice needed by less developed countries 
lie in the more developed countries. Werlin’s policy agenda 
reflects both blindness and bias, with frightening potential 
consequences. 

For Werlin, the key distinctions are between political 
hardware and software and between political elasticity and 
inelasticity. Hardware refers to objective rules, procedures, 
technologies, organizational arrangements, and methods. 
Software refers to the quality of relationships between lead- 
ers and followers as manifest in attitudes, teamwork, and 
morale (p. 8). Good political software leads to (or consti- 
tutes) political elasticity, and for Werlin this is everything— 
the universal criterion for good government and cause of 
development. Moreover, good software can be achieved 
through specific steps: “establishing acceptable goals, hiring 
qualified personnel, encouraging training, delegating respon- 
sibility, stimulating motivation and competition, paying atten- 
tion to morale, expanding two-way flows of communication, 
promoting legitimacy, maintaining supervision, cultivating 
contractors, protecting independent spheres of authority, and 
developing conflict-resolution procedures” (p. 64). This list is 
repeated several times. 

Political elasticity refers to the extensive quality that can 
exist when power is exercised in an accommodating and 
consensual manner within a framework bounded by credible 
threats, as compared to power that is merely directive and 
coercive. “Under political elasticity, forms of decentralization 
can be expanded without reducing centralization; under 
political inelasticity, leaders fear decentralization, thereby 
weakening their implementation capacity” (p. 117). 

Werlin’s method provides no protection against bias. The 
primary form of argument is to assert a relationship and to 
substantiate it with a series of examples drawn largely from 
World Bank studies. When Werlin argues against Weber, 
behavioralists, and economists, he summarizes their positions 
in one or a few points, without engaging the fabric of their 
argument or showing how they arrive at conclusions different 
from his own. He does not explore alternative explanations, 
for example, from comparative politics, for the problems he 
addresses. His unification of public administration and com- 
parative politics is achieved by placing software at the heart 
of his analysis, thereby reducing political and administrative 
problems to the quality of leadership (pp. 271, 293-4). The 
particular form of his dualism, however, has the effect of 
placing the source of leadership in the World Bank. 

Werlin does summarize critiques of the bank’s project 
implementation. Since his analytic has no place for 
political forces acting through the bank, however, he can take 
no account of the bank’s systematic contributions to devel- 
opment failures. Also, he perhaps inadvertently reproduces 
the bank’s arguments on development approaches, which are 
contested in much of the comparative politics literature. 

Werlin explains political elasticity by comparing examples 
of well and poorly managed programs and showing that the 
latter lack the good political software exhibited by the 
former. Elasticity explains why there is better garbage collec- 
tion in Japan than in Nigeria, better rural development in 
Korea than in Ghana. All poorly managed programs have 
characteristics that violate Werlin’s list. His argument that 
politics is “more powerful than economics” (p. 16) will be 
familiar to economists as a discussion of public goods. 
Governments fail to provide law and order, basic infrastruc- 
ture, and a conducive institutional environment, so there is 
little investment (p. 153). A politically elastic government 
surely would provide these goods. 

Werlin’s two other main dualisms are primary and second- 
ary corruption and primary and secondary democracy. Pri- 
mary corruption refers to rule violations in a polity in which 
rules are generally upheld, and secondary corruption is rule 
violations for private gain without fear or shame. Primary 
corruption is widespread in developed countries, but Werlin 
believes secondary corruption, which he sees as a “political 
illness” analogous to drug addiction or alcoholism, is charac- 
teristic of less developed countries (p. 199). Primary democ- 
racy, in line with political hardware, refers to elections and 
majority rule, whereas secondary democracy involves discus- 
sion, consensus-building, reconciliation, and leaders who 
reach out for popular approval (p. 9). Because primary 
democracy is so often not conducive to economic growth in 
poor countries, Werlin argues that donors “should not push 
them into primary democracy before they are ready for it” (p. 
234). 

From Werlin’s seat in an elastic primary democracy, things 
look pretty bleak for the desperate masses. “There is so little 
evidence of real professionalization in [less developed coun- 
tries] that advocacy of ‘deprofessionalization’ in these coun- 
tries might be greeted with amusement, to say the least” (p. 
162). Based on nothing but anecdotes, the author concludes 
that, “as a generalization, we can say that the more impov- 
erished a country is, the more likely the press is to be 
suppressed, intimidated, or undermined in various ways” (p. 
190). We should think of helping countries with per-capita 
income of about $400 (as in India), Werlin suggests, as 
analogous to helping vagrants who may “suffer from various 
forms of mental illness or social pathology” (p. 305). In this 
light one can understand his outrageous argument that the 
World Bank should include language in project agreements 
that allows bank staff to require the removal of uncooperative 
officials (p. 330). 

To see the significance of a comparative institutional 
approach, contrast “political elasticity” with the comparative 
politics standard of Peter Evans (Embedded Autonomy, 
1995). When Werlin presents a politically elastic financial 
system, what he describes approximates the neoclassical ideal 
of a liberal financial market (pp. 26-7). In describing South 
Korea under Park as a model of political elasticity, however, 
he inconsistently acknowledges the extreme bank-based and 
inflation-financed interventionism of Park’s government. He 
uses the World Bank’s oxymoron for such cases of interven- 
tionism: “market sustaining incentives and disincentives” (p. 
281). Evans agrees that Park provided important leadership 
but argues that a history of strong bureaucracy, dominant 
classes weakened by colonialism, and Korea’s position in 
Cold War geopolitics were no less important for the country’s 
rapid growth. Korea’s primitive capital accumulation prob- 
lem was solved by an exclusive alliance between the state and 
business leaders that was much more coercive, particularly in 
its early stages, than Werlin can acknowledge. Werlin takes 
no account of different structural conditions; political elastic- 
ity is the ideal for all governments at all times. 

The small government, free-market regime that the World 
Bank enforces on less developed countries conflicts with the 
technologically focused investment strategies Evans recom- 
mends. He would see the bank’s free-market ideology as 
working hand in hand with the interests in commodity 
imports and manufactured exports and in political influence 
over low-income countries of the governments that control 
the bank. 

In the final chapter, Werlin acknowledges that the bank 
has been driven by its shareholder governments and its 
internal organizational incentives to emphasize quick project 
preparation and rapid disbursement (pp. 321-2). He notes 
that because bank staff take the initiative in project prepara- 
tion there is often little local “ownership” of the projects. 
Under his dualisms, however, the idea cannot arise that this 
could undermine the corporate coherence a government 
needs to retain access to bank funds while dismissing much of 
its advice. He does not see that debt burdens following poor 
project implementation can serve the interests of developed 
countries. In this context, his recommendation that the 
World Bank should establish an Economic Reform Olympics, 
to intensify pressure and promote greater competition, 
should be viewed with alarm. 


The Soul of Latin America. The Cultural and Political 
Tradition. By Howard Wiarda. New Haven, CT: Yale 
University Press, 2001. 417p. $35.00. 


Brian Loveman, San Diego State University 


This book is vintage Wiarda, a synthesis of more than thirty 
years of research and writing on the Iberian tradition and the 
Catholic, absolutist, hierarchical, politicosocial system in- 
stalled in Latin America by Spanish and Portuguese con- 
quest. Stressing these continuities in Latin American societ- 
ies (“the Latin American soul,” p. 281), the author traces the 
origins and evolution of Latin American political culture 
from Greek and Roman times until the end of the twentieth 
century and compares it with the political traditions of 
Anglo-America. 

Like much of Wiarda’s work, this book is dedicated to 
describing the differences between the Iberian politicocul- 
tural tradition—from Roman and Moorish influences, medi- 
eval Catholicism, the reconquista (711 A.D.-1492), the coun- 
terreformation, authoritarianism, corporatism, and the 
mediated influences of positivism in Latin America in the 
nineteenth century—and the Anglo-American political tradi- 
tion. Emblematic of this difference is the juxtaposition of 
political philosophers John Locke, the supposed inspiration 
of U.S. democratic institutions, and Francisco Suárez, a 
Spanish political philosopher “almost unknown in the United 
States” but the foremost writer among sixteenth-century 
Spanish neoscholastics. Chapters on liberalism, positivism, 
nationalism, Marxism, corporatism, the “conflict society” 
from the 1930s to the 1980s, and the transitions to democracy 
(1978-90) are framed as challenges to, and constrained by, 
the Ibero-Catholic tradition. In each case Wiarda considers 
how Latin American societies changed or, more commonly, 
maintained cultural continuities over some five hundred 
years. 

Wiarda also seeks to document the consequences of the 
survival of the Latin American soul in the region and its 
implications for U.S.-Latin American relations. Evaluation 
of U.S. policy is usually framed by a “realist” approach: U.S. 
policy was, and should be, driven by U.S. interests, as defined 
by policymakers like Richard Nixon, Henry Kissinger, and 
George H.W. Bush. Whatever U.S. interests are pursued or 
whatever policies adopted, Wiarda emphasizes that there is 
no reason to expect Latin America to conform to “democrat- 
ic” norms, that is, liberal pluralism, as understood in the 
United States. He properly warns students and policymakers 
that Latin America, with all its cultural variation (which 
sometimes gets lost in this book), must be understood on its 
own terms. U.S. legislatures, courts, municipal councils, the 
presidency, the federal system, church-state relations, and 
racial policies and practices are not apt benchmarks for 
equivalent or comparable institutions in Latin America. 

Wiarda concludes the book with the following advice to 
U.S. policymakers, inevitably to be ignored: “The United 
States needs to be very careful before heavy-handedly muck- 
ing around in other people’s countries. Not all democracies 
have to look exactly like the United States. For American 
policy to get too far ahead or fall too far behind the country 
being assisted is a formula for disaster for both parties.” 
What this means in practice is that the United States should 
not “push for too much democracy before the country can 
handle it” (p. 358). 

Wiarda’s interpretation of a unique Iberian politicocul- 
tural tradition is an important foundation for students of 
Latin American politics. His interpretation of Latin Ameri- 
can politics and U.S. regional policies in the post-1950s 
period is another matter. 

Regarding Latin American politics in the 1960-80 period, 
Wiarda claims, for example, that the bureaucratic authoritar- 
ian regimes of the 1960s and 1970s represented an effort to 
restore the status quo ante (p. 305). Yet, all the military 
regimes and their civilian counterparts rejected the past, 
whether the populist Peruvian and El Salvadoran military of 
the 1970s or the “liberal” Chilean, Argentine, and Brazilian 
military. The coups and repression came in the name of 
anticommunism but also of economic development and tech- 
nological innovation, that is, “the future,” not the past. The 
post-1964 military regimes, from Brazil, Uruguay, and Chile 
northward to Guatemala, reified “modernization”; none pro- 
posed going back to a society dominated by large semifeudal 
latifundios, a society of castes and privilege based on color, 
ethnicity, guild-like corporations, family ties, or even legisla- 
tures controlled by landed oligarchs. 

Military juntas with civilian allies sought to demobilize and 
repress populist and revolutionary movements in Latin 
America, in part as a reaction to the Cuban revolution, but 
never, as Wiarda suggests, to “turn the clock back” (p. 308). 
The military regimes were antipolitical, antiliberal, and anti- 
democratic, but none of them alluded to Francisco Suárez or 
Saint Thomas (except perhaps in the latter case, indirectly, 
with allusions to the notion of natural rights, “just war,” and 
the “right to rebellion,” that is, for military coups to overturn 
elected governments). In any case, Saint Thomas would 
hardly have applauded market determination of “just price” 
or “just wage,” or the privatizations and financial liberaliza- 
tion promoted in Chile, Mexico, and elsewhere in the 1980s. 

Even more important, Wiarda underplays or ignores the 
crucial role of the United States in sustaining, when not 
promoting, the military dictatorships of the 1960s and 1970s. 
The civil-military regimes after 1964 relied significantly on 
U.S. support for counterinsurgency, civic action, and state 
terrorism on an international level. Not even in the index 
does Wiarda refer to “Pentagon,” “counterinsurgency,” “covert 
operations,” or “Operation Condor.” Acknowledging 
what he calls the often sorry history of U.S. interventionism 
in the Caribbean and Central America in the early twentieth 
century, he comments: “Such interventions are simply what a 
great power, a paramount, a hegemon does to protect and 
advance its interests” (pp. 180-1). Maybe so, but the question 
Wiarda does not pose or seek to answer is to what extent U.S. 
meddling, coercion, exported racism, and military occupation 
from the 1890s until the end of the twentieth century warped, 
if not transformed, the Latin American soul. 

Like the U.S. policymakers whom he seeks on occasion to 
advise, Wiarda generally overlooks the effects of pernicious 
and hypocritical U.S. regional security and economic policies 
from 1945 to the present. After accepting Wiarda’s claim that 
the soul of Latin America is not a natural fount of liberal 
pluralism, the next step is to explore the sequelae of its 
violation over and over again by policymakers in Washington, 
DC, in ways that have determined, even overdetermined, 
political outcomes in the region for the last century. In the 
end, Wiarda’s insight on the historical origins of Latin 
American political culture is blurred by his interpretation of 
post-World War II politics and the effect of U.S. foreign 
policy in the region. He fails to reveal to his readers the effect 
of the Devil’s work on Latin America’s soul. 


Legal Rules and International Society. By Anthony Clark 
Arend. Oxford: Oxford University Press, 1999. 208p. 
$35.00 cloth, $17.95 paper. 

Brad R. Roth, Wayne State University 

The past decade has been marked by an increase in scholarly 
efforts to bridge the gap between the study of international 
relations as undertaken by political scientists and the study of 
international law as undertaken by legal scholars. During the 
Cold War, when international norms seemed distinctly sec- 
ondary to ad hoc cost-benefit calculations as influences on 
state behavior, political scientists largely dismissed interna- 
tional law as rank idealism. International cooperation has 
palpably broadened and deepened in the current era, and 
political scientists have started to take a second look. Inter- 
disciplinary dialogue is fraught with difficulties, however, 
since political scientists and legal scholars approach the 
subject with different questions and different methodological 
frameworks in mind, even assigning wholly different mean- 
ings to many of the same terms. 

Legal Rules and International Society should be seen as a 
significant contribution to the literature of both disciplines. 
The book is a sweeping effort both to explain the competing 
tendencies in each discipline to the practitioners of the other 
and to demonstrate how a workable interdisciplinary enter- 
prise emerges from the interaction. 

Arend argues that legal rules play a distinctive role in 
international relations. That role is best appreciated by 
adopting his own nuanced form of legal positivism as an 
approach to apprehending those rules (pp. 86-104) and by 
using constructivism as an approach to understanding their 
effect on the behavior of international actors (pp. 124-48). 
Arend takes as examples the UN Convention on the Law 
of the Sea (UNCLOS) and the European Union treaties 
(pp. 142-7), and he contends that international legal rules 
can affect not only states’ treatment of nonessential matters, 
as rationalist institutionalists and even structural realists 
can concede, but also states’ interests and even their 
very identity. Invoking Friedrich Kratochwil’s observation 
that the rules of chess define the game itself (p. 130), Arend 
makes the case that international law can, over time, verifi- 
ably influence the ways in which state actors conceptualize 
their aims and the options available to them in pursuit of 
those aims (pp. 129-32, 137-48). (That statehood itself is a 
juridical construct, with international mechanisms according 
or withholding recognition of that status, goes some distance 
in substantiating this point.) Arend’s purpose is largely to 
invite, and to orient, empirical study of these constitutive 
effects. 

The first half of the book examines the nature of interna- 
tional law or, more specifically, international legal rules; 
chapters explore their character, their sources, and compet- 
ing methods of ascertaining their existence. Arend demon- 
strates some of the ways in which legal rules differ from other 
rules operating in the international arena, and he offers a 
standard examination of what jurists traditionally cite as the 
sources of such rules. He then presents and contrasts the 
law-finding methods offered by legal positivism and its lead- 
ing competitor, the “New Haven” approach of Myres S. 
McDougal, Harold D. Lasswell, and W. Michael Reisman. 
Arend does an especially effective job of summarizing the 
complex New Haven approach, which combines an empirical 
inquiry into whether ordering processes manifest requisite 
levels of “authority” and “control” with a normative inquiry 
into whether those processes are adequately oriented toward 
the furtherance of “human dignity” (pp. 76-86). Arend 
dismisses this last, teleological aspect as unhelpfully subjec- 
tive, observing that “there is, at present, no one conception of 
human dignity that the decision-making elites have affirmed” 
(p. 86). 
Arend purports to present “a new method for determining 
rules of international law [that draws] on the insights of both 
positivism and the New Haven School while attempting to 
avoid the pitfalls of both” (p. 86). He extracts part of the New 
Haven analytic framework for his own purposes, but discards 
(1) its inclusion of nonstate actors as independent sources of 
legal authority, (2) its focus on processes rather than rules, 
and (3) its teleological “dignity” component. The result is 
essentially a positivist method (albeit differing in minor 
respects from the approaches of certain positivist authors) 
expressed in New Haven vocabulary. Thus, “authority” is 
reduced to the opinio juris of states (i.e., the subjective sense 
of legal obligation that state actors manifest by their acts, 
omissions, and statements in particular contexts), and “con- 
trol” is reduced to the consistency of conforming practice 
(pp. 86-104). The relationship of the two is helpfully illus- 
trated in an assessment index, although this serves more to 
surface than to resolve the traditional ambiguities about how 
much evidence of either element is requisite to finding a legal 
norm. 

Arend next addresses the effects of international legal rules 
on international behavior, and he argues for the superiority 
of a constructivist approach over the alternatives of structural 
realism and rationalist institutionalism (pp. 111-48). He goes 
on to project a “neo-medievalist” future for the international 
system, with Westphalian sovereignty dissolving into a more 
complex scheme of overlapping authority and multiple loy- 
alties (pp. 171-85). He concludes that the study of inter- 
national legal rules establishes the existence of an “interna- 
tional society” that can be comprehended only through 
interdisciplinary cooperation, all the more so as that society 
undergoes change (p. 190). 

In laying out the grand scheme, Arend stakes out positions
on a series of controversies within both international law and 
international relations. Although the positions he takes are 
consistently plausible, his arguments frequently fall short of a 
convincing vindication of the advocated stance. The book 
seeks to cover so much ground in so little space that it cannot 
do justice to the full range of competing considerations that 
attend each issue. 

For example, Arend summarily adopts the controversial 
position that law is essentially a set of rules rather than a 
process (p. 26), thereby loading the discussion in the direc- 
tion of legal positivism from the start. By focusing almost 
exclusively on the recognition of rules, he sidesteps the 
difficulties in interpretation and application that tend to 
problematize the positivist characterization of law. That law 
can be understood only as a purposive process is a theme of 
some of the most significant works of contemporary jurisprudence
(e.g., Lon L. Fuller, “Positivism and Fidelity to
Law—A Reply to Professor Hart,” Harvard Law Review 71
[February 1958]: 630-72; Ronald A. Dworkin, Law’s Empire, 
1986). Although Arend engages in some detail (pp. 76-86) 
with a methodological tendency—the New Haven School— 
that regards law as process, he never fully acknowledges the 
grounds for that view. Given the book’s ambition to do justice 
to the competing tendencies, this is a significant omission, 
even if the “law as rules” characterization is considered to be 
convincing on the merits. 

There is inevitably much to quibble about in Arend’s 
summary treatment of the various methodological frame- 
works, but his well-organized, well-written, and concise text 
provides an excellent starting point for thinking through the 
interconnections between international law and international 
relations scholarship. In this regard, the book is of value not 
only to graduate students and upper-level undergraduates 
but also to scholars of either discipline, especially those who
are relatively unfamiliar with the premises of, and with recent 
work within, the other discipline. 


Semi-Detached Idealists: The British Peace Movement and
International Relations, 1854-1945. By Martin Ceadel.
New York: Oxford University Press, 2001. 477p. $85.00. 


Andrew Rigby, Coventry University 
In 1980 Martin Ceadel published his definitive study, Paci- 
fism in Britain, 1914-1945. Since then he has established 
himself as the leading historian of British peace movements. 
His prime focus in a series of scholarly publications is the 
analysis of the different stances toward peace and war 
adopted by the various organizations that could be consid- 
ered part of the broad British peace movement in the first 
half of the twentieth century. In recent years he has delved
farther back in time to produce a comprehensive two-volume 
survey of activism against war from the French Revolution 
until the dawn of the nuclear age. 

In the first volume, The Origins of War Prevention: The 
British Peace Movement and International Relations, 1730- 
1854, Ceadel traces the process whereby—under the combined
influences of the Enlightenment, evangelical Christianity,
and changes in the international system (especially after
the defeat of Napoleon in 1815)—there developed a belief 
that human agency can limit the incidence of war. In the 
aftermath of the French wars of 1793-1815, the first peace 
association was established in Britain, and as domestic poli- 
tics gradually liberalized, the appeal of the Society for the 
Promotion of Permanent and Universal Peace, normally 
known as the Peace Society, broadened. 

In the current volume Ceadel traces developments from 
the Crimean War to the end of World War II. He examines
the manner in which the burgeoning belief that international 
relations, at least among “civilized nations,” can be con- 
ducted without war received something of a setback in 1854, 
when Britain declared war against Russia, and when the 
subsequent peace was disrupted by the American Civil War 
of 1861-65. 

The extension of the franchise to working men in 1867 
enabled the peace constituency to widen its appeal, which led 
to formation of the Workmen’s Peace Association. The 
Franco-Prussian War of 1870-71 and the subsequent armed 
peace between the two continental European powers stimu- 
lated interest in arbitration and the codification of interna- 
tional law. 

During the last two decades of the nineteenth century, 
however, the significance of the Peace Society and associated 
organizations weakened as Britain embarked on its imperi- 
alistic military forays into Egypt and the Sudan. In the first 
decade of the twentieth century there was renewed interest in 
the “peace question.” The First Hague Peace Conference 
was convened, there was opposition to the Boer Wars in 
South Africa, and the growing socialist movement in Britain 
(and elsewhere) sought to bring an end to war by means of 
international working class solidarity. 

The failure of the established peace societies to rise to the 
challenge of World War I left the field open to an array of 
new organizations, such as the Union of Democratic Control, 
the No-Conscription Fellowship, the League of Nations 
Society, and the Women’s International League. These orga- 
nizations channeled the widespread disillusionment with the 
outcome of the war to end all war that reached its peak of 
intensity during the early 1930s, when there was widespread
commitment to disarmament and the strengthening of the 
League of Nations. 

By 1936, however, with the rearmament of Germany, the 
Italian invasion of Abyssinia, and the outbreak of the Spanish 
Civil War, the British peace movement and public opinion 
began to polarize. The Peace Pledge Union rapidly expanded 
into Britain’s only mass-based pacifist movement, but others 
began to acknowledge the need to prepare for the impending 
military struggle against fascism and the “containment” of 
Germany. 

The experience of the war and the subsequent disclosure of 
the horrors of the concentration camps had a powerful 
influence on the British peace movement, particularly its 
pacifist wing. Was there a nonviolent answer to totalitarian- 
ism? How significant was conscientious objection against 
atomic weapons? By the 1950s the movement began to 
reassert itself with the emergence of the Campaign for 
Nuclear Disarmament, which is part of the worldwide move- 
ment to avert nuclear destruction that continues to this day. 

The focus throughout the book is upon peace organiza- 
tions as “ideological protagonists,” concerned with the moral 
and practical problems posed by war and committed to 
propounding solutions to them. Ceadel includes within the 
general term “peace movement” all those who not only 
oppose aggressive war but also are inspired by the belief that 
it is possible to achieve a world without war. Within this 
category, however, he makes the linguistically painful distinc- 
tion between pacifism (the conviction that war is always 
wrong and should never be resorted to) and pacificism (the
belief that war can be abolished, but until the necessary 
circumstances have been achieved it may be necessary on 
occasion to resort to defensive military force). Ceadel has 
written for many years about what is essentially the difference 
between the absolute pacifists and others within the peace 
movement, but these labels are confusing. Indeed, it is clear 
that Ceadel understands the difficulties by italicizing the 
terms throughout the text. 

Another problem is that the focus is upon the different 
analyses of the nature of war and the international system 
propounded by different historical actors over a period of 
nine decades, but there is very little discussion on the social 
or political significance and scale of the different organiza- 
tions and associations. Indeed, an argument can be made that 
to apply the label “peace movement” to the British Peace 
Society in the nineteenth century is something of a misnomer. 
Yet, Ceadel reveals the quality of his insight into the dynam- 
ics of peace movements by likening them to a pilot light when 
conditions are unfavorable, “keeping the flame of peace 
thinking alive yet requiring the fuel of combustible public
sentiment in order to ignite a mass campaign” (p. 4). When 
conditions are favorable, the peace movement can be likened 
to “a glider, drawing attention to the upcurrents of peace 
sentiment which sustain it yet unable either to power its own 
take-off or indefinitely to extend its time aloft” (p. 4). 

Ceadel covers all the threads of peace thinking in Britain 
that had some kind of organizational manifestation over a 
period of ninety years, and he uses primary sources through- 
out. This is an awesome achievement. At times the text has a 
density that challenges the reader, such is the thoroughness 
of the coverage. But for anyone who aspires to teach or 
research on any aspect of British peace movement history in 
the nineteenth and first half of the twentieth century, this 
book should be their first source. 


Constructing Europe’s Identity: The External Dimension.
Edited by Lars-Erik Cederman. London: Lynne Rienner, 
2001. 271p. $55.00. 


Gerard Delanty, University of Liverpool 
In the last few years the question of the identity of Europe 
has moved from internal to external issues. Until about 1997, 
the issue was largely discussed in terms of whether a postna- 
tional identity on the European level might be able to 
compete with national identities. Many positions were taken, 
depending on theoretical and political assumptions that 
relate to identity formation and the future of the nation-state. 
Most of these debates presupposed the more or less estab- 
lished geopolitical space of mainly the western European 
Union (EU). This volume addresses a relatively new situa- 
tion, in which a different logic of identity formation is 
becoming apparent. 

Since about 1997, when the EU opened negotiations with
the former communist countries, a project of geopolitical 
enlargement has been initiated. This volume looks at some of 
the implications of the mostly eastern expansion of a polity 
originally designed for a much more limited membership and 
more restricted scale of activities. The ten chapters provide a 
valuable study of an underresearched topic. The book adopts 
a constructivist approach to identity and examines the trade- 
offs in the making of a new postnational European identity, 
which is conceived as fluid and open to new codifications. 
This approach, which regards boundaries as central to iden- 
tity formation, places the external boundary at the center of 
the stage. In this it is a welcome departure from the more
common internalist arguments, which see collective identity 
as something self-referenced. 

According to the constructivist approach, EU identity is 
becoming more and more defined by reference to the non- 

boundary, as opposed to the EU’s constitutive 
units, the national states. By focusing on the external dimen- 
sion, these essays aim to bring a new perspective to bear on 
the theory of European collective identity. One advantage of 
this approach is that it does not rule out the chances for 
European identity to take a cultural form, as opposed to a 
purely minimal political identity. The construction of a 
European identity is not a mere act of negative identity, as 
has often been argued, but a process that involves clear 
trade-offs. Of particular interest in this regard is the complex 
relationship of, on the one side, inclusion and exclusion and, 
on the other, thick and thin identities. 

The more Europe expands eastward, the more it will dilute 
its identity. Yet, the more protectionist Europe remains, the 
more it will have to exclude. This dilemma, the editor argues 
in his theoretical introduction, is the case with all forms of 
identity: The thinner they become, the more chances there 
are for greater inclusion, since thick identities tend to be 
more exclusive. A general conclusion that might be drawn 
from the various essays is that Europeanization is relatively 
flexible, and the external dimension does not necessarily 
result in straightforward closure. The contributors argue, to 
varying degrees, that the external dimension is compatible 
with a thin, postnationalist collective identity based on lib- 
eral, democratic values. There are some exceptions, largely in 
the area of immigration, but the general argument appears to 
be a recognition of new logics of openness in the construction 
of European identity. There is a clear move beyond the usual 
declarations of Fortress Europe as the inevitable product of 
European integration. This volume suggests a more subtle 
reading of a project that is more open-ended than is often 
thought. 

Craig Calhoun, in a wide-ranging discussion of identity 
formation, does not see a fundamental problem with the
external dimension in the making of European identity, since 
the essence of Europe is “inconsistency.” Internally hetero- 
geneous, Europe’s strength would be to create institutional 
structures to encourage diversity, he argues. This is followed 
by a chapter on Switzerland, which presents a perplexing 
example of, on the one side, pluralism and, on the other, 
relative closure. The three authors, Pascal Sciarini, Simon 
Hug, and Cédric Dupont, demonstrate how exclusionary 
practices go together with inclusive institution building. As 
much an exception as an example, it shows the relative costs 
of a high degree of inclusion. 

Philip Schlesinger, in a chapter on the cultural policies of 
the EU, documents the decline of anti-Americanism but 
notes that political communication is primarily national. This 
is followed by a chapter on audiovisual culture by Tobias 
Theiler, who shows that the EU attempt to construct a
European cultural dimension in opposition to American 
global culture has been unsuccessful. These chapters, which 
deal mostly with the relation of Europe to the United States, 
show that the construction of a European cultural identity 
will at the most result in a thin identity. 

When the external dimension is Russia as opposed to 
America, the result is likely to be a stronger affirmation of 
identity, as Iver Neumann argues in a chapter on eastern 
expansion. The chapter by Frank Schimmelfennig takes a
slightly different view of eastern expansion, which he sees as
less influenced by considerations of exclusion. The Union’s
thin postnational identity will allow it to expand to include 
states that share its liberal and democratic values. 

Two chapters, one on immigration by Jeff Huysmans and 
one on asylum policies by Vera Gowlland-Debbas, address
issues of exclusion. It must be borne in mind, as Huysmans 
argues, that policies in these areas are mostly shaped by 
national concerns and do not reflect a European vision of a 
cultural threat. The final chapter, by Cederman, sums up 
some of the arguments around the theme of exclusion versus 
dilution. His question is whether this is a real or imagined 
trade-off. 

The general conclusion is that the external factor does not 
result in an inherent tendency toward closure. Overall, the 
volume is a fairly coherent and well-sustained examination of 
how European integration is shaped by the external factor, 
which must be seen as a dimension that can be constructed in 
a wide variety of ways. The theoretical focus on trade-offs 
gives the book a certain uniqueness, although it tends to 
reduce the scope to a conception of identity as based on 
transaction costs. 


Sharing Security: The Political Economy of Burdensharing.
By Malcolm Chalmers. New York: St. Martin’s, 2001. 288p. 
$69.95. 


Danny Unger, Northern Illinois University 


Malcolm Chalmers examines how costs are apportioned 
among states in pursuit of common goals. He focuses on two 
sets of states, the victors of World War II (France, the United 
Kingdom, and the United States) and civilian (defeated) 
powers (Germany, Italy, and Japan). As this categorization 
suggests, Chalmers believes that the end of the Cold War did 
not fundamentally revise the global order created after 1945. 
Domestic norms, the politics of pork that support military 
spending by the victor powers, and multilateral institutions all 
helped to freeze in place postwar arrangements. Dispropor- 
tionate U.S. military spending represents not merely free 
riding among its allies but the fundamental postwar bargain 
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in which the defeated powers limited their military spending 
in order to reassure their allies, China and the Soviet Union, 
as well as their respective publics. 

To explain the distribution of effort across states, Chalmers 
considers the relative utility of public goods theory, liberal 
institutionalism, realism, and strategic culture theory. He 
uses this analytic framework to examine a number of cases: 
the North Atlantic Treaty Organization (NATO), the Euro- 
pean Union (EU), development assistance within the context 
of the Organization for Economic Co-operation and Devel- 
opment’s (OECD) Development Assistance Committee, the 
United Nations, and the Climate Change Convention. Public 
goods theory helps to account for asymmetrical U.S. military 
spending among the NATO countries but not U.S. free riding 
within other regimes. Liberal institutionalism usefully directs 
attention to the costs of renegotiating existing formulas that 
determine the distribution of burden among states. Realism 
helps us understand many of the policies of the three victor 
powers. Chalmers, however, leans most heavily on strategic 
culture theory (the effect of history and domestic politics) to 
explain the policies of both groups of states. 

Chalmers acknowledges the problems in assuming that 
defense spending by NATO allies represented public goods. 
He notes, for example, that the victor powers’ higher levels of 
spending resulted in substantial part from spending for
military deployments outside Europe that constituted private 
and, at times, competitive goods. The civilian powers es- 
chewed military deployments beyond their borders or nuclear 
strategies, the two elements of military expenditures that 
alone account for most of the gap in levels of military 
spending between the victor and civilian powers.

When he turns to other issue areas, however, Chalmers is
less consistently attuned to the question of whether goal
divergences among the powers make problematic the notion 
that public goods assumptions apply. Are increases in Official 
Development Assistance (ODA) a public good? Certainly, 
officials avowed commitments to ODA increases. Other 
officials and many political leaders, however, especially in the 
United States, argued that public assistance undermined 
development goals. Chalmers maintains that ODA increases 
“should” play central roles in burden-sharing debates. Per- 
haps they should, but do they? 

As Chalmers notes, ODA spending has been linked to a 
variety of private goods, evident in the Italian corruption
cases of the early 1990s, as well as to rule under social 
democratic parties. Denmark, the Netherlands, Norway, and 
Sweden, for example, account for only 4% of the OECD 
population but well over one-fifth of its economic assistance 
to poorer countries. An effective political consensus in these 
countries probably views this aid as a global public good, but it is
unclear that this is the case everywhere. Similar questions 
arise in considering proposed cooperation in reducing green- 
house gas emissions under the Kyoto Protocol of the global 
climate treaty. Chalmers makes no headway in grappling with 
these problems, which typically plague the public goods 
literature. 

The book focuses on the argument that the institutional 
form of leadership matters. Chalmers notes that U.S. officials,
at best, are ambivalent about multilateralism; they hope
to constrain others while maintaining their own freedom of 
maneuver. As a result, regimes that include the United States 
are apt to remain weak. In sharp contrast, German leaders 
have been fully committed to the European Union. 

The German case is indeed remarkable and powerfully 
supports Chalmers’s arguments. From the European Community’s
inception, Germany made offers that French leaders
could not refuse on the Common Agriculture Policy in return 
for French acquiescence to elements of supranationality in
European institutions. Animated by political concerns, Germany’s
net contribution to the EU in the late 1990s
amounted to about one-third of its defense budget. With a 
GDP only half that of France, Italy, and the United Kingdom 
combined, Germany made a net contribution three times as 
large. If this spending is included in calculating “aid” expen- 
ditures, Germany’s total outstripped that of Japan and the 
United States combined. Still more remarkably, these contri- 
butions to the EU flowed through multilateral channels that 
diminished the political leverage afforded to German leaders 
by this largesse. Largely due to German leadership, Chalmers 
concludes, and despite the many problems facing the Euro- 
pean Union in the process of enlargement, Europe “appears 
doomed to reach agreement” (p. 110). 

The author’s central concern is that the development of 
international regimes is constrained by U.S. strategic culture, 
and this limits their potential roles in securing global peace. 
Burden-sharing disputes triggered by the United States over 
small sums of money stymie the development of the United 
Nations, whereas German acceptance of dramatic spending 
asymmetries has made possible impressive institutional de- 
velopment in Europe. If realism can help us understand the 
policies of the victor powers, it cannot account easily for 
those of the civilian powers, particularly Germany. To under- 
stand German postwar policies we have to draw on liberal 
institutionalist and strategic culture explanations. 

Globalization and the rise of the EU as a counterweight 
may eventually shift U.S. elite attitudes that see multilater- 
alism as a threat. Chalmers likens these U.S. views to those 
evident in the United Kingdom in the 1950s. He offers 
some hope that the diversity of public goods provision 
among the OECD big six may enhance global levels and 
the quality of public goods provision. German support for 
the EU, for example, helps to secure the European peace, 
just as U.S. aircraft carriers contribute to stability in the 
Pacific. He concludes, however, that the United States 
often blocks efforts to address key contemporary security 
problems (poverty reduction in developing countries, curbs 
on environmental degradation). As a result, the EU and
Japan may need to assert leadership in these areas if the long 
peace of the second half of the twentieth century is to be 
sustained. 


Threats and Promises: The Pursuit of International Influence.
By James W. Davis, Jr. Baltimore, MD: Johns
Hopkins University Press, 2000. 222p. $32.50. 

Joseph Lepgold, Georgetown University 

How should threats, promises, and assurances be used as
tools of foreign policy? In this tightly written and well-reasoned
book, James Davis contends that international
relations scholars, who overgeneralize from the failure of
appeasement in the 1930s, focus too much on the efficacy and
dilemmas of sticks and not enough on the well-placed use of
carrots. His objective—an ambitious one, judging from the
absence of comparable works in the literature—is an argument
that integrates positive and negative sanctions into a
single theory of interstate influence.

Davis argues that deterrence theory, in which aggression is 
seen as driven by a defender’s weakness, oversimplifies 
actors’ motives. Davis contends that when an actor's motives 
are opportunistic, threats of punishment for aggression make 
sense, but when the motivation is a sense of domestic or 
international weakness, the actor is likelier to fight than 
retreat when pushed into a corner. This reasoning reflects the
finding of prospect theorists that people are more likely to
take risks to prevent losses than to make gains. By this logic, 
once one knows how a decision maker is framing his refer- 
ence point with respect to some issue, his motives—either to 
forestall perceived losses or make perceived gains—can be 
discerned. By this reasoning, if the actor fears loss on a key 
issue, promises are likelier than threats to induce compliance. 
Moreover, according to Davis, even if one actor is trying to 
improve the status quo at another’s expense, deterrent 
threats are more likely to succeed when they are accompa- 
nied by credible assurances that the punishment will be 
withheld if the actor behaves cooperatively. 

This argument leads Davis to several hypotheses (pp. 36, 
37). (1) “Threats that increase the prospective costs associ- 
ated with a given course of action may be highly effective at 
deterring a decision maker operating in the domain of gain 
but ceteris paribus will be less effective when decision makers 
are in the domain of loss.” (2) “The most effective promises 
should be those that offer the reduction or elimination of the 
target’s losses rather than those that augment another at- 
tribute, because the corresponding increase in utility takes 
place on a steeper limb of the value function.” (3) “To the 
degree to which they reduce uncertainty and increase the 
probability that the reference outcome will obtain, assurances 
can reduce fears and anxieties that often have led to assertive 
or aggressive foreign policies.” 

These hypotheses are tested across a series of cases from 
German diplomatic history in the nineteenth and early 
twentieth century: the absorption of Schleswig and Holstein; 
the Franco-Prussian War and the unification of Germany;
German confrontations with France over colonial rights in
Morocco; and the onset of World War I. These cases
comprise both opportunistic and defensive motives for Ger- 
man behavior that challenged the status quo and the full 
range of methods—threats, promises, and assurances—that 
the other European powers used to shape German behavior. 
As a group, they thus exhibit variation on the independent 
variables (influence technique and the target’s motive for 
expansion) and the dependent variable (the target’s choice of 
whether to back down or use force in reaction to that 
influence attempt). 

The cases show that a target’s motives significantly affect 
the efficacy of threats, promises, or assurances in producing 
compliance with others’ demands. When German leaders 
tried to expand because they thought others would not resist 
them, as in several of the Schleswig-Holstein crises, threats of 
Russian and French intervention caused them to back down. 
By contrast, when Bismarck feared in 1875 that France might 
recover quickly from the recent war and seek to regain its lost 
provinces of Alsace and Lorraine, Russian and French 
leaders reassured him that France would not try to revise the 
status quo. Once Bismarck believed that his recent territorial 
gains were safe, he allowed the crisis he had precipitated to 
dissipate. In all three cases involving Germany’s colonial 
holdings in Africa, policymakers in Berlin saw their position 
deteriorating relative to the other great powers. As Davis 
shows, they responded in kind to tough talk and actions from 
Paris and London, but they behaved more cooperatively 
when promised compensation elsewhere in Africa and its 
environs. 


In testing the hypotheses, Davis skillfully operationalizes 
the key variable in prospect theory, which states that actors 
evaluate prospective outcomes with respect to a “reference 
point,” typically their conception of the legitimate status quo. 
Outcomes that improve on that point are considered gains; 
those inferior to it are seen as losses. For each case, Davis 
carefully examines how actors came to define the status quo. 
His interpretations as to which instances of expansion were
driven by perceived opportunities to improve on the existing 
situation and which were viewed as defensive reactions to 
prospective loss are usually convincing. In reconstructing 
these decision-making assessments, Davis mines the second- 
ary literature and examines an impressive range of primary 
sources. His interpretations of the cases are at times
bold—he does not hesitate to challenge the prevailing schol- 
arly consensus about actors’ motives—yet plausible, if not 
entirely persuasive. 

Davis is willing to go out on a limb in discussing the policy 
implications of his argument: “If political leaders frame 
choices around a reference point and respond differently to 
prospective gains and losses vis-à-vis the reference point, 
then prospect theory provides answers to fundamental ques- 
tions of statecraft” (p. 36). This claim is problematic. Scholars 
can work to reconstruct actors’ decisional frames at whatever 
pace is deemed necessary, but policymakers in emerging 
crises have no such luxury. They need valid diagnoses of their 
counterparts’ motives in the short run. Without good evi- 
dence on other actors’ decision-making processes, such diag- 
noses are often difficult, if not impossible. As students of 
international relations are well aware, offensive and defensive 
motivations often produce similar observable behavior. Furthermore,
as Davis admits, we lack a theory of how actors
come to decisional frames. “Most political decision makers 
can imagine multiple reference points for any set of out- 
comes, and the proper referent is often the subject of intense 
political debate” (p. 41). If policymakers do not know their 
counterparts’ reference points, they have no way to deter- 
mine whether aggressive behavior reflects greed or fear. 
Lacking that knowledge, they have little basis for deciding 
whether to emphasize sticks or carrots as policy tools. 

To his credit, Davis acknowledges and tries to deal with 
these problems. If statesmen can simultaneously have offen- 
sive and defensive goals, and if it can be difficult even with 
good documentary evidence to discern actors’ motives, poli- 
cymakers in most cases should use an array of influence 
techniques. Deterrence can be used to identify clear boundaries
for the other’s behavior; promises and assurances can
be used to make the status quo more attractive and stable. 
When the actor is trying to uphold a status quo that is 
reasonably secure, or enjoys a healthy advantage in power 
over a challenger, she might take the risk of erring on the side 
of offering inducements. Conversely, if the challenger has 
equal power or a clear military advantage, it might make 
sense to err on the side of threats, even if that choice turns
out to produce an unwanted conflict spiral. 

No one has a clear mirror on nature: Policymakers’ 
assessments of the motives of others may reflect overly sticky 
beliefs, and the evidence itself often may be ambiguous even 
to an unbiased observer. For these reasons, it can be hard to 
know how to combine threats and promises into an effective 
strategy. Even so, Davis has written a perceptive and thought-provoking
book. He emphasizes, against the thrust of much
contemporary literature, that “even when statesmen confront 
an opportunist, succeasful deterrence requires the ability to 
issue both threats and assurances” (p. 154). This is surely 
correct: Force alone rarely suffices to stabilize the status quo 
over the long run. Durable deterrence requires at least some 
agreement on a mutually legitimate status quo. In developing 
an argument that drives home this point, Davis illuminates 
issues central to bargaining, strategic choice, and social as 
well as individual cognition.


Fascist and Liberal Visions of War: Fuller, Liddell Hart,
Douhet, and Other Modernists. By Azar Gat. Oxford:
Clarendon Press/Oxford University Press, 1998. 352p.
$85.00. 

Frederic S. Pearson, Wayne State University 

It is difficult to categorize this intriguing and illuminating
volume, since it is not entirely an intellectual history of
modernity and warfare, although it has elements thereof. Nor
is it strictly a modern history of Western strategic thought on
war, although it is so in large part. As the last in the author's
trilogy of studies of strategic thought, it appears to be an
amalgam of twentieth-century perspectives following from
the other two volumes, aimed at an ultimate consideration of
the direction of liberal versus authoritarian thought on war.
The book carries on nicely from the previous work, subtly
raising crucial policy questions for our time, but does not
entirely cohere as a single study.

Azar Gat, of the Department of Political Science at the 
University of Tel Aviv, reveals important patterns and inter- 
relations in the thought of European military, political, and 
journalistic figures on the Right and moderate Left, from the 
early to middle twentieth century. The focus wanders some- 
what from initial concern with the relation of philosophies 
such as fascism, or protofascism, to the institution of modern 
war, on the one hand, to alternate conceptions of strategies 
and tactics for fighting, winning, or avoiding wars, on the 
other. There is a great deal here for students of intellectual 
and political thought as well as strategic thought and, espe-
cially in the later chapters, for students of international
relations. The book will probably not satisfy any of these 
constituencies fully, however, since Gat dwells on the work of 
particular influential people, such as Liddell Hart. Thus, the 
range and depth of analysis vary. 

The thread connecting fascist and liberal military thought 
is the notion of modernity and modernist theory. This is a 
fascinating concept and is already well known in the study of 
armament and war, as seen in the works of Keith Krause 
(Arms and the State, 1992), Russell Warren Howe (Weapons, 
1980), Freeman Dyson (Weapons and Hope, 1984), and 
others. The modernist connection is made most tellingly in 
the first half of the book, which deals mainly with protofascist 
thought in Britain, Italy, and Germany. The individuals 
whose ideas are reviewed—J.F.C. Fuller, Giulio Douhet,
Ernst Junger, and others—were by and large fascinated with
the implications of emerging machine technology and orga-
nizational styles for society and war. The author skillfully
paints a portrait of a restless fin de siècle generation looking
for meaning and disillusioned with mass democracy, aristo- 
cratic hierarchy, and traditional folkways. 

Despite variations among some strange bedfellows (includ- 
ing the likes of H.G. Wells and other futurist literary figures) 
and obvious exceptions in the volkische sentimentality and 
sometime mysticism of National Socialist and fascist ideo- 
logues, most of the individuals sampled here predicted a new 
form of Darwinian social order. It would be based on expert 
regimes and mechanized warfare, with particular excitement 
about tanks and aviation, and would lead to transformed and 
supposedly heightened civilizations. The author shows there 
was keen technological fascination in other societies as well, 
notably America and Marxist Russia, and this fascination had 
its effect on military theory in these states, but he maintains
that the “pioneering” linkage to war fighting sprang most 
readily from the fascist thought. 

In view of Gat’s introduction, one might have expected a 
similar consideration of the links between the philosophical 
tenets of liberalism and modernist military thought in the
second half of the book. Instead, the focus shifts to the
development of British strategy to contain Germany before 
World War II, particularly the evolution of Basil Liddell 
Hart’s thinking from roughly 1920 onward. Disillusionment 
regarding total war is tested against the unmistakable need to 
take some effective action to thwart the advance of Germany 
and the fascist bloc. Liddell Hart’s views are placed in the 
context of difficult choices faced and fumbled by the British 
government. The link to liberalism is not made as explicitly as 
in the fascist examples, and there is little discussion of basic 
liberal tenets regarding economic and political competition/ 
cooperation or modernization. Yet, in the end the author 
shows that as total war became anathema to interwar liberal 
thinkers such as Liddell Hart, they reverted instead to limited 
war strategies (“limited liability”) and containment (embar- 
goes, sanctions, collective security). This explains much of the 
delayed response to Hitler by Britain and France. 

The volume is at once highly insightful and uneven. One 
wonders, for example, why no significant sections were 
devoted to French fascist-militarist thought and the question 
of modernity. French writers and military and political lead-
ers are mentioned en passant; it is mystifying that the
considerable radical Right tradition (e.g., Falangist) and the
legacy of Dreyfus are not given more systematic treatment.
Did France have no techno-military fascination (unlikely,
given the French aviation sector)? Is French protofascism
assumed to be similar to the Italian variety? Or is the author 
simply more at ease dealing with the Italian, German, and 
British cases?

Equally baffling and revealing of a potential logical fallacy 
is the rather short shrift given to the U.S. case. Gat tries to 
contrast his continental and British findings with other places 
noted for technological modernism (unfortunately, not in- 
cluding Japan). American Taylorism and Fordism indeed 
were adopted as models by the protofascist modernizers. 
What starts out to be a comparison and contrast, however, 
ends up with very little contrast; there is little explanation of 
why “most technological modernists in the United States who 
took a critical stand towards capitalism ... remained within 
the democratic camp” (p. 109). It may be that most proto- 
fascists were techno-modernists, but one cannot conclude, 
and Gat would not argue, that most techno-modernists were 
necessarily fascistic. Yet, Gat notes that many of them, such 
as Lewis Mumford and Frank Lloyd Wright, evidently be- 
cause they “inclined toward communitarianism, environmen- 
talism, and social planning,” had considerable appeal among 
protofascists. 

Instead of making the distinctions and explanations clear 
in counterexamples such as the American context, Gat dwells 
on the protofascism and anti-Semitism of Henry Ford, 
Thomas Edison, and Charles Lindbergh, which he traces, 
along with their technological tinkering, to the Midwestern
provincialism from which they all sprang. Here, in contrast to
his thorough treatment of European trends, the author
seems to be writing as if from a distance, mixing varied
movements and styles (“agrarian nostalgia, nativist Populism,
Progressivism, and celebration of technology,” p. 107). The
reader is left with a less intimate understanding and careful 
discussion of the intellectual undercurrents of the time, much 
of which were driven as well by eastern thought, replete with 
its own brands of techno-capitalism and anti-Semitism. 

This book and subject matter perforce require the balanc- 
ing of divergent and contradictory outlooks even for the same 
individuals. Fascists can be at once forward and backward 
looking. Liberal democrats can be torn among pacifistic
revulsion, pragmatic skepticism, and reluctant acceptance of 
or even enthusiasm for war, but wars of varying sorts and
styles. Ironically, in discussing British liberalism’s interwar
gun-shyness, Gat shows that Hitler's generals and others in 
the authoritarian camp likewise were reluctant to launch into 
total war (pp. 241-2). He also shows (p. 255) that liberals 
such as Liddell Hart were, given strategic needs, supportive 
of advanced weapons and new doctrines. 

Despite inconsistencies in the overall thesis, and with the 
exception of the American context, Gat does a remarkable 
job of making this complex and very human side of political 
thought palpable. At a time when democratic peace theory is 
so strongly proposed and so hotly debated, his work haunt- 
ingly suggests crucial and still fundamentally unanswered 
questions. To what extent can liberal democracies be relied 
upon to undertake global or regional police or alliance 
functions, using what sorts of strategies in what specific 
contexts? What are the effects of political, industrial, and 
military culture, as well as scientific fashion, on evolving 
defense doctrines and security agreements? What do the 
European Union and similar integration efforts portend for 
regional or Atlantic defense policy? Do political philosophy, 
regime type, and technological orientation really predict 
military strategy? 


Democracy Beyond the State?: The European Dilemma and
the Emerging World Order. Edited by Michael Th. Greven 
and Louis W. Pauly. New York: Rowman & Littlefield, 
2000. 191p. $65.00 cloth, $24.95 paper. 


Mark Pollack, University of Wisconsin-Madison 


Over the past decade, alongside the development of global-
ization and regional integration, politicians, activists, and
scholars have pointed to the emergence of a “democratic deficit,”
which results from the internationalization of 
domestic economies and the inadequate democratic creden- 
tials of new and existing international organizations. Much of 
this literature—exemplified by Fritz Scharpf’s seminal Gov-
erning in Europe: Effective and Democratic (1999)—focuses on
the experience of the European Union (EU) and examines 
the prospects for closing the democratic deficit through 
domestic and EU-level reforms. 

In Democracy Beyond the Nation-State, the editors and 
contributors (Stephen Newman, Michael Zürn, Claus Offe,
Edgar Grande, and Stephen Clarkson) broaden the perspec- 
tive of the democratic deficit debate in both theoretical and 
empirical terms. By comparison with Scharpf and other key 
works on the EU, which draw heavily from international 
political economy, the essays in this volume draw more
on political theory, in particular the work of
David Held, Michael Sandel, and Robert Dahl. In empirical 
terms, the four core essays (by Greven, Zürn, Offe, and
Grande) take the EU as the starting point of analysis, as does 
Scharpf; but a penultimate chapter by Clarkson makes an 
explicit comparison with the NAFTA case. Ultimately, all the 
chapters examine two broader questions: whether globaliza- 
tion erodes democracy at the national level and whether 
democratic values can be realized beyond the nation-state. 
Although the authors come to no consensus position on 
either issue, the volume is well integrated because each 
author notes points of agreement and disagreement among 
the various essays. 

Regarding the effects of globalization on national democ- 
racies, Newman sets the tone of the volume. He argues that 
economic globalization tends to weaken democracy, if and 
insofar as welfare-state and other domestic policies respond 
to the demands of the global marketplace rather than those 
of national polities and their elected representatives. At the
extreme, this process may end in a deregulatory “race to the
bottom” in which democratically expressed demands for 
social protection are ignored in favor of policies aimed at 
attracting mobile capital. In Offe’s words, the so-called acquis 
communautaire—the body of legislation accumulated by the 
EU and devoted primarily to market liberalization—now 
threatens the West European acquis nationale of strong 
liberal democracy and well-developed welfare states.

Clarkson’s chapter makes a similar argument regarding 
NAFTA, although his survey of its effects on the United 
States, Canada, and Mexico reveals a more subtle picture. 
Canadians appear to have lost democratic control to the 
market and to their southern neighbor, whereas Mexican 
democracy arguably has been strengthened on balance by 
NAFTA membership, and U.S. policies remain relatively 
unaffected. By and large, however, the contributors take the 
corrosive effects of globalization on democracy and the 
welfare state as a given and focus on the second question: 
Can a race to the bottom be averted and democracy regained, 
beyond the state, either globally or in regional organizations 
such as the EU and NAFTA? 

The initial response of the book to its titular question is
pessimistic. By contrast with David Held, whose model of a
“cosmopolitan democracy” is widely discussed by the contrib-
utors, Newman, Greven, and Offe argue that the criteria
associated with representative democracy are unlikely to be 
replicated beyond the state, even by organizations that, like 
the EU, demonstrate the highest levels of institutional devel- 
opment. Greven and Offe provide particularly critical reviews 
of the EU’s democratic prospects, not because of any specific 
institutional failing (although they cite a number of these) 
but because the EU and other international organizations 
lack two fundamental prerequisites for even the most rudi- 
mentary form of democracy. 

First, both authors point to the absence beyond the state of 
a demos, a group of people united by a sense of community or 
“we-feeling” that could provide the constituent basis for 
democracy and for the extension of the civil, political, and 
above all social rights to all its members. This is a common 
critique within the EU literature on the democratic deficit, 
and it applies with even greater force to the global level, the 
authors suggest. 

Second, and related to the first, Greven argues that 
democracy requires a single political space, “a distinct realm 
of communication and meaning” within which a democratic 
polity debates and deliberates (p. 44). The EU has arguably 
attempted to establish such a political space, with extensive 
consultation of civil-society groups and periodic election of 
representatives to the European Parliament, but Greven 
argues persuasively that the European political sphere re- 
mains limited to a relatively few Brussels insiders, stunted in 
its development by the diversity of languages and political 
cultures throughout the EU. For both of these reasons, the 
EU’s democratic deficit is arguably structural rather than 
institutional (Grande, p. 120) and is therefore unlikely to be 
overcome by transferring more powers to the “so-called 
European parliament” (Greven, p. 50). 

This book is not, however, a counsel of despair, and the 
later essays by Zürn and Grande, although acknowledging
the absence of a fully developed European or international 
demos, argue for a broader conception of democracy that 
might be compatible with international organizations like the 
EU. Zürn gives much attention to the promise of “delibera-
tive democracy,” which has been the subject of study on the
local level in the United States but is increasingly presented 
as a possible strength of the EU’s technocratic, committee-
driven system of governance. Such deliberation should be
encouraged, according to Zürn, although it comes at a cost
insofar as most EU deliberation takes place in camera among
unelected representatives and therefore suffers in terms of 
both transparency and representativeness. Grande argues 
explicitly for a model of “post-national democracy” that 
would pragmatically combine nonmajoritarian elements— 
including most notably consensus decision making and a 
system of institutional checks and balances—together with 
the occasional use of direct democracy in the form of 
Europe-wide referenda (also favored by Zürn).

Perhaps the most striking aspect of this volume is the 
apparently unanimous agreement among otherwise disparate 
authors that the EU—and by extension other, less developed 
international organizations—fares poorly according to tradi- 
tional standards of representative democracy, and the often 
discussed remedy of strengthening the European Parliament 
offers at best limited promise. Instead, the real debate within 
the book concerns whether the demonstrated failings of the 
EU and other international organizations in terms of repre- 
sentative democracy can, and should, be compensated by the 
more extensive use of nonmajoritarian checks and balances, 
deliberation, consultation with civil society groups, and occa- 
sional direct democracy. None of these expedients provides a 
functional equivalent for representative democracy, the au- 
thors admit, and some (like technocratic deliberation) may 
be inherently hostile to it, yet they may be the closest
approximation to democracy possible beyond the nation- 
state. 


European Integration: Scope and Limits. By Martin Holmes.
New York: Palgrave, 2001. 242p. $69.95.

The French Road to European Monetary Union. By David J. 
Howarth. New York: Palgrave, 2001. 239p. $69.95.


Carolyn Rhodes, Utah State University 


These are two very different books, both in terms of content 
and purpose. Howarth offers a straightforward academic 
explanation of French monetary policy from 1968 to 1999, 
and he provides a useful, if not unfamiliar, view of French 
motives for monetary integration. The Holmes book is 
equally straightforward, but it is a polemic against European 
integration and British participation in the current European 
Union (EU) structure. Despite their dissimilarities, however, 
both books provide interesting insights into the politics of 
European integration in these two countries, whose histories, 
ideological orientations, and economic structures vary so 
widely. What first appeared to me an odd juxtaposition of 
books for review now seems an insightful exercise for better 
understanding national reactions to the development of the 
EU.


Howarth’s analysis of French monetary policy generally 
reinforces what most of us in EU studies have known for 
some time—France has continually pursued a monetary
policy aimed at enhancing or protecting French national
power, and it only accepted European monetary integration
as a means of containing the influence of the Bundesbank
and advancing its own monetary interests. Howarth convinc- 
ingly shows that although France was also concerned with 
insulating the franc from dollar pressures, it was a growing 
concern over the relationship between the franc and the 
deutschmark that prompted definitive steps toward European
monetary integration. 

Driven by a strong currency policy, both for nationalistic
and to a degree sound economic (particularly its dependence
on dollar-denominated oil imports) purposes, successive
French governments struggled with stagflation, high debt,
balance-of-payments problems, and reflationary demands
from domestic interests. Often unable to match macroeco- 
nomic policies with a desire for a strong franc, leaders from 
de Gaulle to Mitterrand wrestled with how to use European
monetary integration to enhance their control over a cur- 
rency whose value increasingly depended upon cooperation 
from the Bundesbank. Institutional structures at the Euro- 
pean level became more and more attractive as it became 
clear to the French political elite that the French Central 
Bank did not have the resources on its own to manage the 
value of the franc. 

This story, which Howarth presents systematically and 
cogently, reaffirms our understanding of why France, such a 
nationalistic country, would become a major proponent of 
monetary union. The realities of the global economy of the 
1970s and 1980s, along with the geopolitical events in Europe 
of 1989-90, convinced French leadership that German eco-
nomic and financial influence would continue to reduce
France’s national monetary power. In the tradition of the 
European Coal and Steel Community, and much to the 
dismay of a marginalized Left and the far Right, the French 
government decided that a monetary partnership (within EU 
institutions that it could influence) with its erstwhile mone- 
tary competitor would better serve its own financial and 
economic purposes than going it alone. Thus, deeper eco-
nomic and financial integration would paradoxically offer
more monetary control through supranational political struc- 
tures than would an autonomous, nationalistic policy. 

The Holmes book demonstrates a very different experience 
with European integration, born from Britain’s more global 
economic and political history and from the perception that 
its distance from Europe, in mindset as much as in miles, sets 
its interests apart from those of the Continent. This is a 
collection of essays critical of the EU and of British govern- 
ments whenever they ventured into deeper EU integration, 
and it is an unabashed representation of the intellectual and 
nationalistic arguments of the Conservative Party’s Euro-
skeptics. Written from 1994 through 2000, the essays reveal
this group’s concerns regarding such issues as Europe’s single 
currency, enlargement, and tax harmonization. In addition, 
there is a very critical assessment of John Major's ill- 
conceived effort to balance his policy toward EU integration 
in such a way as to satisfy the Euroskeptics in his own party 
while still venturing into deeper EU cooperation. According
to Holmes, Major only played the role of Euroskeptic himself
when he was attempting to retain party control in the face of 
Euro-criticism. When the discrepancy between the direction 
of his policy and his Euroskeptical pronouncements became 
too great, the Conservatives rejected (and rightly so, accord- 
ing to Holmes) his leadership. The choice of William Hague, 
a Euroskeptic from the start, in 1997 to lead the party was, 
according to the author, a positive move that makes clear the 
Conservative stand on European integration. 

Referring again and again to the “right kind of EU,” not 
one “run by a cling-on Commission” that defies democracy 
and has no accountability, Holmes argues that the EU should 
be reformed into a free trade association whose member 
states retain ultimate say on whether to participate in new 
EU legislation (p. 57). He is very critical of the cohesion fund 
(which he labels a pork-barrel boondoggle), arguing instead 
that true free trade would eventually address regional devel- 
opment challenges. Holmes accuses the EU of reneging on 
single-market pledges in favor of protectionist structures in 
case after case, and he offers a persuasive argument against 
the European model of customs union and socialized eco-
nomic structures. Yet, he takes this argument too far, ignor-
ing the real free market success stories of integra-
tion and the EU’s leadership (albeit mixed) at the World
Trade Organization. Like most polemics, his does not at-
tempt to present a balanced assessment; it offers one view
and arranges the evidence accordingly.

The French Road to European Monetary Union is by no 
means earth shattering in its discoveries or approach. It is a
solid book that takes the reader systematically through
successive French governments and explains their remark-
able consistency in monetary policy regardless of the party in
control. The book is well developed and nicely written, and
although the author’s attempt to categorize French policies
in terms of negative power, positive power, and veto seems
more like an “add-on” than a fully integrated analytical
approach, it is somewhat useful.

European Integration: Scope and Limits is also interesting
but for different reasons. The book is mistitled. One expects 
a broad explanation of the integration process and some 
insights into why integration has limits across Europe. In-
stead, it is entirely a British discussion about why the United
Kingdom should not become more deeply imbedded in the 
EU. It makes no effort to provide a balanced assessment. 


The Deadly Ethnic Riot. By Donald L. Horowitz. Berkeley: 
University of California Press, 2001. 605p. $35.00. 


Stephen M. Saideman, Texas Tech University 


Donald Horowitz’s previous volume (Ethnic Groups in Con-
flict, 1985) was perhaps the most important book written on
the subject in the 1980s; it not only covered groups around
the world but also addressed a variety of topics, including the
nature of ethnic identity, intergroup relations, ethnicity and
the military, separatism, and political institutions. The Deadly
Ethnic Riot considers a very specific form of ethnic conflict,
but his focus should not limit its effect. It has implications
beyond ethnic riots and will challenge scholars in the field to
think more seriously about social psychological approaches to 
ethnic conflict and to think more clearly in general about 
ethnic strife. 

Horowitz argues that the ethnic riot is distinct from other
forms of ethnic conflict (feuds, lynching, violent protests, 
terrorism, civil war, genocide). An ethnic riot is “an intense, 
sudden, though not necessarily wholly unplanned, lethal 
attack by civilian members of one ethnic group on civilian 
members of another ethnic group, the victims chosen because 
of their group membership” (p. 1). He delineates the pro- 
cesses through which riots occur, moving from rumors and 
precipitants, to lulls, to the sudden killings and mutilations of 
a particular group, to the consequences of riots for further 
strife. The book would make a significant contribution simply 
as a descriptive account of ethnic riots, but Horowitz goes 
beyond this to examine their causes. 

Horowitz argues that social psychology presents a better 
perspective than other approaches for understanding why 
ethnic groups engage in conflict in certain forms against 
particular targets at specific times. The key move is target
selection, which he suggests is a mixture of “calculation and 
passion.” Horowitz observes that rioters are quite selective in 
choosing their victims, focusing on one hated ethnic group
rather than on others, and are willing to risk losing a potential
target if there is some doubt about group affiliation. He
considers what factors cause certain groups to be targeted
rather than others. Targeting is problematic, and therefore
theoretically interesting, because a group often has conflict
with more than one group, so there can be variation over time
and space.

Horowitz identifies several necessary conditions for ethnic
riots to occur: Two groups must have a hostile relationship;
one or more events and/or policies must provoke a group (a
precipitant); justification for killing; and a relatively low risk
that engaging in violence will be costly to the attackers (p.
524). Horowitz takes hatred/antipathy seriously, considering
the various roles it plays in ethnic conflict in general and in
ethnic riots more specifically, but the hostile relationship is 
not the same as ancient hatreds. The hostility must be 
current, although it may hark back to past grievances. 
Precipitants are very important, because not just any situa- 
tion will trigger ethnic conflict. Only certain events will evoke 
the angry response that unleashes the killing. One path for 
future work is to consider further what kinds of events or 
policies are more or less likely to provoke violence. People 
also must feel justified in their actions; if not, near riots will
sputter out or become something else. The last necessary 
condition, a perception that risks are relatively low, is quite 
striking. Although rioters often are seen as irrational, 
Horowitz maintains they attack only when they are unlikely 
to pay the costs of their actions. Thus, riots are more likely 
when the attackers greatly outnumber the targets, when the 
police are absent or allies, and when politicians give them 
cover. If any one of these conditions is absent, then a deadly 
ethnic riot will not occur, although other forms of violence 
may. Thus, the book provides some policy advice. 

Horowitz makes an explicit attempt to counter recent 
arguments that focus on the security dilemma (Barry Posen, 
“The Security Dilemma and Ethnic Conflict,” Survival 35 
[1993]: 27-47) and rational choice explanations (Robert H. 
Bates, Rui J.P. De Figueiredo, Jr., and Barry R. Weingast, 
“The Politics of Interpretation,” Politics and Society 26, 4 
[1998]: 603-42, and James D. Fearon, “Commitment Prob- 
lems and the Spread of Ethnic Conflict,” in David Lake and 
Donald Rothchild, eds., The International Spread of Ethnic
Conflict, 1998). Horowitz is largely successful in asserting that 
hate plays an important role in ethnic conflict because he 
considers how emotion and reason interact, rather than arguing
that the existence of antipathy means that actors are com- 
pletely irrational. As a result, his book presents a challenge to 
recent work and forces scholars to consider more seriously 
the roles played by emotions, particularly hate, in ethnic 
strife.


Horowitz uses ethnic riots to illustrate a variety of points, 
and this leads to two problems for subsequent research. First,
because his book is a mix of social psychological theory and
inductive reasoning, it is hard to tell at times whether the 
theory is providing new insights into ethnic riots or whether 
the examination of ethnic riots is telling us new things. It is 
important to know what the theory does and does not tell us, 
so that we can extend and evaluate it. Second, because the 
method is a close examination of the relevant population, it 
is hard to determine which causal relationships matter. The 
book presents neither a set of case studies nor quantitative 
analyses. Therefore, it is difficult to establish what we really 
know. Horowitz considers these issues in a chapter on 
nonriots. He discusses how one could possibly select cases, 
given that it is hard to determine which nonevents one ought 
to study in comparison to some of the riots he analyzes. 

This book makes an important contribution to our under-
standing of ethnic conflict. It will be a source of testable
hypotheses for years to come, as Horowitz makes claims 
about what kinds of inter- and intragroup characteristics are 
likely to cause conflict (group cohesion attracts aggression) 
and which are not (cultural distance and economic grievances 
may not matter). He also provides some clues for several 
unanswered questions: Why do followers follow? Do leaders 
follow or lead? When and why do groups act as groups? One 
of the book’s greatest potential contributions is to force 
scholars to think about different kinds of ethnic con-
flicts—how ethnic riots are similar to and different from
peaceful protests, terrorism, organized rebellion, and the 
like. Thus far, much of the debate has focused on the 
presence or absence of violence, rather than the form vio-
lence may take. The Deadly Ethnic Riot will compel scholars
to theorize about that which is common to all ethnic conflict
and that which varies, depending upon the type of conflict. 


Stable Peace among Nations. Edited by Arie M. Kacowicz,
Yaacov Bar-Siman-Tov, Ole Elgström, and Magnus Jer-
neck. Lanham, MD: Rowman & Littlefield, 2000. 352p.
$75.00 cloth, $26.95 paper. 


Stephen R. Rock, Vassar College 


Most scholars have long recognized that international politics 
is not always and everywhere a Hobbesian war of all against 
all. Rather, certain states have formed what Karl Deutsch 
called security-communities. Within these communities pre-
vail relations of “stable peace,” a term coined by Kenneth
Boulding to denote a peace that is not merely the absence of 
war, but a peace in which even the possibility of war is 
extremely remote. For a variety of reasons, the minimal 
risk of death and destruction foremost among them,
stable peace is clearly preferable to a precarious peace
maintained solely by deterrence or a balance of power. 
Nevertheless, with a few notable exceptions, little effort has 
been devoted to identifying the origins of stable peace or the 
ways in which it might be created and nurtured. Stable Peace 
among Nations seeks to remedy this situation. The editors
and contributors have as their central concerns (1) to define 
stable peace, differentiating it from other forms of the 
phenomenon, (2) to explain how and why relations of stable 
peace develop among states, and (3) to identify the factors 
critical to maintaining stable peace once it emerges. 

In a superb opening chapter, “Stable Peace: A Conceptual 
Framework,” Kacowicz and Bar-Siman-Tov set the stage for 
the work. They define stable peace as having both a cognitive
and a temporal dimension. Cognitively, it exists when war is
unthinkable as a means of resolving interstate conflict. Tem-
porally, it persists over time. The authors distinguish between
the stabilization and consolidation of peace. Stabilization 
refers to the process by which stable peace emerges after 
conflict has been reduced and eventually resolved. It implies 
“strategic” or “complex” learning in which the parties rede- 
fine their national interests and develop a deepening com- 
mitment to one another and the new character of their 
relationship. Four conditions are necessary for stabilization: 
(1) stable political regimes, which help promote mutual trust 
and confidence; (2) mutual satisfaction with the terms of the 
peace agreement and/or the status quo; (3) predictability of
behavior and problem-solving mechanisms; and (4) open
channels of communication, trust, and respect between the
leaders. In addition, two factors are favorable to stabilization: 
third-party guarantees and spillover effects and the provision 
of nonmilitary public goods. With respect to the consolida- 
tion of peace, two conditions are necessary: (1) a continua- 
tion of satisfaction with the status quo and (2) a common 
normative framework, which may be enhanced by shared 
characteristics, including aspects of culture and identity.

The rest of the book is divided into two parts. The first 
consists of a series of theoretical and conceptual chapters,
most of which are rich with historical illustrations. These 
essays consider such subjects as the domestic, regional, and 
international sources of stable peace; the role of common
enemies and common identity; the development of trust; the 
treatment of issues; the economic aspects of making stable 
peace; and the relationship between stable peace and dem- 
ocratic peace. The second part contains case studies that seek 
not only to explain the formation (or nonformation) of stable 
peace in some instances but also to assess prospects for the
future. Among the cases examined are Scandinavia, the
southern cone of South America, Egypt and Israel, Europe 
and the future of NATO, and German-Polish relations. 


Stable Peace among Nations is impressive in many respects. 
It is unusual to see such sophisticated conceptual and theo-
retical work in combination with detailed case studies. It is
perhaps even rarer to find an edited volume in which the 
editors and contributors appear to have worked so closely 
together that the overarching theoretical and conceptual 
apparatus, on the one hand, and the individual chapters, on 
the other, fit almost seamlessly. The result is a study with a 
breadth unattainable by a single author but with a coherence 
and degree of integration hardly ever achieved in a collection 
of essays. 

The book is also commendable for its paradigmatic eclec-
ticism. It draws heavily on ideational perspectives, including
constructivism. Constructivism holds that subjective and in-
tersubjective understandings, more than material or objective
forces, shape relations among states. Because stable peace is 
in essence a state of mind, a cognitive phenomenon, it is a 
particularly appropriate subject for constructivist analysis. 
Many of the chapters, including those by John Owen and 
Magnus Ericson, suggest that a constructivist approach can 
be quite helpful and perhaps even crucial in explaining the 
emergence and perpetuation of stable peace. In their fine 
concluding essay, Elgström and Jerneck, who describe them- 
selves as “moderate constructivists,” explicitly recognize the 
causal importance of international power structures, domes- 
tic political configurations, and other less ideational factors. 
Although neorealism does not figure prominently in the 
book—since it is skeptical that a peace of sufficient depth and 
duration to be described as “stable” could ever exist—certain 
chapters, particularly the one by Benjamin Miller, consider 
the international systemic origins of stable peace. Other
essays examine such traditionally liberal or neoliberal vari-
ables as economic relations, international institutions, and
regime type. 

The study of stable peace overlaps with one of the most 
prominent research agendas in international relations of the 
past two decades: democratic peace theory. Lending support 
to the arguments of that theory, Kacowicz and Bar-Siman- 
Tov suggest that the presence of well-developed democratic 
regimes is sufficient, although not necessary, to maintain 
stable peace. But this book deemphasizes the earlier struc- 
tural-constraint and normative explanations for democratic 
peace and focuses instead on the sense of identity shared by 
democratic (or liberal) states. One of the most interesting 
and potentially controversial claims is Owen’s contention that 
the formation of common identity crucial to the consolida- 
tion of stable peace requires an enemy, an “other.” The 
somewhat depressing but logical conclusion is that stable
peace can never be universal. The best one can do is create
islands of stable peace and attempt to minimize conflict 
among them. 

If this is true, the democratized world advocated by many 
democratic peace theorists might not be so pacific after all. 
Owen’s contention is also hard to reconcile with the generally 
optimistic thrust of constructivism and particularly the asser-
tion of some constructivists that we could escape the perni-
cious effects of the anarchic Westphalian system if only we
would think differently about it. It is, however, despite its 
origins in an ideational perspective, consistent with the 
neorealist belief that peace among some states is little more 
than a byproduct—often a fleeting one—of competition and 
conflict between those states and others. The compatibility of 
neorealist and constructivist analysis, in this limited respect at 
least, is just one of the book’s many insights. 

Stable Peace among Nations is an excellent work, the most 
theoretically developed and comprehensive study yet pro-
duced on the important but too rarely examined phenome-
non of stable peace. It will doubtless remain so for some time.


A Changing United Nations: Multilateral Evolution and the
Quest for Global Governance. By W. Andy Knight. New 
York: Palgrave, 2000. 257p. $65.00. 


J. Martin Rochester, University of Missouri-St. Louis 


Andy Knight examines the prospects for United Nations 
reform in the post-Cold War era, offering both historical
context as well as speculation about how conducive the
contemporary international system is to building a next-
generation global organization in the foreseeable future. This
useful study, which seeks to blend a theoretical discussion 
with prescription, starts with a survey of the literature and 
competing perspectives on UN reform as well as a conceptual
treatment of institutional change. It then traces the evolution 
of the UN, how it has responded to changes in its environ- 
ment, and the many failed efforts at UN reform. Finally, 
Knight proposes a new governance model as the basis for a
rejuvenated UN. 

The author “tries to understand the UN’s place in the 
broader history of multilateral evolution, how the UN system 
has changed, and why” (p. 11). Although he nicely captures 
the “institutional foundations” in 1945 (p. 65) and summa- 
rizes the key developments since, it is hard to improve on the 
work of many who have traveled this path, notably Inis 
Claude (Swords into Plowshares, 4th ed. 1984, and “The
Record of International Organizations in the Twentieth 
Century,” Tamkang Chair Lecture Series, 1986). The most 
original aspect of the book is the conceptual treatment of 
organizational change processes and how these relate to the 
problem of global governance, that is, the analysis of the 
pressures being felt by the UN, as a sovereignty-based 
interstate organization, to adapt to a world being buffeted by
both centrifugal (subnational) and centripetal (transnational)
forces that are eroding sovereignty. 

There is something of a disconnect between the purported
empirical thrust of the book and the normative agenda that 
seems to color much of the analysis. On the one hand, Knight 
states that “the broad purpose of this book... is to describe, 
analyze, understand, and explain the phenomenon of multi- 
lateral evolution as a concept and a practice” (pp. 10-1). On 
the other hand, much of the discussion is hortatory and 
reflects antiestablishment, antistatist biases, even though
Knight accuses others (American unilateralists and other 
anti-UN ) of wearing “ideological blinkers” (p. 28). For 
example: “Future institutions of global governance must
become more proactive, addressing underlying reasons for 
global and domestic conflicts” (p. 9). “Unless the UN can 
reconstitute itself so that ‘we the people’ can be brought back 
into the picture, there is a good chance that some of the
emerging multilateral arrangements may be designed in a 
way that will bypass this organization” (p. 9). “With the end 
of the Cold War political space was created for the emer- 
gence of a human security agenda reflecting the aspirations 
of social forces, human groups, and individuals across the 
globe” (p. 169). 

It is hard to dispute Knight’s statement that, in light of
globalization in the economic sphere, new kinds of violence
emerging in the security sphere, and numerous other shifts
in the landscape of world politics, “there is a need to update 
the UN machinery to accommodate changes occurring in the 
broader international society and system” (p. 20). But even if 
there is a compelling logic that dictates a reformation in 
international governance arrangements, will, in the words of 
Stanley Hoffmann (Primacy or World Order, 1978, 193), “the 
need forge a way?” Knight’s answer is somewhat confusing.
Although “a new consensus on the place of the UN in 
international politics seems to be developing” (p. 129), “the 
role of the UN in new global arrangement is... very
much unclear” (p. 150). 

Studies of international organization in general and the
UN in particular always struggle to avoid the twin traps of
bad realism and bad idealism, of succumbing to either 
excessive cynicism or excessive wishful thinking. Knight tries 
to get around this problem by framing his work as an exercise 
in critical theory. He spends considerable time discussing the
agent-structure issue and how international institutions like 
the UN are creatures of habit and practice yet are able to 
break out of routines and undergo change. He acknowledges 
the role of power and interests—“multilateral institutions 
reflect the underlying configuration of power at a particular 
critical juncture” (p. 184)—but is strangely silent on whether 
the United States or any other key state actors would support 
the governance model he proposes as a basis for UN reform. 

After a survey of various approaches to UN reform, 
ranging from those of “status quo advocates,” “incremental 
process reformists,” and “adaptive reformists” to more rad- 
ical views (“dissolutionists” and “successionists”), Knight 
boils down the modes of changing international organizations 
to three: reform, adaptation, and learning (p. 40). He argues 
that neither reform (planned, formal restructuring)
nor adaptation (informal, reflexive, nonpurposive organiza-
tional responses to changing circumstances) has worked to
make the UN a more effective multilateral institution. He 
calls for learning as the preferred mode of change and offers 
a “subsidiarity” model as the best chance for “rethinking” 
how the UN should function. Similar to the model adopted 
by the European Union, subsidiarity amounts to “sharing the
labour of governance with central, regional, and local bodies” 
(p. 173). The book is surprisingly sparing in its elaboration of
what this would entail, and only a few pages at the end (pp. 
170-8) describe the model. Indeed, at a time when Knight 
admits economic issues threaten to overtake traditional 
security issues as a paramount global concern, the only real 
recommendation he puts forth is in the war/peace area: a
greater role for regional organizations, such as the Organi-
zation of American States and the Organization of African
Unity, as conflict managers. Not only is this an under-
whelming recommendation after almost 200 pages, but also,
as the author recognizes, regional organizations have not had 
much success in the security field over the years. 

A central twofold question of our time is: (1) How much 
government/governance do more than six billion human 
beings want in their lives, that is, what is the proper relation-
ship between state and society, and (2) to the extent it is
understood that some measure of government/governance is 
needed, what is the optimal level for maximizing benefits in a 
given issue area, that is, global, regional, national, or local? 
Knight's book, at times very insightful and eloquent, makes a 
modest contribution in helping us confront this puzzle. One 
only wishes he had carried the analysis farther.


The International Law Commission of the United Nations.
By Jeffrey S. Morton. Columbia: University of South 
Carolina Press, 2000. 225p. $29.95. 


William Felice, Eckerd College 


Before the eyes of the world, between 600,000 and 800,000 
Tutsi were murdered, many hacked to death in front of their 
families. Rather than confront Hutu power and challenge 
those responsible for this attempted genocide, the UN pulled 
out of Rwanda. President Clinton later apologized for the 
inaction and passivity of the United States. U.S. indifference 
toward the Rwandan holocaust endures as the Clinton ad- 
ministration’s paramount foreign policy failure. After World 
War II the international community pledged that genocide 
would never again be tolerated, but in Rwanda it was. Can
victimized peoples ever put their trust in the UN and 
international law? Is it possible both to establish internation-
ally binding codes of conduct that criminalize genocidal
practices and to enforce these codes? 

Jeffrey Morton helps answer those questions through his 
exploration of the work of the International Law Commission 
(ILC) of the UN in two specific endeavors: the establishment 
of a binding code of international crimes and the creation of 
a standing international court. His descriptive and empirical 
study focuses on the evolution of the term “international 
crimes” and the ILC debates that resulted in the Draft Code 
of Crimes against the Peace and Security of Mankind and the
Draft Statute of the International Criminal Court. (These 
documents are included in the book’s appendix.) Since the 
end of the Cold War, the ILC has devoted most of its time to 
these issues of international criminal law. Morton’s book 
provides a valuable overview of the significant accomplish- 
ments of the ILC in these critical areas. 

Students of international law will find the first four descrip- 
tive chapters on the history and work of the ILC particularly 
useful. Morton begins with a well-written summary of the 
history and organization of the ILC and proceeds with 
clarifying analysis of ILC work on international crimes, the 
draft code of crimes, and the international criminal court. He
adeptly summarizes the breadth of international crimes, 
including international terrorism, drug trafficking, torture, 
colonial domination, genocide, apartheid, war crimes, crimes 
against humanity, aggression, aircraft hijacking, and piracy. 
His history of the struggle to codify a draft code of crimes 
against the peace and security of mankind dramatically
illustrates the unprecedented challenge the ILC confronted.
The clear summary of the legal sources used and the princi- 
ples enshrined in the draft code by the ILC is of particular 
value to scholars of international law. 

Morton’s chapter on the international criminal court pro- 
vides a concise review of attempts to establish the court, 
culminating in the Rome conference in June 1998, and a 
review of the arguments for and against the creation of a 
permanent body. Morton focuses on the role of the ILC in 
drafting the statute for the court. Much progress has been 
made by the ILC in translating legal principles into binding
rules. The end of the Cold War opened a political door for
the creation of an international criminal court, and the ILC 
moved with dispatch to draft a workable statute. 

In the final two chapters, Morton presents an empirical 
analysis of the ILC. He constructs a database through content 
analysis of the recorded debate among ILC members on the 
wording, substance, and scope of the respective draft articles. 
Commission members’ positions are treated “as observations 
that, when coded, generate data regarding the degree of
consensus within the commission on draft articles” (p. 79). 
Numerical values are assigned to statements, ranging from 1 
(minimal consensus) to 5 (maximum consensus). From these
data, Morton seeks answers to two questions: “(1) Do commis-
sion members function independent of or as extensions of
their home governments? and (2) What impact, if any, did 
the end of the cold war have on the International Law 
Commission” (p. 102)? 

On the first question, Morton concludes, “the empirical 
results are clear and conclusive: the political arena in which 
nation-states compete for the attainment of their vital inter- 
ests has penetrated the International Law Commission to 
such extent that commission debate is a microcosm of world 
politics. The existence of a cohesive East European bloc of
commission members, conflict between commission members 
from Eastern and Western blocs, a North-South divide, and 
Southern cooperation in commission debate indicate in the 
strongest of terms that, indeed, the International Law Com-
mission is a reflection of the political, ideological, and
economic struggles of the international arena” (pp. 102-3). 

In regard to the second question, the data reveal that the 
end of the Cold War greatly affected the ILC. “The conflic-
tual relationship of East and West ended, and Southern
cooperation experienced a significant decline” (p. 105). The 
second set of findings thus reinforces the first, namely, “that 
the Law Commission reflects the international arena in which 
it functions” (p. 105). 

Morton’s study supports a view of international law shared by
classical political realists and classical Marxists: International 
law and international organization cannot be insulated from 
international politics, economics, and political power. In fact, 
most “idealists” and liberal internationalists would accept the 
impossibility of positing international law as totally indepen- 
dent of power politics. ILC members may strive to serve “in 
a personal capacity” independent of nation, but their inher- 
ent bias (due to their education and professional training) 
will most likely be toward the ideological worldview of their
home state. 

In the end, Morton’s conclusions suffer from problems 
often seen in behavioralist studies. Some behavioralists and 
numerous rational choice theorists tend to make exaggerated 
claims about the validity of their approach and the “science” 
behind their methods. Morton presents a strong defense of 
behavioralism as a scientific method of inquiry and argues 
that international law as a field has declined due to its 
separation from these methods (p. 75). He wants interna- 
tional law to adopt “systematic, statistical testing of proposi- 
tions,” which would provide “a firm footing” and allow it “to 
enter into the mainstream” of international relations study 
(p. 76). 

I find this argument unconvincing. I am reminded of the 
caution toward behavioralism expressed by Charles Lind-
blom in his presidential address to the American Political
Science Association in 1981, “Another State of Mind: Presi-
dential Address, 1981” (American Political Science Review 76
[March 1982]: 9-21). Lindblom is skeptical about political 
science’s pretensions to be a science. To qualify as a “science” 
means that our procedures would give us “discoveries,” that 
is, new or nontrivial insights about the political sphere. Many 
statistical studies end up confirming the obvious. Scholars of 
international law have long noted the effect of power politics 
and the international system on the creation and implemen- 
tation of legal codes and rules between states. To these legal 
scholars, Morton’s statistical analysis may simply be confirm- 
ing the obvious. 


Traditions of War: Occupation, Resistance, and the Law. By
Karma Nabulsi. Oxford: Oxford University Press, 2000. 
293p. $72.00. 


Michael Dillon, University of Lancaster 


This book is a work of considerable scholarship that began 
life as a doctoral thesis under Adam Roberts at Oxford 
University. More than three pages of acknowledgments bear 
witness to its pedigree. They also disclose an intriguing 
combination of English traditionalists and non-English revo- 
lutionaries who sustained the author in the transformation of 
a research thesis into an elegantly produced book, which is a 
classic in the style of the English School of international
relations. Clearly, there is yet another story to be written, 
however, about the curious provenance of such a work. It 
may very well be a very much more interesting and intriguing
story than the one the book pursues, although this itself is not
without interest and importance. 

Traditions of War pursues a number of complex and related 
themes and excavates many historical archives in exploration 
of its core problematic: the failure of the laws of war to 
resolve the problem of distinguishing between lawful and 
unlawful combatants. Traditional approaches—those of the 
central paradigms of international relations theory and the 
key concerns of international law—ordinarily supply the
analytical optics employed in these kinds of investigations. 
This book is refreshing because it advances three philosoph- 
ical traditions of war instead: the martial, the Grotian, and 
the republican (indebted to Rousseau). In applying these to 
the rich confusion of military occupation and resistance 
experienced in Europe over the last 250 years, the author 
exposes the many historical as well as conceptual limitations 
and fallibilities of international realism and idealism. 

War for this author is not an aberration that has to be 
explained, excoriated, and outlawed. Neither is war simply a 
legitimate instrument in the realist pursuit of stylized political 
ambitions. The intimate correlation between forms of war 
and forms of life is not a point for which the text argues, but 
one of the points from which it argues. More interestingly, 
what distinguishes this account of the three traditions of war 
upon which it draws is the recognition that forms of war and 
violence are themselves integral also to specific philosophical
traditions and political ideals concerned to champion varying
interpretations of the good life. No idea is innocent of 
violence. Every idea has a violence that is essential to its 
realization. The legality or illegality of violence is a function 
of the order that different traditions commend. Thus, liberal 
internationalism in its high legal pomp is shocked to be 
reminded that liberal peace is always intimately correlated 
also with liberal ways of war. 

Because this is a classic in the English School of interna- 
tional relations, the wealth of its historical scholarship casts 
much diversionary light on all sorts of issues and themes. Of
particular interest to me is the account given of the martial
character of Britain’s imperial experience. Anyone familiar 
with Britain’s postimperial foreign and defense policy cannot 
but be struck by the continuing force of that tradition at 
home and its deep influence throughout Britain’s geostrate- 
gic intimacy with the United States abroad. Alternatively, for 
those interested in the republican tradition, there is a corre-
sponding wealth of detail on obscure European republican
activists and theoreticians. The overall worth of the book 
nonetheless depends upon its core analytical method, the 
idea of a tradition, and the messages conveyed by that
analytical medium. Where the English School is concerned, 
the medium is certainly also the message, politically as well as 
analytically. The advantage of tradition is that it directs 
attention to the complex historicality of ideas and experience, 
but the disadvantage is that it elevates continuity over 
disjunction. Moreover, as employed in this school, tradition 
elides a host of other relevant issues. Two seem to me to be 
critical. 

First, we should ask about these three traditions of war. 
Whose traditions were they? What archives have been ig- 
nored in the constitution of this narrowly tripartite account of 
the traditions of war? The focus is on the laws of war and 
their failure satisfactorily to distinguish between lawful and 
unlawful violence, but there is, for example, only a very 
modest postcolonial sensibility at work here, and the Marxist 
tradition simply does not figure at all. It is as if there were no 
Marxist tradition, as if the Soviet Union, together with its 
effect for good or ill upon the trajectory of international 
affairs and the character of the international society upon 
which the English School places such store, had never 
existed. 


Second, tradition here implies continuity and intelligibility. 
This book’s very account of tradition is one in which tradition 
lies in the seamless historical transmission and reception of 
texts rather than in the violent destructions and deconstruc- 
tions that effect breaks, discontinuities, and dynamic incoher- 
ences in our meaning (and legal) systems. In short, the idea 
of tradition in the English School is a deeply conservative one 
that ignores much of what it claims to be able to account for. 
Here, in addition to the demotion of Marxist and postcolo-
nial challenges to the laws of war, the most important lacuna
is the radical absence of the massive technological as well as 
sociological revolutions that have transformed the face of war 
as much as they have the very societies, locally and globally, 
that make war. The transformations wrought by industrialization
in the nineteenth and early twentieth century were matched
by those induced by the nuclearization of war after 1945. 

Currently, we are undergoing yet another revolution in 
military affairs. Driven by computer-mediated means of 
communication and information operating through dense 
systems of global and local surveillance, and premised upon 
the new ontology of information, this revolution presages 
dramatic changes in the conduct of war whose implications 
for the correlation of legal-illegal violence may be no less 
significant than those forced into international prominence 
by industrialization and nuclearization. The concept of a 
tradition of war is capable of addressing these questions but 
only if the concept of tradition itself is reworked in ways that 
do not confine it to the provenance from which this thesis 
took its inspiration. Ultimately, it is less the book than its 
pedigree whose limits are shown to be so interestingly if 
desperately narrow here. 


Rethinking European Order: West European Responses,
1989-1997. Edited by Robin Niblett and William Wallace.
New York: Palgrave, 2001. 298p. $75.00.


Stuart Croft, University of Birmingham


The volume is organized around three central hypotheses.
The first is that, faced with significant change in the way in
which policymakers understood the world after 1989, Euro- 
pean states—both large and small—reacted in a confused 
and ad hoc fashion. The leaders of major international 
organizations such as the European Union and NATO tend 
to present the last decade in terms of a coherent strategy and 
the evolution of policy. Niblett and Wallace, in contrast, 
suggest that overload and difference are more appropriate
prisms.

Second, policy and academic study became separated in 
the early 1990s. Faced with the failure to predict the end of
the Cold War, American academia focused on broad visions 
of the end of history, or the clash of civilizations. European 
(and especially British) academia concerned itself with epis- 
temological debate. In this, “real” policy issues—institutional 
enlargement, relations with Russia—were sidelined. Niblett 
and Wallace hypothesize that academia focused on policy can 
make a real contribution to coherence in the development of 
state and intergovernmental organization strategy. 

Third, the editors hypothesize that the influence of the 
United States, both in policy terms and in setting intellectual 
trends in academia, dominated the creation and development 
of specifically European conceptions and responses to the
changing perception of the world. The subtitle of the volume 
is illustrative in this regard. 

This book is not concerned with theoretical development 
or the application of theoretical insight to particular case 
studies. Rather, it is an attempt to understand West Euro- 
pean policymaking in a crucial period. Two chapters establish 
a framework. William Wallace is concerned with outlining a 
notion of regional order and the effect of ideas upon such 
notions. He identifies a paradox, a fundamental transforma- 
tion of the continent’s security order. Western Europe has 
continued to focus on its own parochial concerns at a time 
when the eastern part of the continent has undergone 
significant political, economic, and social change. Anthony 
Forster and Robin Niblett take on this theme by tracing 
different conceptions of Europe as they developed through
the 1990s. 

Successive chapters cover national policy in Germany, 
France, and Britain, of course, and then Dutch, Swedish, 
Spanish, and Italian developments. These are collectively of 
a high value and establish with care the empirical detail of 
policy debates, initiatives, and responses in the period under 
review. Hartmut Mayer suggests that the German debate was 
shaped by the need to be intellectually “early on the beach” 
(a European, or certainly a British, image of German tourist 
behavior at European holiday resorts), an attempt to “lead 
without anyone realizing it” (as politician Karl Lammers put 
it, see pp. 80-1). Niblett’s analysis of French policy is framed 
by the argument that France was a beneficiary of Cold War 
structures. Yet, the failure of Mitterrand’s promotion of a 
nationalist “autre politique” in 1981-83 convinced policymak- 
ers of the centrality of building a European pole of power 
with Germany without allowing deep infringements on 
French sovereignty. The power of this contradictory conven- 
tional wisdom lies unchallenged among the elite, for fear of 
provoking a public reaction similar to that which led to the 
referendum rejection of the Maastricht Treaty. Forster and 
Wallace argue that the end of the Cold War provoked an 
emotional debate in the United Kingdom, which focused on 
the future of Germany but actually represented an articula- 
tion of concerns over the nature of British identity. 

For Steven Everts, the 1990s represented a period when 
the Netherlands adjusted to a more continental and less 
Atlanticist orientation—as he puts it, “adjusting to Germa- 
ny.” Lisbeth Aggestam plots the reconstruction of Swedish 
neutrality. Niblett explains the consistency in Spanish Euro- 
pean policy since the return of democracy, and Christopher 
Hill and Filippo Andreatta explain the challenges both to 
Italy’s confidence as a foreign policy actor and reliance on an 
intellectual construction of European integration as the core 
of its external identity. This picture of incremental adjust- 
ment, perhaps of incoherent response, is mapped further in 
Wallace’s concluding chapter. 

The volume provides a wealth of empirical detail about the 
nature of European policy during the early and middle 1990s. 
It does not simply focus on the major powers in the European 
context but gives a more rounded view of the nature of the 
West European discourse at this crucial historical juncture. 
Three sets of questions emerge from the analysis. 

First, what is to be said about the relative influence of 
different states in the process of rethinking European order? 
We take it as a core assumption that states—or discourses 
within states—are key ordering priorities for the authors. But 
what is the relative difference between France (or French 
discourse) and, say, Sweden (or Swedish discourse)? What 
are the mechanisms of influence, and how does one assess 
which is more active in a particular policy area? There is a 
sense from the structure of the volume that each is equally 
important; indeed, each national debate has very specific 
characteristics that make it different from others. All of that 
may be persuasive, but it does not help the analyst under- 
stand how West European responses, writ large, are arrived 
at. Following from this, there are silences that are not 
explained. Why a Dutch chapter, not a Portuguese? Why 
Sweden and not Austria? And so on. 

Second, what is “Western Europe” in this case? It is often 
a label that means members of the European Union, partic- 
ularly as many of the country analyses argue persuasively that 
foreign policy is constructed through conceptions of Euro- 
pean integration. But what, for the analysis of this volume, is 
the effect of the EU institutions? The European Commission, 
for example, is only mentioned in passing, but it has key 
responsibilities in defining European order, not least in the 
process of enlargement. 

Third, we do not learn too much on the “why” question. 
Why did Western Europe respond in the ways that are so 
carefully outlined? Why were there no European blueprints? 
On this question, why did Western Europe not interact in 
different ways with the United States (and, for that matter, 
with Russia)? This is a strong volume that describes what was 
going on, but within certain very specific parameters. 


The Asian Financial Crisis and the Architecture of Global 
Finance. Edited by Gregory Noble and John Ravenhill. 
New York: Cambridge University Press, 2000. 310p. $64.95 
cloth, $22.95 paper. 


Ethan B. Kapstein, Insead Business School 


The problem of how social organizations achieve stability has 
occupied political scientists since the very beginning of the 
discipline. For this reason, much literature is devoted to 
analyzing consequential episodes of instability—wars, revo- 
lutions, and political and economic crises. In the subfield of 
international relations, the search for stability brings with it 
particular theoretical and empirical challenges, given the 
absence of centralized, authoritative centers for conflict 
management. 

This edited volume is a noteworthy addition to the study of 
instability in the global financial system and the domestic and 
international efforts by governments to shore up that system. 
Written on the heels of the Asian financial crisis, the text has 
an immediacy that is refreshing, but sacrifices little in terms 
of analytical rigor. The editors and contributors are to be 
commended for their ability to look at a recent crisis with 
some sense of detachment and produce a work of real value 
to scholars of international political economy. 

During the Asian financial crisis of 1997-98, the Interna- 
tional Monetary Fund (IMF) and other international orga- 
nizations generated a wave of studies on the weaknesses in 
regional financial markets and why these led to a flight to 
safety by creditors. In the aftermath, U.S. Treasury Secretary 
Robert Rubin and other public officials proclaimed the 
necessity to reform and strengthen the global financial archi- 
tecture, in particular the need for greater transparency by 
debtor countries, stronger national regulatory regimes, and 
closer harmonization of those regimes at the international 
level. Scholars and pundits joined in these debates about 
causes, consequences, and reforms, but the emphasis was 
overwhelmingly on economic and technocratic issues associ- 
ated with the governance of financial markets. 

Missing from all this discourse was much in the way of 
political analysis, and it is this gap that the Noble-Ravenhill 
volume ably fills. The editors and authors examine the 
political institutions that govern financial flows at the na- 
tional and international levels as well as how these institu- 
tions channel interest group preferences and constrain the 
political choices available to leaders. The underlying argu- 
ment is that “politics matter” because decisions about such 
policies as a nation’s relative openness to international 
capital flows, patterns of credit allocation, and banking 
regulations are made in a heavily political context. 

As an example, consider two of the countries most 
affected by the crisis, Korea and Thailand. Stephan Haggard 
and Andrew MacIntyre argue (p. 57) that “institutional 
arrangements in the two countries rendered them vulnerable 
to problems of policy incoherence.” Thailand’s flawed par- 
liamentary system and Korea’s weak presidential system 
made it difficult to respond decisively when the crisis hit. In 
the discussion of Taiwan, in contrast, Noble and Ravenhill 
ascribe that country’s ability to weather the storm with 
minimal damage to a political history in which a dominant 
party created strong economic institutions and policies, which 
enabled the country to sacrifice “growth to stability” (p. 102). 

At the international level, as Miles Kahler illustrates in his 
chapter, proposals for a “new global financial architecture” 
were also made within the context of governance structures 
that could not be easily changed; any reforms would only be 
made on the margins, at best. Although there were some 
prominent calls for radical reform of institutions, notably for 
dismantling the IMF by the troika of George Schultz, Walter 
Wriston, and William Simon, these ultimately carried little 
weight with public officials around the world. Instead, most of 
the proposed changes to the IMF were relatively minor in 
import, and some of them, such as the renewed focus on crisis 
prevention, may only lift that organization’s importance and 
profile in global financial affairs over time. Indeed, to the 
extent that there has been serious discussion of financial 
reform by public officials and international bureaucrats, it has 
focused on domestic rather than multilateral regulatory 

The Asian financial crisis forced officials and scholars to 
ask once again why it is so difficult to design a stable global 
financial system. It seems that the world has experienced an 
endless series of financial crises since the collapse of Bretton 
Woods in 1971, each time creating global recession and in 
particular countries a prolonged period of negative growth, 
high unemployment, and rising poverty. This book empha- 
sizes the domestic political conditions that make financial 
instability in certain countries more or less likely as well as 
the constraints on the international system that make it 
incapable of doing much more than the current set of 
institutions and agreements already offer. Overall, it appears 
that we are tinkering on the margins, and it is reasonable to 
assume that more financial crises lie ahead. This book will 
provide useful guidance for assessments of the next crisis, and 
in the meantime it will prove valuable to every student of the 
global economy. 


Honor, Symbols, and War. By Barry O'Neill. Ann Arbor: 
University of Michigan Press, 1999. 344p. $49.50 cloth, 
$19.95 paper. 


Scott Gates, Michigan State University and PRIO 


Symbolism is evident in the handshake between the leaders 
of two countries, the carefully chosen words of an official 
apology (or nonapology), the yellow ribbons of the Gulf War, 
and the rise and fall of the Berlin Wall. Indeed, international 
relations are rife with symbolism. Yet, despite their ubiquity, 
symbols have largely been dismissed as peripheral aspects of 
real international politics, not worthy of serious theory 
building. O’Neill demonstrates that they not only deserve a 
central role in theories of international relations but also are 
worthy of rigorous analysis. 

This delightful book weaves hundreds of histori- 
cal examples from international relations with such ancient 
poems and tales as Sir Gawain and the Green Knight and The 
Feast of Bricriu, as well as game theoretic analyses of symbols, 
honor, and social face. Rather than focus on the power 
politics of national interests, military capabilities, and alli- 
ances, or on different political and economic systems as 
explanations for the causes of war, O’Neill concentrates on 
the role of symbols. The central argument is that states still 
fight over issues of honor. Status regulators, which include 
honor, social face, prestige, and moral authority, rely on 
symbolism and, in turn, play a profound role in defining 
international relations. Game theoretic analysis is used to 
demonstrate in a rigorous manner how symbols and status 
regulators affect the choices made by foreign policy decision 
makers. O’Neill provides an analytically precise and theoret- 
ically rich examination of symbolism and its roles in interna- 
tional relations. 

Three categories of symbols are identified: message, focal, 
and value symbols. In May 1994, Nelson Mandela invited his 
white jailer to his inauguration as a gesture of “hope and 
reconciliation” (p. 6). This is an example of a message 
symbol. Focal symbols stem from events that induce observ- 
ers to expect a certain outcome in the game being played. 
The symbolism of an arms agreement, for example, may be 
more important than its actual content. A flag is a value 
symbol; it elicits strong attitudes but also exhibits a multiplic- 
ity of meanings. Symbols, in turn, play a role in determining 
the nature of challenges to honor, commitments based on 
honor, insults as assaults on face, commitments based on 
face, apologies, prestige, norms, and moral authority. All of 
these play a role in international relations. 

Are wars caused by insults and challenges to honor? 
O’Neill answers affirmatively and cites a number of examples 
to support his claim. He is careful in his distinction between 
challenges to honor and insults as assaults on face. A 
challenge to honor originates from a party of equal rank, and 
codes of honor dictate the nature of action taken in response. 
Face centers on hierarchy. Insults attack the recipient’s 
position in a society and require common beliefs as to what 
has transpired. O’Neill develops a convincing argument as to 
how the symbolism that affects honor and face can escalate 
hostilities that can lead to war. I do not believe other 
explanations for the cause of war should be disregarded, but 
it is clear that symbols, honor, and social face play a part. 

From a game-theoretic perspective, war—like any other 
outcome—results from actors’ choices, information, and pay- 
offs. Most applications of game theory to the study of war and 
conflict take a realist perspective, which features national 
interests and material resources. By focusing on the interac- 
tion of beliefs and choices regarding subjective assessments 
of probability and beliefs about other players’ beliefs, espe- 
cially as they relate to symbols, honor, social face, and 
challenges, O'Neill incorporates elements of the constructiv- 
ist perspective into his analysis. Discourse and perception 
become the featured aspects of international interactions. In 
this way, this book integrates aspects of realist and construc- 
tivist thought through the analytical tool of game theory. 

Despite the many strengths of this excellent book, there 
are a few shortcomings. The theory would have been 
strengthened by greater focus on the game theory employed 
for analysis. Glimpses of profound insight lie buried in 
footnotes or in stray passages but never receive the full 
attention they deserve. Indeed, the discussion of game theory 
and intangibles is relegated to an appendix. This discussion, 
especially with an elaboration of the role of common knowl- 
edge in communicative acts and the interpretation of sym- 
bols, should have appeared early in the main text to demon- 
strate the utility of the game-theoretic approach in this 
context. 

Another shortcoming is the lack of focus on rhetoric. 
O’Neill’s analysis of symbolic communications between na- 
tions is limited to direct interactions. Honor and social face 
are viewed as direct causes of war, but the use of symbolism 
and rhetoric to persuade a domestic audience and justify 
bellicose actions is not examined. Indeed, O’Neill dismisses 
that perspective and is intent on demonstrating that honor 
plays a direct role in causing war. Public references to 
“honor,” “national will,” “resolve,” “credibility,” “reputa- 
tion,” and “image of weakness” are not intended for foreign 
consumption alone! In 1990-91, when President Bush in- 
sulted Saddam Hussein before the Gulf War, his intended 
audience was not only the Iraqi but also the American public. 
Rhetoric, and specifically the symbolism that reflects honor 
and social face, plays an important role in rallying the general 
public in support of the actions taken by political leaders. 

This is an important and provocative book. Combining a 
rich set of real-world examples with rigorous game-theoretic 
analysis, O’Neill develops a convincing argument for taking 
symbolism into account when studying international rela- 
tions. His work justly deserves the 54th Annual Woodrow 
Wilson Foundation Book Award for the best book published 
on government, politics, or international relations. 


Marketing the American Creed Abroad: Diasporas in the 
U.S. and Their Homelands. By Yossi Shain. New York: 
Cambridge University Press, 1999. 294p. $54.95 cloth, 
$17.95 paper. 


Rodolfo O. de la Garza, Columbia University 


What is the national interest, and who defines it? These 
questions have become inextricably linked to the debate over 
how the dramatic increase in Asian, African, and Latin 
American immigration has affected national identity. These 
demographic and political changes, combined with globaliza- 
tion, transnationalism, and the increased demand by the 
leadership of the new ethnics to participate in foreign 
policymaking, have prompted scholars such as Samuel Hun- 
tington and Arthur Schlesinger, Jr., to warn against ethnic 
leaders who pursue narrowly defined country-of-origin inter- 
ests that undermine the national interest. 

Yossi Shain’s objective goes beyond challenging such 
claims to arguing that the increased involvement of ethnic 
leaders in foreign policy actually enhances the national 
interest. They not only support American core values, such as 
democracy and individual freedoms, but also strongly advo- 
cate that country-of-origin governments adopt core Ameri- 
can values as the price of improving relations with the United 
States and advancing economically. Because of their connec- 
tions to the home country, Shain argues, ethnic leaders are 
ideally situated to market the American creed. 

This work is of broad scope. To support his claim that 
ethnics abandon their home-country allegiances when these 
conflict with “American” interests, Shain reviews the histor- 
ical examples of the Germans, Irish, Japanese, and others. To 
address the current debate regarding the effect of immi- 
grants, he focuses on Arabs and Mexicans but also addresses 
the experiences of many other groups, such as Czechs, 
Slovaks, Koreans, and Cubans. His analysis emphasizes their 
concern to be recognized as legitimate participants in Amer- 
ican society. Absent such status they will experience social, 
political, and economic discrimination. Moreover, they will 
be incapable of influencing U.S. policy toward their country 
of origin, and they will suffer if relations between the United 
States and their homeland deteriorate or become openly 
hostile, as U.S.-Iran and U.S.-Iraq relations illustrate. 

The volume’s principal value is that it adds substantial new 
material to the well-known arguments developed by Louis 
Gerson, Alexander DeConde, and, most significantly, Law- 
rence Fuchs. The latter's influence is especially visible in 
Shain’s argument regarding the efforts by ethnics to establish 
their American credentials so that, if they engage in foreign 
policy, they do so as patriots rather than as citizens with 
divided loyalties. Shain brings that argument up to date by 
locating it within the context of a new international environ- 
ment. Specifically, he considers how states such as Mexico, 
which were historically antagonistic toward emigrants, now 
seek closer ties with emigrants in the hope that they will in 
turn help influence U.S. policies that are in line with home 
country interests. 

The study would have been greatly enhanced if the theo- 
retical foundations had been made explicit and the argument 
had been developed more systematically. Each of Shain’s 
cases uses some combination of variables—such as diaspora 
demographics, immigration history, home country percep- 
tions of the diaspora, home country regime type and inter- 
national status, and characteristics of home country relations 
with the United States—to explain how each example relates 
to Shain’s core proposition. Yet, the author does not system- 
atically weigh the relevance of each of these in his numerous 
case studies, and he does not rank order them in terms of 
their overall relevance to the general issue of diasporic 
foreign policy involvement. The result is a volume rich in 
detail but short on theory, which leaves unclear the signifi- 
cance of the case studies. 

The lack of theory makes it impossible to evaluate claims 
that all diasporas support core American values and promote 
them in their dealing with homeland governments. Such 
assertions implicitly challenge the utility of other literatures 
on diasporic foreign policy involvement, such as those on 
transnationalism, domestic sources of foreign policy, and 
socioeconomic assimilation. Shain seems to be calling for a 
new approach, but he never explicitly delineates it. 

Because the case studies identify multiple factors that 
affect a diaspora’s foreign policy involvement, Shain’s asser- 
tion that all groups ultimately promote American democratic 
values in the home country is both empirically questionable 
and theoretically puzzling. Empirically, his own examples 
counter his claim. For example, Mexican American (as 
contrasted with Mexican immigrant) leaders have never 
publicly promoted democratic reform in Mexico. Similarly, 
his description of the attitudes of mainstream Arab Ameri- 
cans as compared to rightist and leftist Muslims toward the 
United States and its Middle East policy indicates that many 
leaders in this diaspora neither embrace core American 
values nor promote them in the homeland. Shain also does 
not document that promoting democracy and related values 
is vital to the Chinese diaspora. 

Theoretically, the argument suggests that none of the 
variables considered in the case studies have any ultimate 
effect on diasporic foreign policy activities, which is a dubious 
assertion. This may be most obvious with regard to the effects 
of time. How long must immigrants be settled before they act 
as Shain describes? Is it years or generations? Do the effects 
of time vary as a function of the characteristics of the home 
country regime? Of the social capital with which immigrants 
arrive? With how they are received? The three groups cited 
above differ in terms of history, demographics, ideology, and 
home country regime type, and they all act counter to Shain’s 
assertion regarding the promotion of democratic practices in 
the homeland. What explains this? In addition, Shain argues 
that all groups must claim they are acting in the national 
interest when they engage foreign policy. Does this suggest 
that often such claims are made only for strategic reasons? 
These are some of the issues that the study fails to address 
because of its atheoretical approach. 

With the possible exception of the Muslim case, Shain’s 
examples (predictably) rebut the charges that the involve- 
ment of new immigrants in foreign policy undermines the 
national interest. He should have noted, however, that dias- 
poras have a right to participate in public debates on the 
national interest, and, depending on a variety of factors, may 
support significant changes in American foreign policy. In 
other words, Shain should have acknowledged that some of 
these new actors may offer a different vision of the national 
interest, but he should have explicitly challenged the argu- 
ment that such competing perspectives undermine the na- 
tional interest by noting that all citizens have the right to 
challenge extant definitions of the national interest, either 
through public debates or congressional lobbying. Those who 
would deny that right to diasporas in effect seem to be calling 
for a return to the undemocratic decision-making processes 
of the past that are increasingly unacceptable to the Ameri- 
can public. 

In conclusion, this study offers a detailed description of 
diasporic involvement in foreign policy. It would have been 
more valuable if it also had been theoretically grounded. 


Theory of the Global State: Globality as an Unfinished 
Revolution. By Martin Shaw. Cambridge: Cambridge Uni- 
versity Press, 2000. 295p. $59.95 cloth, $21.95 paper. 

William I. Robinson, University of California, Santa Barbara 


At a time when globalization has thrown traditional social 
science paradigms into a quandary, one would expect inter- 
national relations (IR) scholars to question the continued 
centrality of state-based explanations to worldwide social 
phenomena. Martin Shaw is acutely aware of the limitations 
of mainstream social science in the face of emergent global- 
ity. Curiously, however, he presents a study that attacks the 
problem of the limits to state centrism with a thesis that 
retains the state as the center of analysis. Globalization has 
made clear the untenability of subsuming an array of social, 
economic, cultural, political, and ideological processes and 
forces under the statist conception of social life central to 
traditional IR approaches. Yet, this is exactly what Shaw does 
in his attempt to develop a theory of the global state. 

To be sure, this is an important and innovative study. Shaw 
argues that we are undergoing a period of historic transfor- 
mation such that the “global” needs a “core meaning” of its 
own (p. 5). The global should be understood through three 
major concepts: globality, global revolution, and the global 


` state. With this triad Shaw wants to move beyond typical 


globalization theories that stress transformed conceptions of 
time and space in social relations. He finds the notion of 
globalization as process lacking and prefers to identify the 
phenomenon as a state or condition, hence globality. He 
concedes that transformation of time and space has recast 
social relations, but for him new social structures do not 
constitute the essence of globality; rather, it is the new social 
meaning these transformations involve, in particular “the 
development of a common consciousness of human society on 
a world scale” (pp. 11-2, emphasis in original). This consti- 
tutes the idealist core of his theory of the global state, in 
which social relations “become global when they are signifi- 
cantly and systematically informed by an awareness of the 
common framework of worldwide human society” (p. 12). 
How should we conceive of the current period of world- 
wide transformation? Postmodern approaches see cultural 
change as causal, says Shaw, whereas post-Cold War ap- 
proaches emphasize political and military transition, and 
theories of globalization underscore technical and economic 
change. These dimensions, he claims, are subordinate to the 
central driving dynamic of our time, the emergence of a 
global state. The global revolution is alternatively conceived 
by Shaw as the rise of this global state (“so that the political 
structure of social relations on a world scale has fundamentally 
changed,” p. 17, emphasis in original) and as the spread of 
democracy in global society (the “global democratic revolu- 
tion”). Indeed, he suggests that the “democratic revolution” 
centered in the former Soviet bloc paved the way for the rise 
of a global state. “In contrast to those accounts of globality 
which plot the extending worldwide web of economic and 
cultural linkages, I focus on the traumatic political upheavals 
(the global democratic revolution)” at the center of globality 
as the causal factor in the transition to a “global state-system” 
(p. 145). 

There is a circularity here that indicates the underlying 
problem of the lack of an identifiable historical determina- 
tion in the globalist outcome. Economic causality is dismissed 
as “economism,” and the causality of social forces is likewise 
dismissed as “sociologism.” Instead, globality should be 
grasped as primarily statist (p. 81), and explanations of the 
current epoch should focus on “the state core of contempo- 
rary globality” (p. 90). But how does Shaw reconcile the 
antinomy of his idealist definition of globality as conscious- 
ness with his material emphasis on the rise of a global state? 
What ultimately drives global state formation is the incom- 
plete process whereby “global consciousness” demands 
“global state relations” (pp. 20-1). This becomes a circular 
argument, since in Shaw’s formulation global state relations 
are said to have shaped global consciousness. 

Shaw then proceeds to adopt Michael Mann’s neo-Webe- 
rian definition of the state as a territorially demarcated, 
differentiated set of institutions and personnel with a center 
that exercises authoritative rulemaking backed up by force. 
In order to arrive at his conception of what exactly comprises 
the global state, Shaw adds the qualification that a particular 
power center must be “inclusive and constitutive of other 
forms or layers of state power” (p. 190). With this qualifica- 
tion, he can conceive of a global state as a higher layer of 
state power constituted by, but superseding, national states, 
such that the latter become components of a larger entity. 
This superior layer of state power is a “global Western state,” 
territorially bound and comprised of the North Atlantic bloc 
plus Japan, Australia, and New Zealand. This configuration 
of core capitalist states constitutes a global state, he suggests, 
because it is organized into a military alliance and because its 
rulemaking backed up by force extends to the entire interna- 
tional system. Despite Shaw’s formal rejection of realism, we 
seem to have here a neorealist theory of hegemonic stability 
account of the capitalist core as a new hegemon within an 
otherwise anarchic system of states and state blocs. 

Indeed, Shaw identifies three types of states in the current 
global order. The global Western state is “an integrated 
authoritative organization of violence which includes a large 
number of both juridically defined states and international 
interstate organizations” and “functions as a single center of 
military state power in relation to other centers” (pp. 199- 
200). The “quasi-imperial nation-states” are exemplified by 
“numerous major non-Western states, across Asia, Latin 
America, Africa and the Middle East.” The “new proto- and 
quasi-states” include autonomous provinces and districts in 
what was formerly the Soviet bloc or the contested and 
tenuous states of a number of African nations. 

“The power relations between states of these three types 
are crucial to understanding world order” (p. 213). Through 
transformed relations within and among these different state 
blocs, “external relations between states are transformed into 
the internal relations of new state power... complex global 
concentrations of state power [... ] have actually come into 
existence” in a form other than an “uncomplicated center of 
state power, a singular sovereign” (p. 92). This is a crucial 
insight into our understanding of transnational state forma- 
tion, and it constitutes a major contribution of the study. But 
why is Shaw so concerned with placing the state at the center 
of globality? A global society, economy, and culture can be 
said to exist “to the extent that global patterns of relations 
have become inclusive and constitutive of these (social, 
economic, cultural) relations in general” (pp. 177-8). In turn, 
in Shaw’s account, these social relations are seen as sub- 
sumed by (or at best, organized by and around) state 
structures. The state drives and organizes these relations, 
which for Shaw are derived from state forms. 

I do not want to negate the importance of this work, 
especially since it explores uncharted terrain, but neverthe- 
less it remains inconclusive, contradictory, and deeply trou- 
bling on several counts. First, Shaw eschews political econ- 
omy as an instrument of analysis and is explicit in his hostility 
to the “theory of capitalism” (p. 42). Hence, entirely absent is 
any discussion of economic globalization, of capitalism as a 
global system, or of class groups, elites, and other social 
forces in the current period of transformation. World system 
theory is dismissed out of hand, and there is no engagement 
with the burgeoning literature on international political 
economy (IPE). This rejection of political economy is symptomatic
of an underlying problem: There is no consistent
theoretical approach or any systematic application of analyt- 
ical precepts. Shaw’s account of the global state is con- 
structed through an eclectic admixture of neo-Weberian, 
realist, liberal, postmodern, and institutional analysis. His 
core concepts are never fully unpacked or operationalized 
and hence remain undertheorized.

Second, Shaw thoroughly reifies the state as a macrosocial 
agent sui generis. Despite his insistence that the current 
epoch cannot be conceived in terms of the old nation-state 
categories of social science, his study is ultimately—and 
thoroughly—a state-centered and realist account. Shaw describes
the global as a “global interstate world” (p. 16,
emphasis added), in which globality is reduced to a reorga- 
nization of the historical interstate system of discrete nation- 
states. The transition from the “bloc-state system” of the 
Cold War into an “incipient global state-system” is the key 
transformation of our epoch. Shaw develops the insightful 
notion that globality involves moving beyond the “ordering 
principle” of nationality-internationality as “a historically 
specific structure of state-society relationship” (p. 28), but it 
is not clear how his own account does in fact move us beyond 
this principle. 

Third, the structure of Shaw’s thesis is normative and 
ideological. The global Western state is equated with a 
“progressive” and benign version of Kautsky’s “ultra imperi- 
alism,” a heroic bloc championing democracy and civility 
worldwide (p. 161), in contrast to “major non-Western 
states,” which are prone to intra- and interstate warfare, 
authoritarianism, lawlessness, and “ruthless discrimination 
against peripheral, especially indigenous, peoples” (p. 209). 
The Third World is largely ignored, aside from sweeping 
generalizations about Africa, Asia, and Latin America, where 
social change in the epoch of globality is reduced to a struggle 
between forces of “democratic change and violent reaction” 
(p. 168). The narrative is laced with strong doses of crude 
anticommunism. Adjectives such as “brutal,” “genocidal,” 
“illegitimate,” “totalitarian,” “stagnant,” “bureaucratic,” and 
“repressive”—to mention but a few—abound in Shaw’s discussion
of the former Soviet Union and detract from a more
objective narrative. 

Conspicuously absent is the historic role of Western colo- 
nialism and imperialism in contributing to current global 
structures. Since theories of global capitalism, the world-system,
or IPE are dismissed, linkages between the non-Western
world and a larger global system dominated by a
Western capitalist core play no causal role or are not 
accorded any determination or explanatory value in the 
condition of both the Western and the non-Western worlds. 
There is little recognition by Shaw of the deep and rising 
racial, gender, and class inequalities; escalating polarization 
between the global poor and the global rich; unprecedented 
levels of deprivation, marginality, injustice, and militarized 
borders; and the concentration of political power in emergent 
transnational elites that so perforate the brave new world of 
global capitalism. Shaw need not agree with this more critical 
view of emergent global society, but his one-sided enthusiasm 
is not convincing. In this regard, it is difficult to avoid 
concluding that this is a story of the West and the rest; a 
renovated colonial tale of Western superiority and the “peo- 
ples without history” that provides a comforting theoretical 
rationale for the emergent global capitalist order. 


Limited Adversaries: Post-Cold War Sino-American Mutual 
Images. By Jianwei Wang. Oxford and New York: Oxford 
University Press, 2000. 305p. $35.00. 


John W. Garver, Georgia Institute of Technology 


This interesting and solid study of Chinese and American 
mutual elite perceptions is based on 268 interviews con- 
ducted during 1990-92. Interviewees fell into three catego- 
ries: academics largely involved in study of the other country, 
diplomats, and business people involved in commercial activ- 
ities with the other country. The views of these elites were 
solicited via a standardized set of 66 questions probing the 
views of the other country’s people, society, culture, and 
international behavior. Respondents were also asked to 
describe how their views had changed since about 1970, and 
especially how they had been affected by the Beijing massacre 
of June 1989. Wang concludes that the effect of June 1989 
was far less than is often asserted, based on the relatively low 
change in perceptions reported by his respondents. Wang is 
well aware of the methodological problems with such an 
approach—the tendency of people to conclude post hoc that 
“I saw it coming”—but he believes his findings are valid. He 
also acknowledges that elite perceptions, formed on the basis 
of greater information and personal interaction, may well 
differ from mass perceptions. 

Wang finds that elite images on each side are not simple or 
unidimensional but fairly complex. Elites hold positive mirror
images of the “people” of the other. The Americans tend to 
see Chinese as open, having a sense of humor, resourceful, 
and intelligent. The Chinese tend to see Americans as open,
direct, honest, enterprising, and pragmatic. Both sides see the 
other as considerably more emotionally and intellectually 
accessible than the people of other nations—Russians, Jap- 
anese, or Europeans. Yet, there are negative views. The 
Americans simultaneously see Chinese as poor and politically 
impotent, and the Chinese see Americans as shallow, dissolute,
and arrogant. Wang suggests that the perceived similarities
between Chinese and Americans may make stable
Sino-American cooperation difficult by covering up deep, 
underlying differences in actual national character. Initial 
establishment of rapport is easy because of perceived simi- 
larities, but disillusionment may set in once underlying 
differences surface. 

Wang finds asymmetrical perceptions regarding “society” 
and “culture.” Chinese elites tend to view the U.S. system of 
government positively, stressing its political stability, effi- 
ciency, ability to incorporate a range of diverse views, and 
effective central-local relations. American elites, interviewed
not long after the events of June 1989, viewed the Chinese 
government with disgust. Chinese elites were repelled by 
intractable American social problems and seeming governmental
indifference to social injustice, but they judged positively
the American work ethic, sense of personal responsibility,
and spirit of entrepreneurship. American elites expressed
admiration for Chinese culture but made clear that
this referred to China’s traditional culture, not to the culture 
of contemporary, communist-ruled, post-1949 China. 

Regarding international behavior, Wang again finds mirror 
images: Elites in both countries tend to see the other as a 
normal power pursuing national interests. There was una- 
nimity among Chinese elites regarding the preponderant 
nature of U.S. power, but different normative valuations were 
associated with that power. Some Chinese see the United 
States as inspired by hegemonic motives and attempting to 
establish a U.S.-dominated unipolar world order. Others 
evince sympathy and understanding for U.S. global leader- 
ship. Effective resolution of global problems requires a 
leader, according to this Chinese view, and only the United 
States has adequate “comprehensive power” to play a lead- 
ership role. Moreover, the substantive content of U.S. lead- 
ership generally accords with China’s own interests. Some 
Chinese even went so far as to suggest that the United States 
is a relatively benign great power, comparable to China 
during its periods of premodern greatness in East Asia. 

There is a similar bifurcation of perceptions on the U.S. 
side. There is basic unanimity that China does not constitute 
a direct military threat to the United States. Beyond that, 
perceptions split. Some of the Americans view China as a 
nonthreatening power that is gradually joining the interna- 
tional mainstream. Others see it as a deviant from universal 
norms of behavior, both domestic and international. As such,
China is often disruptive and a source of international 

On the basis of the considerable area of shared benign and 
positive images, Wang concludes that the two countries are 
not “enemies,” only “limited adversaries.” His presentation is 
cast in terms of the deterioration in relations that occurred 
during the 1990s, when strong voices in both countries 
proclaimed the other as an enemy. Such a black-and-white 
view does not accord with the complex, sophisticated elite 
perceptions uncovered by the interviews, Wang maintains. 
Most elites in both countries do not regard the other as 
threatening or as an outlaw country, and they are very much 
aware of the interdependence of interest and consequent 
need for cooperation. Thus, the two countries are “cooper- 
ating adversaries,” neither enemies nor allies, but normal 
powers with a range of convergent as well as conflicting 
interests. 

Wang has some very interesting things to say about how 
various well-intentioned U.S. actions in the 1990s, intended 
to demonstrate U.S. sympathy with the Chinese people, were 
viewed in China as pressure, arrogance, and intimidation. He 
asserts that these cognitive processes have not transformed 
the psychology of the Sino-American relationship into a 
perception of “enemies.” One is still left with the fact that 
many negative events occurred after Wang’s interviews: the 
imbroglio over the 2000 Olympics, the linking and delinking 
of most-favored-nation status, Lee Teng-hui’s visit to the 
United States and the Straits confrontation, the bombing of 
the Chinese embassy in Belgrade and the besieging of the 
U.S. one, the spy plane incident, and so on. Assertions of 
faith, authority, and hope aside, we simply cannot know how 
much these events have altered the relatively positive elite 
perceptions Wang found. His study provides a solid benchmark
for measuring subsequent change.


Constructing National Interests: The United States and the
Cuban Missile Crisis. By Jutta Weldes. Minneapolis: 
University of Minnesota Press, 1999. 316p. $47.95 cloth, 
$18.95 paper. 

Aaron Belkin, University of California, Santa Barbara

The Cuban missile crisis may be one of the most studied
(overstudied?) cases in the literature on international relations.
Theorists of almost every stripe have turned to it for
evidence, and a brief search through the University of
California’s on-line library catalog indicates 83 books and 103
journal articles on the case. Despite the abundance of
scholarship, however, Jutta Weldes has written a fresh and
compelling book, one that no scholar of foreign policy crises 
should skip. 

Weldes begins with an interesting question (pp. 2-3): 
“How do we get from the Soviet missile deployment in Cuba 
to the Cuban missile crisis?” After the Kennedy administra- 
tion discovered Nikita Khrushchev’s attempts to sneak me- 
dium-range ballistic missiles into Cuba, the American inter- 
est in getting rid of them was self-evident, and the White 
House never considered inaction to be a viable response. 
Although theorists have spent millions of foundation dollars 
investigating Soviet motives, no one has asked why the 
administration believed it could not ignore the situation. The 
answer is nonobvious and essential, Weldes suggests, for 
understanding the origins of the crisis. 

Why did American national interest require removal of the 
Soviet missiles from Cuba? Weldes argues that the unthink- 
ability of ignoring the deployment resulted from “the con- 
stant, numbing repetition of the same stock phrases and 
descriptions,” repetitions that “contributed to the reception 
of these representations as common sense” (pp. 226-7). 
These stock phrases (e.g., “the Soviet Union is bent on world 
domination”) were the linguistic effects of the “security 
imaginary,” a set of ideas about entities that populated the 
international system as well as relations among them. The 
postwar American security imaginary depicted the United 
States as the defender of freedom, while appeasement, Soviet 
duplicity, the creeping subversion of international commu- 
nism, and the Cuban revolution were figured as threats. 
Security imaginaries clarify who we are and who our enemies 
are and how and why they threaten us. 

Two processes—articulation and interpellation—explain 
how the national interest is stitched together out of the 
security imaginary. Articulation refers to the process by 
which “different terms and ideas come to ‘summon’ one 
another” (p. 98). For example, “references to Castro and his 
revolutionary associates were persistently articulated to the 
adjective ‘bearded.’ ” Invoking Castro’s beard implied that he 
was “irresponsible, uncivilized, and a danger to the United 
States” (p. 98). Weldes argues that the national interest 
draws on numerous articulations (“communist,” “aggres- 
sion,” and so on) and that, through repetition, these pairings 
can come to appear natural and part of common sense. 

Interpellation refers to processes by which “identities are 
created and concrete individuals are interpellated by, or 
‘hailed’ into those subject positions” (p. 103). “Out of an 
abstraction designating a territory, a population, and a set of 
governing principles and apparatuses is created an anthropomorphization,
the fiction of an apparently acting subject
with motives and interests” (p. 104). Weldes shows that 
postwar American identity came to incorporate several overlapping
ideals, including leadership, defense of freedom, and
the maintenance of credibility. The construction of U.S.
subjectivity helps explain why President Kennedy could not 
imagine ignoring the missiles, as failure to remove them
would have undermined the core of American identity. 

This study may call into question some of the value of 
applying scientific methods to the analysis of international 
crises. Certainly, the Cuban missile crisis can be conceptual- 
ized as an instance of a broader pattern of interactive events 
whose predictors and internal logic might be discovered 
through scientifically disciplined inquiry. It seems unreason- 
able to suggest, to take one of many possible examples, that 
there is nothing at stake in efforts to determine whether the 
missile crisis is best viewed as a prisoner’s dilemma or a failed
bluff (Glenn H. Snyder and Paul Diesing, Conflict among 
Nations, 1977, p. 114). But the reader comes away from
Weldes’s analysis suspecting that an appreciation of idiosyn- 
crasy probably is essential for understanding what matters 
most about the story. 

Of course, Weldes does not try to answer every important 
question about the origins of the Cuban missile crisis. She
acknowledges thoughtfully that her emphasis on socially 
constructed aspects of international relations is not intended 
to imply that politics are arbitrary, and she says that power 
relations constrained the range of possible outcomes (p. 102). 
Perhaps in future work she and other scholars will explore 
interactions among reality constraints that are grounded in 
power and socially constructed national interests. 

The findings of this study gesture ironically at a compli- 
cated set of relationships between scholarship and decision 
making. Motivated by the best of intentions, scholars who 
study international crises have spilled more ink formulating 
recommendations for policymakers than perhaps any other 
subset of the discipline. For better or worse, few if any of 
these recommendations appear to have been taken seriously 
in Washington (Richard K. Betts, “Should Strategic Studies 
Survive?” World Politics 50 [October 1997]: 7-34). Yet, 
Weldes suggests that scholarship did have an important 
although possibly unintended influence on policy during the 
Cuban missile crisis. “By authoritatively defining the real, 
[international relations theory] removed from critical analysis 
and political debate what was in fact a set of socially
constructed representations” and “led to the reception... of 
one particular U.S. national interest (that of securing the 
removal of Soviet missiles from Cuba) as common sense” (p. 
241). Weldes is not the first to notice this phenomenon, but 
her expertise on the origins of the national interest lends 
weight to the argument. 

Another ironic implication of the study is that American
decision makers may have had very little freedom of choice. 
Weldes and other critical theorists, such as Alexander Wendt, 
argue convincingly that individuals shape as well as respond 
to structure. If the missiles’ meaning was socially constructed, 
perhaps Kennedy and his advisors could have interpreted the 
situation differently. Weldes does such a convincing job of 
linking the Cold War vilification of Soviet communism to the 
administration’s belief that it could not ignore the missiles,
however, that one wonders whether the White House could 
have imagined the weapons as anything but a metaphor for 
the East-West rivalry. Perhaps the president and his advisors 
could have chosen to understand the deployment in a differ- 
ent way, but they would have been swimming against a very 
strong discursive current that was only partially of their own 
making. What a terrifying thought to consider.